Secondary Structure / Gap Penalty Masks

 

Secondary structure-based penalties ´Â sequence alignmentÀÇ Á¤È®¼ºÀ» ÁõÁø½ÃŰ´Â °ÍÀ¸·Î ¾Ë·ÁÁ®ÀÖ´Ù. Clustal X´Â ÀÌÁ¦ profile alignment¿¡ »ç¿ëµÇ´Â ÀԷ¼­¿­µé¿¡ secondary structure/ gap penalty masks¸¦ Á¦°øÇϰí ÀÖ´Ù.  (ÁÖÀÇ, secondary structure informationÀº multiple sequence alignment¿¡¼­´Â »ç¿ëµÇÁö ¸øÇÑ´Ù). MaskµéÀº ƯÁ¤ÇÑ ¿µ¿ªµé (typically secondary structure elements)¿¡¼­ gap penaltyµéÀ» ¿Ã¸²À¸·Î½á gapµéÀÌ ´ú º¸Á¸µÈ ¿µ¿ªµé (typically surface loops)¿¡ »ý±âµµ·Ï ÇÑ´Ù.

 

USE PROFILE 1(2) SECONDARY STRUCTURE / GAP PENALTY MASK optionsÀº profile alignment µ¿¾È ÀÔ·ÂÇÑ 2D-±¸Á¶ Á¤º¸ ¶Ç´Â gap penalty maskµéÀÌ  »ç¿ëµÇ¾îÁú °ÍÀÎÁö¸¦ Á¶ÀýÇÑ´Ù.

 

OUTPUT optionsÀº Clustal X output alignments¿¡ secondary structure¿Í gap penalty masksÀÌ Æ÷Ç﵃ °ÍÀÎÁö¸¦ Á¶ÀýÇÑ´Ù. µÎ °³¸¦ ´Ù º¸¿©ÁÖ´Â °ÍÀº masks°¡ ¾î¶»°Ô ÀÛ¿ëÇÏ´ÂÁö¸¦ ÀÌÇØÇϴµ¥ À¯¿ëÇÏ´Ù. 2D-structure information´Â ±× ÀÚü·Î alignment quality¸¦ Æò°¡ÇÏ°í ¾î¶»°Ô residue conservation patternsÀÌ ÀÌÂ÷±¸Á¶¿¡ µû¶ó º¯ÇÏ´ÂÁö¸¦ º¸´Âµ¥ À¯¿ëÇÏ´Ù.

 

HELIX and STRAND GAP PENALTY optionsÀº  core Alpha Helical (A) and Beta Strand (B) residues¿¡ gap penalty¸¦ ¿Ã¸®´Âµ¥ ÇÊ¿äÇÑ °ªÀ» Á¦°øÇÑ´Ù.  CLUSTAL format¿¡¼­ capital residues´Â A and B core structure Ç¥½Ã¸¦ ³ªÅ¸³½´Ù. Basic gap penalties´Â ÁöÁ¤µÈ ¾ç¿¡ ÀÇÇØ °öÇØÁø´Ù.

 

LOOP GAP PENALTY optionÀº Loopµé¿¡¼­ gap penalty¿¡ ´ëÇÑ °ªÀ» Á¦°øÇÑ´Ù. ÀÌ penalty´Â ¿Ã¸®Áö ¸øÇϵµ·Ï ¼³Á¤µÇ¾î ÀÖ´Ù. CLUSTAL format¿¡¼­ loopµéÀº secondary structure Ç¥½Ã¿¡¼­ "."·Î ÁöÁ¤µÈ´Ù.

 

SECONDARY STRUCTURE TERMINAL PENALTY ´Â 2Â÷±¸Á¶ÀÇ ¸»´Üµé¿¡ gap penalty¸¦ Á¤ÇÏ´Â °ªµéÀ» Á¦°øÇÑ´Ù. ÀÌÂ÷±¸Á¶ÀÇ ¸»´ÜµéÀº ´Ù¸¥ ±¸Á¶µé°ú ºñ±³ÇÒ ¶§ ±æ¾îÁö°Å³ª ª¾ÆÁö´Â °ÍÀ¸·Î ¾Ë·ÁÁ® ÀÖ´Ù. µû¶ó¼­ core penaltyµéº¸´Ù´Â ³·Àº Áß°£ °ªÀÌ ¼³Á¤µÇ¾î ÀÖ´Ù. CLUSTAL format¿¡¼­ ¼Ò¹®ÀÚ·Î ÀÐÇôÁö´Â ¸ðµç ÀÌÂ÷±¸Á¶´Â °¨¼ÒµÈ terminal penalty¸¦ °¡Áø´Ù.

 

HELIX and STRAND TERMINAL POSITIONS optionsÀº Áß°£ penaltyµéÀ» °¡Áö´Â ±¸Á¶ ¸»´ÜµéÀÇ ¹üÀ§¸¦ ÁöÁ¤Çϵµ·Ï ÇØÁØ´Ù. Alignment output¿¡¼­ À̵éÀº ¼Ò¹®ÀڷΠǥ½ÃµÈ´Ù. Alpha ³ª¼±µé¿¡ ´ëÇØ¼­ ±× ¹üÀ§´Â end-helical turn (3 residues)·Î ¼³Á¤µÇ¾î ÀÖ´Ù. Beta °¡´Úµé¿¡¼­´Â ¼³Á¤°ªÀÌ ¸»´Ü Àܱâ¿Í ÀÌ¿ôÇÏ´Â loop Àܱâµé¿¡ °ÉÃÄÀÖ´Ù. ¿Ö³ÄÇÏ¸é ¼­¿­º¸Á¸Àº ÀÚÁÖ ½ÇÁ¦ ¼ö¼Ò°áÇÕÀ» ÀÌ·ç°í ÀÖ´Â Beta °¡´ÚÀ» Áö³ª È®ÀåµÇ¾î ³ªÅ¸³ª±â ¶§¹®ÀÌ´Ù.

 

Clustal X´Â SWISS-PROT, CLUSTAL or GDE format input files·ÎºÎÅÍ maskµéÀ» ÀÐÀ» ¼ö ÀÖ´Ù. ¸¹Àº 3-D protein structures¿¡ ´ëÇÑ ÀÌÂ÷±¸Á¶ Á¤º¸´Â SWISS-PROT database entryµéÀÇ Æ¯¼ºÇ¥¿¡ ±â·ÏµÇ¾î ÀÖ´Ù. Ç×»ó assignmentµéÀÌ Á¤È®ÇÑÁö Á¡°ËÇ϶ó - ¾î¶² °ÍÀº ¸Å¿ì ºÎÁ¤È®ÇÏ´Ù. Clustal X looks for SWISS-PROT HELIX and STRAND assignments e.g.

 

 

FT   HELIX       100    115

FT   STRAND      118    119

 

±¸Á¶¿Í penalty maskµéÀº CLUSTAL alignment format¿¡¼­ "!SS_" ¶Ç´Â "!GM_"·Î ½ÃÀÛÇÏ´Â comment linesÀ» °¡Áø´Ù.

¿¹¸¦ µé¾î,

 

!SS_HBA_HUMA    ..aaaAAAAAAAAAAaaa.aaaAAAAAAAAAAaaaaaaAaaa.........aaaAAAAAA

!GM_HBA_HUMA    112224444444444222122244444444442222224222111111111222444444

HBA_HUMA        VLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHGK

 

Mask ±× ÀÚü´Â 1°ú 9 »çÀÌÀÇ ¼ýÀÚµéÀÇ ¼¼Æ®·Î¼­ °¢°¢Àº °°Àº Çà¿¡ ÀÖ´Â Àܱâµé¿¡ ÁöÁ¤µÇ¾î ÀÖ´Ù.

 

GDE flat file format¿¡¼­ masks´Â text·Î ÁöÁ¤µÇ¾î ÀÖ°í ±× À̸§Àº "SS_ ¶Ç´Â "GM_·Î ½ÃÀ۵Ǿî¾ß¸¸ ÇÑ´Ù.

 

±¸Á¶ ¶Ç´Â penalty mask ¶Ç´Â µÑ ´Ù°¡ »ç¿ëµÉ ¼öµµ ÀÖ´Ù. ¸¸¾à µÑ ´Ù°¡ ¹è¿­¿¡ Æ÷ÇÔµÇ¸é ¾î´À °ÍÀ» »ç¿ëÇÒ °ÍÀÎÁö Áú¹®À» ¹Þ°ÔµÈ´Ù.