collagen, Gsp_012135 (mRNA) Girardia sp.

Overview
Name: collagen
Unique Name: Gsp_012135
Type: mRNA
Organism: Girardia sp. (Girardia sp.)
Sequence length: 8270
Analyses
This mRNA is derived from or has results from the following analyses:
Analysis Name                                  Date Performed
Girardia sp. Transcriptome                     2016-03-03
Girardia Sp Translation                        2016-04-28
Girardia Sp. BLASTX Human                      2016-03-08
Girardia Sp. BLASTX Swissprot Uniprot          2016-03-08
Girardia Sp. BLASTX Drosophila melanogaster    2016-03-09
Girardia Sp. BLASTX Schmidtea mediterranea     2016-03-09
Girardia Sp CDS                                2016-05-06
Homology
BLAST of collagen vs. RefSeq Human
Match: gi|767973237|ref|XP_011536230.1| (PREDICTED: collagen alpha-1(II) chain isoform X1 [Homo sapiens])

HSP 1 Score: 117.472 bits (293), Expect = 4.849e-25
Identity = 71/219 (32.42%), Positives = 110/219 (50.23%), Query Frame = 3
Query: 7530 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPIT--KGHTWLKSLLNS--------DEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXXRKSKFGLTIEEVCF 8150
            +I++P+G+++ PARTC+ L   +P    G+YWIDPN G T DA+KVFC +   +TC+ P    +  ++W S  +  K H W    +N         D +  +  N Q+ FL++ S   +Q IT+ C+N      E   N   A  +   +D  I+    S+  YT +KD C        +++IE +    S LPI DI    I   + +FG+ I  VCF
Sbjct: 1316 SIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1534          

HSP 2 Score: 117.472 bits (293), Expect = 4.849e-25
Identity = 71/219 (32.42%), Positives = 110/219 (50.23%), Query Frame = -3
Query:  121 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPIT--KGHTWLKSLLNS--------DEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXNRKSKFGLTIEEVCF 741
            +I++P+G+++ PARTC+ L   +P    G+YWIDPN G T DA+KVFC +   +TC+ P    +  ++W S  +  K H W    +N         D +  +  N Q+ FL++ S   +Q IT+ C+N      E   N   A  +   +D  I+    S+  YT +KD C        +++IE +    S LPI DI    I   + +FG+ I  VCF
Sbjct: 1316 SIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1534          
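The Query coordinates of HSP 1 (7530 to 8150) and HSP 2 (121 to 741) above are mirror images of each other on the 8270-nt sequence listed in the Overview. One reading, offered here as an assumption rather than something the record states, is that the two HSPs describe the same alignment numbered from opposite ends of the transcript. The sketch below shows the arithmetic, assuming 1-based inclusive coordinates; the function name `mirror_coords` is hypothetical.

```python
def mirror_coords(start, end, seq_len):
    """Renumber a 1-based inclusive interval from the other end of the sequence."""
    # Position p counted from the left equals seq_len - p + 1 counted from the right.
    return seq_len - end + 1, seq_len - start + 1


SEQ_LEN = 8270  # "Sequence length" from the Overview

# Query interval of HSP 1 against the human collagen alpha-1(II) hit
print(mirror_coords(7530, 8150, SEQ_LEN))  # (121, 741)
```

Applying the function twice returns the original interval, which is a quick sanity check when comparing paired HSPs.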
BLAST of collagen vs. RefSeq Human
Match: gi|767973239|ref|XP_011536231.1| (PREDICTED: collagen alpha-1(II) chain isoform X1 [Homo sapiens])

HSP 1 Score: 117.472 bits (293), Expect = 4.849e-25
Identity = 71/219 (32.42%), Positives = 110/219 (50.23%), Query Frame = 3
Query: 7530 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPIT--KGHTWLKSLLNS--------DEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXXRKSKFGLTIEEVCF 8150
            +I++P+G+++ PARTC+ L   +P    G+YWIDPN G T DA+KVFC +   +TC+ P    +  ++W S  +  K H W    +N         D +  +  N Q+ FL++ S   +Q IT+ C+N      E   N   A  +   +D  I+    S+  YT +KD C        +++IE +    S LPI DI    I   + +FG+ I  VCF
Sbjct: 1316 SIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1534          

HSP 2 Score: 117.472 bits (293), Expect = 4.849e-25
Identity = 71/219 (32.42%), Positives = 110/219 (50.23%), Query Frame = -3
Query:  121 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPIT--KGHTWLKSLLNS--------DEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXNRKSKFGLTIEEVCF 741
            +I++P+G+++ PARTC+ L   +P    G+YWIDPN G T DA+KVFC +   +TC+ P    +  ++W S  +  K H W    +N         D +  +  N Q+ FL++ S   +Q IT+ C+N      E   N   A  +   +D  I+    S+  YT +KD C        +++IE +    S LPI DI    I   + +FG+ I  VCF
Sbjct: 1316 SIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1534          
BLAST of collagen vs. RefSeq Human
Match: gi|767973241|ref|XP_011536232.1| (PREDICTED: collagen alpha-1(II) chain isoform X1 [Homo sapiens])

HSP 1 Score: 117.472 bits (293), Expect = 4.849e-25
Identity = 71/219 (32.42%), Positives = 110/219 (50.23%), Query Frame = 3
Query: 7530 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPIT--KGHTWLKSLLNS--------DEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXXRKSKFGLTIEEVCF 8150
            +I++P+G+++ PARTC+ L   +P    G+YWIDPN G T DA+KVFC +   +TC+ P    +  ++W S  +  K H W    +N         D +  +  N Q+ FL++ S   +Q IT+ C+N      E   N   A  +   +D  I+    S+  YT +KD C        +++IE +    S LPI DI    I   + +FG+ I  VCF
Sbjct: 1316 SIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1534          

HSP 2 Score: 117.472 bits (293), Expect = 4.849e-25
Identity = 71/219 (32.42%), Positives = 110/219 (50.23%), Query Frame = -3
Query:  121 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPIT--KGHTWLKSLLNS--------DEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXNRKSKFGLTIEEVCF 741
            +I++P+G+++ PARTC+ L   +P    G+YWIDPN G T DA+KVFC +   +TC+ P    +  ++W S  +  K H W    +N         D +  +  N Q+ FL++ S   +Q IT+ C+N      E   N   A  +   +D  I+    S+  YT +KD C        +++IE +    S LPI DI    I   + +FG+ I  VCF
Sbjct: 1316 SIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1534          
BLAST of collagen vs. RefSeq Human
Match: gi|767973243|ref|XP_011536233.1| (PREDICTED: collagen alpha-1(II) chain isoform X1 [Homo sapiens])

HSP 1 Score: 117.472 bits (293), Expect = 4.849e-25
Identity = 71/219 (32.42%), Positives = 110/219 (50.23%), Query Frame = 3
Query: 7530 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPIT--KGHTWLKSLLNS--------DEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXXRKSKFGLTIEEVCF 8150
            +I++P+G+++ PARTC+ L   +P    G+YWIDPN G T DA+KVFC +   +TC+ P    +  ++W S  +  K H W    +N         D +  +  N Q+ FL++ S   +Q IT+ C+N      E   N   A  +   +D  I+    S+  YT +KD C        +++IE +    S LPI DI    I   + +FG+ I  VCF
Sbjct: 1316 SIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1534          

HSP 2 Score: 117.472 bits (293), Expect = 4.849e-25
Identity = 71/219 (32.42%), Positives = 110/219 (50.23%), Query Frame = -3
Query:  121 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPIT--KGHTWLKSLLNS--------DEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXNRKSKFGLTIEEVCF 741
            +I++P+G+++ PARTC+ L   +P    G+YWIDPN G T DA+KVFC +   +TC+ P    +  ++W S  +  K H W    +N         D +  +  N Q+ FL++ S   +Q IT+ C+N      E   N   A  +   +D  I+    S+  YT +KD C        +++IE +    S LPI DI    I   + +FG+ I  VCF
Sbjct: 1316 SIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1534          
BLAST of collagen vs. RefSeq Human
Match: gi|767973245|ref|XP_011536234.1| (PREDICTED: collagen alpha-1(II) chain isoform X1 [Homo sapiens])

HSP 1 Score: 117.472 bits (293), Expect = 4.849e-25
Identity = 71/219 (32.42%), Positives = 110/219 (50.23%), Query Frame = 3
Query: 7530 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPIT--KGHTWLKSLLNS--------DEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXXRKSKFGLTIEEVCF 8150
            +I++P+G+++ PARTC+ L   +P    G+YWIDPN G T DA+KVFC +   +TC+ P    +  ++W S  +  K H W    +N         D +  +  N Q+ FL++ S   +Q IT+ C+N      E   N   A  +   +D  I+    S+  YT +KD C        +++IE +    S LPI DI    I   + +FG+ I  VCF
Sbjct: 1316 SIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1534          

HSP 2 Score: 117.472 bits (293), Expect = 4.849e-25
Identity = 71/219 (32.42%), Positives = 110/219 (50.23%), Query Frame = -3
Query:  121 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPIT--KGHTWLKSLLNS--------DEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXNRKSKFGLTIEEVCF 741
            +I++P+G+++ PARTC+ L   +P    G+YWIDPN G T DA+KVFC +   +TC+ P    +  ++W S  +  K H W    +N         D +  +  N Q+ FL++ S   +Q IT+ C+N      E   N   A  +   +D  I+    S+  YT +KD C        +++IE +    S LPI DI    I   + +FG+ I  VCF
Sbjct: 1316 SIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1534          
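Each HSP in this record opens with two summary lines: a score line (bit score, raw score, E-value) and an identity line (identities, positives, query frame). The following is a minimal sketch of how those two lines could be read into a dictionary, with the percentages recomputed from the counts. The function name `parse_hsp` and the regular expressions are illustrative assumptions based only on the line layout shown on this page, not part of any BLAST tool.

```python
import re

# Patterns follow the layout of the summary lines on this page (an assumption).
SCORE_RE = re.compile(
    r"HSP (\d+) Score: ([\d.]+) bits \((\d+)\), Expect = ([\d.eE+-]+)"
)
# "Pos\w*" tolerates spelling variants of the positives label.
IDENT_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Pos\w* = (\d+)/(\d+) \([\d.]+%\), "
    r"Query Frame = (-?\d+)"
)


def parse_hsp(score_line, identity_line):
    """Return the numeric fields of one HSP summary as a dict."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("line does not look like an HSP summary")
    identities, length = int(i.group(1)), int(i.group(2))
    positives = int(i.group(3))
    return {
        "hsp": int(s.group(1)),
        "bits": float(s.group(2)),
        "raw_score": int(s.group(3)),
        "evalue": float(s.group(4)),
        "identities": identities,
        "positives": positives,
        "align_len": length,
        # Percentages are recomputed from the counts rather than copied.
        "pct_identity": round(100 * identities / length, 2),
        "pct_positives": round(100 * positives / length, 2),
        "frame": int(i.group(5)),
    }


hsp = parse_hsp(
    "HSP 1 Score: 117.472 bits (293), Expect = 4.849e-25",
    "Identity = 71/219 (32.42%), Positives = 110/219 (50.23%), Query Frame = 3",
)
print(hsp["pct_identity"], hsp["pct_positives"], hsp["frame"])
```

Recomputing the percentages from the raw counts gives a cheap consistency check against the values printed in parentheses.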
BLAST of collagen vs. uniprot
Match: gi|18202526|sp|Q28668|CO1A2_RABIT (RecName: Full=Collagen alpha-2(I) chain; AltName: Full=Alpha-2 type I collagen; Flags: Precursor)

HSP 1 Score: 129.798 bits (325), Expect = 1.693e-29
Identity = 115/315 (36.51%), Positives = 159/315 (50.48%), Query Frame = 3
Query: 7293 QGPVGITGPRGD---PGIPGAIGPTGLSGKKGSNXXXXXXXXXXXXXXXXXXXXIM----------AMQLRSPT--KGVVYGDDPAAAELLGNNAIKN---PQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSW-NSPITKGHTWLKSLLNSD-EIQYSIPN-------GQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDT-IIKMTQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXXRKSKFGLTIEEVCF 8150
            +GP G TGP G     G PG +GP GL G +GS GP G  GP GPPGPPG  GG            A Q RSP   +   Y  D     L  NN I+    P+G+++ PARTC+ L   +P    G YWIDPN G T DA+KV+C  S  +TCI+   + IS+++W  S   K H WL   +N   + +Y++          QLAF+++ ++ A+Q IT+ C+N      EE  N   A  L   +D  ++    S+  YTV+ D C    +   ++IIE K +  S LP  DI    I     +F + +  VCF
Sbjct:  213 RGPAGPTGPAGKDGRSGHPGTVGPAGLRGSQGSQGPAGPPGPPGPPGPPGASGGGYDFGYDGDFYRADQPRSPPSLRPKDYEVDATLKSL--NNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTGETCIRAQPENISVKNWYKSSKAKKHVWLGETINGGTQFEYNVEGVTSKEMATQLAFMRLLANHASQNITYHCKNSIAYMDEETGNLNKAVILQGSNDVELVAEGNSRFTYTVLVDGCTKKTNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFYVDVGPVCF 525          

HSP 2 Score: 129.798 bits (325), Expect = 1.693e-29
Identity = 115/315 (36.51%), Positives = 159/315 (50.48%), Query Frame = -3
Query:  121 QGPVGITGPRGD---PGIPGAIGPTGLSGKKGSNXXXXXXXXXXXXXXXXXXXGIM----------AMQLRSPT--KGVVYGDDPAAAELLGNNAIKN---PQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSW-NSPITKGHTWLKSLLNSD-EIQYSIPN-------GQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDT-IIKMTQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXNRKSKFGLTIEEVCF 978
            +GP G TGP G     G PG +GP GL G +GS GP G  GP GPPGPPG  GG            A Q RSP   +   Y  D     L  NN I+    P+G+++ PARTC+ L   +P    G YWIDPN G T DA+KV+C  S  +TCI+   + IS+++W  S   K H WL   +N   + +Y++          QLAF+++ ++ A+Q IT+ C+N      EE  N   A  L   +D  ++    S+  YTV+ D C    +   ++IIE K +  S LP  DI    I     +F + +  VCF
Sbjct:  213 RGPAGPTGPAGKDGRSGHPGTVGPAGLRGSQGSQGPAGPPGPPGPPGPPGASGGGYDFGYDGDFYRADQPRSPPSLRPKDYEVDATLKSL--NNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTGETCIRAQPENISVKNWYKSSKAKKHVWLGETINGGTQFEYNVEGVTSKEMATQLAFMRLLANHASQNITYHCKNSIAYMDEETGNLNKAVILQGSNDVELVAEGNSRFTYTVLVDGCTKKTNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFYVDVGPVCF 525          
BLAST of collagen vs. uniprot
Match: gi|115286|sp|P02460|CO2A1_CHICK (RecName: Full=Collagen alpha-1(II) chain; AltName: Full=Alpha-1 type II collagen; Flags: Precursor, partial [Gallus gallus])

HSP 1 Score: 118.242 bits (295), Expect = 3.468e-26
Identity = 72/220 (32.73%), Positives = 111/220 (50.45%), Query Frame = 3
Query: 7530 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKG--HTWLKSLLNSDEIQYSI------PNG---QLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKMT-QSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXXRKSKFGLTIEEVCF 8150
            +I++P+G+K+ PARTC+ +   +P    G+YWIDPN G T DA+KVFC +   +TC+ P    I  ++W +  TK   H W    +N     +S       PN    Q+ FL++ S   +Q +T+ C+N      EE  N   A  +   +D  I+    S+  Y+V++D C        +++IE +    S LPI DI    I     +FG+ I  VCF
Sbjct:  150 SIRSPEGSKKNPARTCRDIKLCHPEWKSGDYWIDPNQGCTLDAIKVFCNMETGETCVYPTPSSIPRKNWWTSKTKDKKHVWFAETINGG-FHFSYGDENLSPNTASIQMTFLRLLSTEGSQNVTYHCKNSIAYMDEETGNLKKAILIQGSNDVEIRAEGNSRFTYSVLEDGCTKHTGKWGKTVIEYRSQKTSRLPIVDIAPMDIGGADQEFGVDIGPVCF 368          

HSP 2 Score: 118.242 bits (295), Expect = 3.468e-26
Identity = 72/220 (32.73%), Positives = 111/220 (50.45%), Query Frame = -3
Query:  121 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKG--HTWLKSLLNSDEIQYSI------PNG---QLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKMT-QSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXNRKSKFGLTIEEVCF 741
            +I++P+G+K+ PARTC+ +   +P    G+YWIDPN G T DA+KVFC +   +TC+ P    I  ++W +  TK   H W    +N     +S       PN    Q+ FL++ S   +Q +T+ C+N      EE  N   A  +   +D  I+    S+  Y+V++D C        +++IE +    S LPI DI    I     +FG+ I  VCF
Sbjct:  150 SIRSPEGSKKNPARTCRDIKLCHPEWKSGDYWIDPNQGCTLDAIKVFCNMETGETCVYPTPSSIPRKNWWTSKTKDKKHVWFAETINGG-FHFSYGDENLSPNTASIQMTFLRLLSTEGSQNVTYHCKNSIAYMDEETGNLKKAILIQGSNDVEIRAEGNSRFTYSVLEDGCTKHTGKWGKTVIEYRSQKTSRLPIVDIAPMDIGGADQEFGVDIGPVCF 368          
BLAST of collagen vs. uniprot
Match: gi|82202407|sp|Q6P4Z2|CO2A1_XENTR (RecName: Full=Collagen alpha-1(II) chain; AltName: Full=Alpha-1 type II collagen; Flags: Precursor)

HSP 1 Score: 122.094 bits (305), Expect = 4.162e-26
Identity = 78/221 (35.29%), Positives = 112/221 (50.68%), Query Frame = 3
Query: 7530 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKG----HTWLKSLLNSD-EIQY----SIPNG---QLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXXRKSKFGLTIEEVCF 8150
            +I++P GTK+ PARTC+ L   +P    G+YWIDPN G T DA+KVFC +   +TC+ P   +I  ++W S   KG    H W    +N   +  Y    S PN    Q+ FL++ S  ATQ IT+ C+N      E   N   A  L   +D  I+    S+  Y  ++D C+      ++++IE +    S LPI DI    I     +FG+ I  VCF
Sbjct: 1273 SIRSPDGTKKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTVDAIKVFCNMETGETCVYPNPSKIPKKNWWS--AKGKEKKHIWFGETINGGFQFSYGDDSSAPNTANIQMTFLRLLSTDATQNITYHCKNSIAFMDEASGNLKKAVLLQGSNDVEIRAEGNSRFTYNALEDGCKKHTGKWSKTVIEYRTQKTSRLPIVDIAPMDIGGADQEFGVDIGPVCF 1491          

HSP 2 Score: 122.094 bits (305), Expect = 4.162e-26
Identity = 78/221 (35.29%), Positives = 112/221 (50.68%), Query Frame = -3
Query:  121 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKG----HTWLKSLLNSD-EIQY----SIPNG---QLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXNRKSKFGLTIEEVCF 741
            +I++P GTK+ PARTC+ L   +P    G+YWIDPN G T DA+KVFC +   +TC+ P   +I  ++W S   KG    H W    +N   +  Y    S PN    Q+ FL++ S  ATQ IT+ C+N      E   N   A  L   +D  I+    S+  Y  ++D C+      ++++IE +    S LPI DI    I     +FG+ I  VCF
Sbjct: 1273 SIRSPDGTKKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTVDAIKVFCNMETGETCVYPNPSKIPKKNWWS--AKGKEKKHIWFGETINGGFQFSYGDDSSAPNTANIQMTFLRLLSTDATQNITYHCKNSIAFMDEASGNLKKAVLLQGSNDVEIRAEGNSRFTYNALEDGCKKHTGKWSKTVIEYRTQKTSRLPIVDIAPMDIGGADQEFGVDIGPVCF 1491          
BLAST of collagen vs. uniprot
Match: gi|146286085|sp|Q91717|CO2A1_XENLA (RecName: Full=Collagen alpha-1(II) chain; AltName: Full=Alpha-1 type II collagen; Flags: Precursor [Xenopus laevis])

HSP 1 Score: 120.939 bits (302), Expect = 1.212e-25
Identity = 80/226 (35.40%), Positives = 113/226 (50.00%), Query Frame = 3
Query: 7524 NNAIKN---PQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKG----HTWLKSLLNSD-EIQY----SIPNG---QLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXXRKSKFGLTIEEVCF 8150
            NN I+N   P GTK+ PARTC+ L   +P    G+YWIDPN G T DA+KVFC +   +TC+ P   +I  ++W S   KG    H W    +N   +  Y    S PN    Q+ FL++ S  A+Q IT+ C+N      E   N   A  L   +D  I+    S+  Y  ++D C+      ++++IE +    S LPI DI    I     +FG+ I  VCF
Sbjct: 1262 NNQIENIRSPDGTKKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTVDAIKVFCDMETGETCVYPNPSKIPKKNWWS--AKGKEKKHIWFGETINGGFQFSYGDDSSAPNTANIQMTFLRLLSTDASQNITYHCKNSIAFMDEASGNLKKAVLLQGSNDVEIRAEGNSRFTYNALEDGCKKHTGKWSKTVIEYRTQKTSRLPIVDIAPMDIGGADQEFGVDIGPVCF 1485          

HSP 2 Score: 120.939 bits (302), Expect = 1.212e-25
Identity = 80/226 (35.40%), Positives = 113/226 (50.00%), Query Frame = -3
Query:  121 NNAIKN---PQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKG----HTWLKSLLNSD-EIQY----SIPNG---QLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXNRKSKFGLTIEEVCF 747
            NN I+N   P GTK+ PARTC+ L   +P    G+YWIDPN G T DA+KVFC +   +TC+ P   +I  ++W S   KG    H W    +N   +  Y    S PN    Q+ FL++ S  A+Q IT+ C+N      E   N   A  L   +D  I+    S+  Y  ++D C+      ++++IE +    S LPI DI    I     +FG+ I  VCF
Sbjct: 1262 NNQIENIRSPDGTKKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTVDAIKVFCDMETGETCVYPNPSKIPKKNWWS--AKGKEKKHIWFGETINGGFQFSYGDDSSAPNTANIQMTFLRLLSTDASQNITYHCKNSIAFMDEASGNLKKAVLLQGSNDVEIRAEGNSRFTYNALEDGCKKHTGKWSKTVIEYRTQKTSRLPIVDIAPMDIGGADQEFGVDIGPVCF 1485          
BLAST of collagen vs. uniprot
Match: gi|8039779|sp|P02465|CO1A2_BOVIN (RecName: Full=Collagen alpha-2(I) chain; AltName: Full=Alpha-2 type I collagen; Flags: Precursor)

HSP 1 Score: 120.553 bits (301), Expect = 1.361e-25
Identity = 78/225 (34.67%), Positives = 118/225 (52.44%), Query Frame = 3
Query: 7524 NNAIKN---PQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSW--NSPITKGHTWLKSLLNSD-EIQYSIPNG--------QLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDT-IIKMTQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXXRKSKFGLTIEEVCF 8150
            NN I+    P+G+++ PARTC+ L   +P    G YWIDPN G T DA+KV+C  S  +TCI+   ++I +++W  NS   K H W+   +N   + +Y++  G        QLAF+++ ++ A+Q IT+ C+N      EE  N   A  L   +D  ++    S+  YTV+ D C    +   ++IIE K +  S LPI DI    I     +  L I  VCF
Sbjct: 1141 NNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTGETCIRAQPEDIPVKNWYRNSKAKK-HVWVGETINGGTQFEYNV-EGVTTKEMATQLAFMRLLANHASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKKTNEWQKTIIEYKTNKPSRLPILDIAPLDIGGADQEIRLNIGPVCF 1363          

HSP 2 Score: 120.553 bits (301), Expect = 1.361e-25
Identity = 78/225 (34.67%), Positives = 118/225 (52.44%), Query Frame = -3
Query:  121 NNAIKN---PQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSW--NSPITKGHTWLKSLLNSD-EIQYSIPNG--------QLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDT-IIKMTQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXNRKSKFGLTIEEVCF 747
            NN I+    P+G+++ PARTC+ L   +P    G YWIDPN G T DA+KV+C  S  +TCI+   ++I +++W  NS   K H W+   +N   + +Y++  G        QLAF+++ ++ A+Q IT+ C+N      EE  N   A  L   +D  ++    S+  YTV+ D C    +   ++IIE K +  S LPI DI    I     +  L I  VCF
Sbjct: 1141 NNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTGETCIRAQPEDIPVKNWYRNSKAKK-HVWVGETINGGTQFEYNV-EGVTTKEMATQLAFMRLLANHASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKKTNEWQKTIIEYKTNKPSRLPILDIAPLDIGGADQEIRLNIGPVCF 1363          

HSP 3 Score: 59.3066 bits (142), Expect = 5.082e-7
Identity = 50/109 (45.87%), Positives = 64/109 (58.72%), Query Frame = 3
Query: 4962 GKSGKPGLRGRTGP---DGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDG 5279
            G +G PG +G  GP    GN G+ G  G +G  G +GP+G +GP+G+RG  G PGD GP+G  G +G  GL+G  G  G  G +G  G  GP G  G  GP+GP G DG
Sbjct:  956 GAAGAPGPQGPVGPVGKHGNRGEPGPAGAVGPAGAVGPRGPSGPQGIRGDKGEPGDKGPRGLPGLKGHNGLQGLPGLAGHHGDQGAPGAVGPAGPRGPAGPSGPAGKDG 1064          

HSP 4 Score: 59.3066 bits (142), Expect = 5.082e-7
Identity = 50/109 (45.87%), Positives = 64/109 (58.72%), Query Frame = -3
Query: 2992 GKSGKPGLRGRTGP---DGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDG 3309
            G +G PG +G  GP    GN G+ G  G +G  G +GP+G +GP+G+RG  G PGD GP+G  G +G  GL+G  G  G  G +G  G  GP G  G  GP+GP G DG
Sbjct:  956 GAAGAPGPQGPVGPVGKHGNRGEPGPAGAVGPAGAVGPRGPSGPQGIRGDKGEPGDKGPRGLPGLKGHNGLQGLPGLAGHHGDQGAPGAVGPAGPRGPAGPSGPAGKDG 1064          
BLAST of collagen vs. RefSeq Drosophila melanogaster
Match: gi|24581820|ref|NP_723044.1| (collagen type IV, isoform A [Drosophila melanogaster])

HSP 1 Score: 56.6102 bits (135), Expect = 4.632e-7
Identity = 104/251 (41.43%), Positives = 122/251 (48.61%), Query Frame = 3
Query: 4596 EQGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXINGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPG-------DSGPKGSQGDQGPIGLEGK------SGPFGPKGQKGDRGRPGPQGEAGL---------IGPTGPIGPDGV 5282
            ++G PGP G  G TGP G PG  GEKG  G           G  GPPG  G  G++G  G  G  GE+G  G  G+ G PG +G +G+PG+PG PG  G  G  G                 G  G PG RG  G  G +G  G KGE G VGL G  G  GP+G RG  G PG         G KGSQG++G  G +G+      +GP G  G KGDRG  GP G +GL         IGP G IG  GV
Sbjct: 1065 QKGEPGPSGLRGDTGPAGTPGWPGEKGLPG-------LAVHGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLPGAAGA---------------PGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGIRGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGV 1293          

HSP 2 Score: 56.6102 bits (135), Expect = 4.632e-7
Identity = 104/251 (41.43%), Positives = 122/251 (48.61%), Query Frame = -3
Query: 2989 EQGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPVGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXGINGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPG-------DSGPKGSQGDQGPIGLEGK------SGPFGPKGQKGDRGRPGPQGEAGL---------IGPTGPIGPDGV 3675
            ++G PGP G  G TGP G PG  GEKG  G           G  GPPG  G  G++G  G  G  GE+G  G  G+ G PG +G +G+PG+PG PG  G  G  G                 G  G PG RG  G  G +G  G KGE G VGL G  G  GP+G RG  G PG         G KGSQG++G  G +G+      +GP G  G KGDRG  GP G +GL         IGP G IG  GV
Sbjct: 1065 QKGEPGPSGLRGDTGPAGTPGWPGEKGLPG-------LAVHGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLPGAAGA---------------PGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGIRGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGV 1293          

HSP 3 Score: 55.4546 bits (132), Expect = 1.056e-6
Identity = 50/104 (48.08%), Positives = 58/104 (55.77%), Query Frame = 3
Query: 4977 PGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGP--KGQKGDRGRPGPQGEAGLIGPTGPIGPDGV 5282
            P +RG  G  G  G  G KGE G+ GL GP G+ G +G RG  G PG SG  G  G +G IG  G+ G  G   KG+KG  GRPG  G  GLIG  G IG  G+
Sbjct: 1222 PDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGL 1325          

HSP 4 Score: 55.4546 bits (132), Expect = 1.056e-6
Identity = 50/104 (48.08%), Positives = 58/104 (55.77%), Query Frame = -3
Query: 2989 PGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGP--KGQKGDRGRPGPQGEAGLIGPTGPIGPDGV 3294
            P +RG  G  G  G  G KGE G+ GL GP G+ G +G RG  G PG SG  G  G +G IG  G+ G  G   KG+KG  GRPG  G  GLIG  G IG  G+
Sbjct: 1222 PDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGL 1325          

HSP 5 Score: 54.6842 bits (130), Expect = 1.704e-6
Identity = 44/95 (46.32%), Positives = 55/95 (57.89%), Query Frame = 3
Query: 4959 NGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAG 5243
            +G  G PGL+G TGP G  G++G  G  G  G  G +GL GP GL G  G  GD+G  G  G+ GP+G  G+ G  GPKG+ G  G PG  G+ G
Sbjct: 1362 DGFPGAPGLKGDTGPQGFKGERGLNGFEGQKGDKGDRGLQGPSGLPGLVGQKGDTGYPGLNGNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKG 1456          

HSP 6 Score: 54.6842 bits (130), Expect = 1.704e-6
Identity = 44/95 (46.32%), Positives = 55/95 (57.89%), Query Frame = -3
Query: 3028 NGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAG 3312
            +G  G PGL+G TGP G  G++G  G  G  G  G +GL GP GL G  G  GD+G  G  G+ GP+G  G+ G  GPKG+ G  G PG  G+ G
Sbjct: 1362 DGFPGAPGLKGDTGPQGFKGERGLNGFEGQKGDKGDRGLQGPSGLPGLVGQKGDTGYPGLNGNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKG 1456          
BLAST of collagen vs. RefSeq Drosophila melanogaster
Match: gi|24581822|ref|NP_723045.1| (collagen type IV, isoform B [Drosophila melanogaster])

HSP 1 Score: 56.6102 bits (135), Expect = 4.632e-7
Identity = 104/251 (41.43%), Positives = 122/251 (48.61%), Query Frame = 3
Query: 4596 EQGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXINGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPG-------DSGPKGSQGDQGPIGLEGK------SGPFGPKGQKGDRGRPGPQGEAGL---------IGPTGPIGPDGV 5282
            ++G PGP G  G TGP G PG  GEKG  G           G  GPPG  G  G++G  G  G  GE+G  G  G+ G PG +G +G+PG+PG PG  G  G  G                 G  G PG RG  G  G +G  G KGE G VGL G  G  GP+G RG  G PG         G KGSQG++G  G +G+      +GP G  G KGDRG  GP G +GL         IGP G IG  GV
Sbjct: 1065 QKGEPGPSGLRGDTGPAGTPGWPGEKGLPG-------LAVHGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLPGAAGA---------------PGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGIRGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGV 1293          

HSP 2 Score: 56.6102 bits (135), Expect = 4.632e-7
Identity = 104/251 (41.43%), Positives = 122/251 (48.61%), Query Frame = -3
Query: 2989 EQGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPVGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXGINGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPG-------DSGPKGSQGDQGPIGLEGK------SGPFGPKGQKGDRGRPGPQGEAGL---------IGPTGPIGPDGV 3675
            ++G PGP G  G TGP G PG  GEKG  G           G  GPPG  G  G++G  G  G  GE+G  G  G+ G PG +G +G+PG+PG PG  G  G  G                 G  G PG RG  G  G +G  G KGE G VGL G  G  GP+G RG  G PG         G KGSQG++G  G +G+      +GP G  G KGDRG  GP G +GL         IGP G IG  GV
Sbjct: 1065 QKGEPGPSGLRGDTGPAGTPGWPGEKGLPG-------LAVHGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLPGAAGA---------------PGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGIRGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGV 1293          

HSP 3 Score: 55.4546 bits (132), Expect = 1.056e-6
Identity = 50/104 (48.08%), Positives = 58/104 (55.77%), Query Frame = 3
Query: 4977 PGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGP--KGQKGDRGRPGPQGEAGLIGPTGPIGPDGV 5282
            P +RG  G  G  G  G KGE G+ GL GP G+ G +G RG  G PG SG  G  G +G IG  G+ G  G   KG+KG  GRPG  G  GLIG  G IG  G+
Sbjct: 1222 PDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGL 1325          

HSP 4 Score: 55.4546 bits (132), Expect = 1.056e-6
Identity = 50/104 (48.08%), Positives = 58/104 (55.77%), Query Frame = -3
Query: 2989 PGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGP--KGQKGDRGRPGPQGEAGLIGPTGPIGPDGV 3294
            P +RG  G  G  G  G KGE G+ GL GP G+ G +G RG  G PG SG  G  G +G IG  G+ G  G   KG+KG  GRPG  G  GLIG  G IG  G+
Sbjct: 1222 PDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGL 1325          

HSP 5 Score: 54.6842 bits (130), Expect = 1.704e-6
Identity = 44/95 (46.32%), Positives = 55/95 (57.89%), Query Frame = 3
Query: 4959 NGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAG 5243
            +G  G PGL+G TGP G  G++G  G  G  G  G +GL GP GL G  G  GD+G  G  G+ GP+G  G+ G  GPKG+ G  G PG  G+ G
Sbjct: 1362 DGFPGAPGLKGDTGPQGFKGERGLNGFEGQKGDKGDRGLQGPSGLPGLVGQKGDTGYPGLNGNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKG 1456          

HSP 6 Score: 54.6842 bits (130), Expect = 1.704e-6
Identity = 44/95 (46.32%), Positives = 55/95 (57.89%), Query Frame = -3
Query: 3028 NGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAG 3312
            +G  G PGL+G TGP G  G++G  G  G  G  G +GL GP GL G  G  GD+G  G  G+ GP+G  G+ G  GPKG+ G  G PG  G+ G
Sbjct: 1362 DGFPGAPGLKGDTGPQGFKGERGLNGFEGQKGDKGDRGLQGPSGLPGLVGQKGDTGYPGLNGNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKG 1456          
BLAST of collagen vs. RefSeq Drosophila melanogaster
Match: gi|24581824|ref|NP_723046.1| (collagen type IV, isoform C [Drosophila melanogaster])

HSP 1 Score: 56.6102 bits (135), Expect = 4.632e-7
Identity = 104/251 (41.43%), Positives = 122/251 (48.61%), Query Frame = 3
Query: 4596 EQGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXINGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPG-------DSGPKGSQGDQGPIGLEGK------SGPFGPKGQKGDRGRPGPQGEAGL---------IGPTGPIGPDGV 5282
            ++G PGP G  G TGP G PG  GEKG  G           G  GPPG  G  G++G  G  G  GE+G  G  G+ G PG +G +G+PG+PG PG  G  G  G                 G  G PG RG  G  G +G  G KGE G VGL G  G  GP+G RG  G PG         G KGSQG++G  G +G+      +GP G  G KGDRG  GP G +GL         IGP G IG  GV
Sbjct: 1065 QKGEPGPSGLRGDTGPAGTPGWPGEKGLPG-------LAVHGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLPGAAGA---------------PGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGIRGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGV 1293          

HSP 2 Score: 56.6102 bits (135), Expect = 4.632e-7
Identity = 104/251 (41.43%), Positives = 122/251 (48.61%), Query Frame = -3
Query: 2989 EQGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPVGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXGINGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPG-------DSGPKGSQGDQGPIGLEGK------SGPFGPKGQKGDRGRPGPQGEAGL---------IGPTGPIGPDGV 3675
            ++G PGP G  G TGP G PG  GEKG  G           G  GPPG  G  G++G  G  G  GE+G  G  G+ G PG +G +G+PG+PG PG  G  G  G                 G  G PG RG  G  G +G  G KGE G VGL G  G  GP+G RG  G PG         G KGSQG++G  G +G+      +GP G  G KGDRG  GP G +GL         IGP G IG  GV
Sbjct: 1065 QKGEPGPSGLRGDTGPAGTPGWPGEKGLPG-------LAVHGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLPGAAGA---------------PGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGIRGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGV 1293          

HSP 3 Score: 55.4546 bits (132), Expect = 1.056e-6
Identity = 50/104 (48.08%), Positives = 58/104 (55.77%), Query Frame = 3
Query: 4977 PGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGP--KGQKGDRGRPGPQGEAGLIGPTGPIGPDGV 5282
            P +RG  G  G  G  G KGE G+ GL GP G+ G +G RG  G PG SG  G  G +G IG  G+ G  G   KG+KG  GRPG  G  GLIG  G IG  G+
Sbjct: 1222 PDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGL 1325          

HSP 4 Score: 55.4546 bits (132), Expect = 1.056e-6
Identity = 50/104 (48.08%), Positives = 58/104 (55.77%), Query Frame = -3
Query: 2989 PGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGP--KGQKGDRGRPGPQGEAGLIGPTGPIGPDGV 3294
            P +RG  G  G  G  G KGE G+ GL GP G+ G +G RG  G PG SG  G  G +G IG  G+ G  G   KG+KG  GRPG  G  GLIG  G IG  G+
Sbjct: 1222 PDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGL 1325          

HSP 5 Score: 54.6842 bits (130), Expect = 1.704e-6
Identity = 44/95 (46.32%), Positives = 55/95 (57.89%), Query Frame = 3
Query: 4959 NGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAG 5243
            +G  G PGL+G TGP G  G++G  G  G  G  G +GL GP GL G  G  GD+G  G  G+ GP+G  G+ G  GPKG+ G  G PG  G+ G
Sbjct: 1362 DGFPGAPGLKGDTGPQGFKGERGLNGFEGQKGDKGDRGLQGPSGLPGLVGQKGDTGYPGLNGNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKG 1456          

HSP 6 Score: 54.6842 bits (130), Expect = 1.704e-6
Identity = 44/95 (46.32%), Positives = 55/95 (57.89%), Query Frame = -3
Query: 3028 NGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAG 3312
            +G  G PGL+G TGP G  G++G  G  G  G  G +GL GP GL G  G  GD+G  G  G+ GP+G  G+ G  GPKG+ G  G PG  G+ G
Sbjct: 1362 DGFPGAPGLKGDTGPQGFKGERGLNGFEGQKGDKGDRGLQGPSGLPGLVGQKGDTGYPGLNGNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKG 1456          
BLAST of collagen vs. RefSeq Drosophila melanogaster
Match: gi|442619464|ref|NP_001262641.1| (CG42342, isoform T [Drosophila melanogaster])

HSP 1 Score: 54.299 bits (129), Expect = 2.133e-6
Identity = 48/101 (47.52%), Positives = 60/101 (59.41%), Query Frame = 3
Query: 4980 GLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDGV 5282
            G+RG +GP G +GK G  G  G  G+ G QG TG +G RG  G PG  G  G +G +G  G  G +GP G +G+KGDRG  G QG  GL  P  P+G DG+
Sbjct:  648 GMRGESGPSGPSGKAGIPGPPGLDGMKGAQGETGHKGERGDPGLPGTDGIPGQEGPRGEQGSRGDAGPPGKRGRKGDRGDKGEQGVPGLDAPC-PLGADGL 747          

HSP 2 Score: 54.299 bits (129), Expect = 2.133e-6
Identity = 48/101 (47.52%), Positives = 60/101 (59.41%), Query Frame = -3
Query: 2989 GLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDGV 3291
            G+RG +GP G +GK G  G  G  G+ G QG TG +G RG  G PG  G  G +G +G  G  G +GP G +G+KGDRG  G QG  GL  P  P+G DG+
Sbjct:  648 GMRGESGPSGPSGKAGIPGPPGLDGMKGAQGETGHKGERGDPGLPGTDGIPGQEGPRGEQGSRGDAGPPGKRGRKGDRGDKGEQGVPGLDAPC-PLGADGL 747          
BLAST of collagen vs. RefSeq Drosophila melanogaster
Match: gi|442619462|ref|NP_001247141.2| (CG42342, isoform S [Drosophila melanogaster])

HSP 1 Score: 53.9138 bits (128), Expect = 2.564e-6
Identity = 48/101 (47.52%), Positives = 60/101 (59.41%), Query Frame = 3
Query: 4980 GLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDGV 5282
            G+RG +GP G +GK G  G  G  G+ G QG TG +G RG  G PG  G  G +G +G  G  G +GP G +G+KGDRG  G QG  GL  P  P+G DG+
Sbjct:  643 GMRGESGPSGPSGKAGIPGPPGLDGMKGAQGETGHKGERGDPGLPGTDGIPGQEGPRGEQGSRGDAGPPGKRGRKGDRGDKGEQGVPGLDAPC-PLGADGL 742          

HSP 2 Score: 53.9138 bits (128), Expect = 2.564e-6
Identity = 48/101 (47.52%), Positives = 60/101 (59.41%), Query Frame = -3
Query: 2989 GLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDGV 3291
            G+RG +GP G +GK G  G  G  G+ G QG TG +G RG  G PG  G  G +G +G  G  G +GP G +G+KGDRG  G QG  GL  P  P+G DG+
Sbjct:  643 GMRGESGPSGPSGKAGIPGPPGLDGMKGAQGETGHKGERGDPGLPGTDGIPGQEGPRGEQGSRGDAGPPGKRGRKGDRGDKGEQGVPGLDAPC-PLGADGL 742          
BLAST of collagen vs. Smed Unigenes AA
Match: SMU15040033 (dd_smedV4_702_0_1|m.35199|m.6295)

HSP 1 Score: 443.736 bits (1140), Expect = 6.448e-129
Identity = 235/288 (81.60%), Positives = 262/288 (90.97%), Query Frame = 3
Query: 7293 QGPVGITGPRGDPGIPGAIGPTGLSGKKGSNXXXXXXXXXXXXXXXXXXXXIMAMQLRSPTKGVVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIELKHLASLLPIRDIGXXXXXXRKSKFGLTIEEVCFS* 8156
            QGPVGI GPRGDPGIPG IGPTGL GKKG  G +G +GPLGPPGPPGPPGGIMAMQ+RSPTKGV YGDDP AAELLGNNAIKNP+GTKEVPA TCKQLS ++PN+PDGEYWIDPNGGR +DAVKV+C+ISE+KTCIKP+N EIS+RSW S    GHTWLKS+LN +EIQYSIPNGQ+AFLK+ SD+A QR+TF CENHPIIG+EEKLNKV APRLLADDDTIIKMT S LKYTVIKDECQYSKSSEAESIIE+++ A+LLPIRDIG++IINNRKSKFG+TIEEVCFS*
Sbjct:  999 QGPVGIIGPRGDPGIPGPIGPTGLHGKKGGIGIMGPVGPLGPPGPPGPPGGIMAMQMRSPTKGVTYGDDPLAAELLGNNAIKNPEGTKEVPAITCKQLSVKHPNLPDGEYWIDPNGGRVNDAVKVYCRISEQKTCIKPINNEISLRSWKSHSANGHTWLKSILNKEEIQYSIPNGQIAFLKVNSDSAVQRVTFTCENHPIIGNEEKLNKVTAPRLLADDDTIIKMTHSHLKYTVIKDECQYSKSSEAESIIEVRNYANLLPIRDIGVSIINNRKSKFGVTIEEVCFS* 1286          

HSP 2 Score: 443.736 bits (1140), Expect = 6.448e-129
Identity = 235/288 (81.60%), Positives = 262/288 (90.97%), Query Frame = -3
Query:  115 QGPVGITGPRGDPGIPGAIGPTGLSGKKGSNXXXXXXXXXXXXXXXXXXXGIMAMQLRSPTKGVVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIELKHLASLLPIRDIGXXXXXNRKSKFGLTIEEVCFS* 978
            QGPVGI GPRGDPGIPG IGPTGL GKKG  G +G +GPLGPPGPPGPPGGIMAMQ+RSPTKGV YGDDP AAELLGNNAIKNP+GTKEVPA TCKQLS ++PN+PDGEYWIDPNGGR +DAVKV+C+ISE+KTCIKP+N EIS+RSW S    GHTWLKS+LN +EIQYSIPNGQ+AFLK+ SD+A QR+TF CENHPIIG+EEKLNKV APRLLADDDTIIKMT S LKYTVIKDECQYSKSSEAESIIE+++ A+LLPIRDIG++IINNRKSKFG+TIEEVCFS*
Sbjct:  999 QGPVGIIGPRGDPGIPGPIGPTGLHGKKGGIGIMGPVGPLGPPGPPGPPGGIMAMQMRSPTKGVTYGDDPLAAELLGNNAIKNPEGTKEVPAITCKQLSVKHPNLPDGEYWIDPNGGRVNDAVKVYCRISEQKTCIKPINNEISLRSWKSHSANGHTWLKSILNKEEIQYSIPNGQIAFLKVNSDSAVQRVTFTCENHPIIGNEEKLNKVTAPRLLADDDTIIKMTHSHLKYTVIKDECQYSKSSEAESIIEVRNYANLLPIRDIGVSIINNRKSKFGVTIEEVCFS* 1286          

HSP 3 Score: 240.736 bits (613), Expect = 1.042e-63
Identity = 252/311 (81.03%), Positives = 273/311 (87.78%), Query Frame = 3
Query: 4299 MLKISIISGAILLVLIYIECVHGQFRTLNEATGPMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDGQRGAPGLQGPPXXXXXXXXXXXXXXXXXXXXXXEQGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXINGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQ 5231
            M K SI SGAILL+LIY++  +GQFRTLNEATGP+GIRGNPGKRGK GPDGDPG+ GP GPPG DG RGAPG  GP G  GP+GKSG  G +GPPGSRGEQGPPGP+GKEGLTGP GY GPSGEKGDSG  GEQGD GDIGP GP GP+G PGQ+GPTGPQG VGERG HGPDG+VGPPGPEGR+GSPG PGRPGE+GKKGEGGDEG KGGKGE+GKTGINGKSGKPG+RG  GP G NGKQGRKGE+GD+GL GPQGL GPRG+RG+ GNPGD+GPKGSQGDQGPIGLEGK GPFGPKGQKGDRGRPGPQ
Sbjct:    1 MFKNSIFSGAILLILIYVDFSYGQFRTLNEATGPIGIRGNPGKRGKIGPDGDPGSSGPPGPPGKDGLRGAPGPNGPAGGAGPDGKSGVTGNTGPPGSRGEQGPPGPVGKEGLTGPNGYSGPSGEKGDSGSIGEQGDPGDIGPQGPAGPLGPPGQSGPTGPQGTVGERGPHGPDGVVGPPGPEGRMGSPGSPGRPGELGKKGEGGDEGLKGGKGENGKTGINGKSGKPGIRGPIGPVGINGKQGRKGELGDIGLTGPQGLIGPRGVRGTVGNPGDNGPKGSQGDQGPIGLEGKPGPFGPKGQKGDRGRPGPQ 311          

HSP 4 Score: 240.736 bits (613), Expect = 1.042e-63
Identity = 252/311 (81.03%), Positives = 273/311 (87.78%), Query Frame = -3
Query: 3040 MLKISIISGAILLVLIYIECVHGQFRTLNEATGPMXXXXXXXXXXXXXXXXXXXXXXXXXXXXNDGQRGAPGLQGPPXXXXXXXXXXXXXXXXXXXXXGEQGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPVGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXGINGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQ 3972
            M K SI SGAILL+LIY++  +GQFRTLNEATGP+GIRGNPGKRGK GPDGDPG+ GP GPPG DG RGAPG  GP G  GP+GKSG  G +GPPGSRGEQGPPGP+GKEGLTGP GY GPSGEKGDSG  GEQGD GDIGP GP GP+G PGQ+GPTGPQG VGERG HGPDG+VGPPGPEGR+GSPG PGRPGE+GKKGEGGDEG KGGKGE+GKTGINGKSGKPG+RG  GP G NGKQGRKGE+GD+GL GPQGL GPRG+RG+ GNPGD+GPKGSQGDQGPIGLEGK GPFGPKGQKGDRGRPGPQ
Sbjct:    1 MFKNSIFSGAILLILIYVDFSYGQFRTLNEATGPIGIRGNPGKRGKIGPDGDPGSSGPPGPPGKDGLRGAPGPNGPAGGAGPDGKSGVTGNTGPPGSRGEQGPPGPVGKEGLTGPNGYSGPSGEKGDSGSIGEQGDPGDIGPQGPAGPLGPPGQSGPTGPQGTVGERGPHGPDGVVGPPGPEGRMGSPGSPGRPGELGKKGEGGDEGLKGGKGENGKTGINGKSGKPGIRGPIGPVGINGKQGRKGELGDIGLTGPQGLIGPRGVRGTVGNPGDNGPKGSQGDQGPIGLEGKPGPFGPKGQKGDRGRPGPQ 311          

HSP 5 Score: 87.0409 bits (214), Expect = 1.582e-16
Identity = 97/131 (74.05%), Positives = 107/131 (81.68%), Query Frame = 3
Query: 5487 KSGNKGALGPIGPSGLRGPPGNPGKDGMIXXXXXXXXXXXXXNVGAPGNKGNIGEPGPKGKTGNSGKPGPAGKNGADGSEGLSGNIGSPGFXXXXXXXXXXXXXXXXXXXXXXXXXXNQGISGKNGAPGTE 5879
            KSG KG+LGP G +GLRGP GNPGKDG +GPLG PGLRGPPG++G PG KGNIG PG KGK GN+GKPGP GKNG DGSEG  GN GSPGFPGPNGDPG +GPPG  G  G+ GYPGNQG++GKNG PG E
Sbjct:  397 KSGRKGSLGPTGLTGLRGPQGNPGKDGTLGPLGTPGLRGPPGSIGTPGLKGNIGPPGSKGKVGNAGKPGPLGKNGIDGSEGPIGNAGSPGFPGPNGDPGPNGPPGSLGLAGLVGYPGNQGLAGKNGNPGVE 527          

HSP 6 Score: 87.0409 bits (214), Expect = 1.582e-16
Identity = 97/131 (74.05%), Positives = 107/131 (81.68%), Query Frame = -3
Query: 2392 KSGNKGALGPIGPSGLRGPPGNPGKDGMIXXXXXXXXXXXXGNVGAPGNKGNIGEPGPKGKTGNSGKPGPAGKNGADGSEGLSGNIGSPGFXXXXXXXXXXXXXXXXXXXXXXXXXGNQGISGKNGAPGTE 2784
            KSG KG+LGP G +GLRGP GNPGKDG +GPLG PGLRGPPG++G PG KGNIG PG KGK GN+GKPGP GKNG DGSEG  GN GSPGFPGPNGDPG +GPPG  G  G+ GYPGNQG++GKNG PG E
Sbjct:  397 KSGRKGSLGPTGLTGLRGPQGNPGKDGTLGPLGTPGLRGPPGSIGTPGLKGNIGPPGSKGKVGNAGKPGPLGKNGIDGSEGPIGNAGSPGFPGPNGDPGPNGPPGSLGLAGLVGYPGNQGLAGKNGNPGVE 527          

HSP 7 Score: 60.8474 bits (146), Expect = 1.244e-8
Identity = 53/109 (48.62%), Positives = 66/109 (60.55%), Query Frame = 3
Query: 4962 GKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGP------IGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIG 5270
            G +G PG RG  GP+G +GK GRKG       LGP GLTG RG +G+ G  G  GP G+ G +GP       GL+G  GP G KG+ G+ G+PGP G+ G+ G  GPIG
Sbjct:  378 GPNGAPGPRGEIGPNGPDGKSGRKGS------LGPTGLTGLRGPQGNPGKDGTLGPLGTPGLRGPPGSIGTPGLKGNIGPPGSKGKVGNAGKPGPLGKNGIDGSEGPIG 480          

HSP 8 Score: 60.8474 bits (146), Expect = 1.244e-8
Identity = 53/109 (48.62%), Positives = 66/109 (60.55%), Query Frame = -3
Query: 3001 GKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGP------IGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIG 3309
            G +G PG RG  GP+G +GK GRKG       LGP GLTG RG +G+ G  G  GP G+ G +GP       GL+G  GP G KG+ G+ G+PGP G+ G+ G  GPIG
Sbjct:  378 GPNGAPGPRGEIGPNGPDGKSGRKGS------LGPTGLTGLRGPQGNPGKDGTLGPLGTPGLRGPPGSIGTPGLKGNIGPPGSKGKVGNAGKPGPLGKNGIDGSEGPIG 480          
BLAST of collagen vs. Smed Unigenes AA
Match: SMU15002271 (Asxlregen_comp67208_c0_seq1|m.27270|m.10319)

HSP 1 Score: 245.358 bits (625), Expect = 4.276e-65
Identity = 117/232 (50.43%), Positives = 158/232 (68.10%), Query Frame = 3
Query: 7461 LRSPTKGVVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIELKHLASLLPIRDIGXXXXXXRKSKFGLTIEEVCFS* 8156
            LRSPTKG+ + DDP+ A   GNNAI  P+GTKEVPAR+CK LS  NP++ DGEYWIDPNGGR SDAV V+C+I+ ++TCIKP+++     SW     K H W +++    E +Y I N QL +LK  S+TATQ+I+  C N  II   +     +   LL DDDTI+ +   K ++ VIKDECQY K SEAE+I+E++  AS LPI+D+G+ I ++R  K G+ + EVC+S*
Sbjct: 1064 LRSPTKGLTFSDDPSVAHSFGNNAIITPRGTKEVPARSCKHLSEHNPDLSDGEYWIDPNGGRVSDAVPVYCRIATQQTCIKPISKIYKTASWFKKYQKDHVWFQTINGIGEFEYDIENYQLNYLKALSETATQQISLNCINQAIILDRQGKMSTVWTSLLGDDDTILSLQHPKRRFKVIKDECQYEKFSEAETILEVRGKASRLPIKDVGLIIDSDRSRKVGIELGEVCYS* 1295          

HSP 2 Score: 245.358 bits (625), Expect = 4.276e-65
Identity = 117/232 (50.43%), Positives = 158/232 (68.10%), Query Frame = -3
Query:  115 LRSPTKGVVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIELKHLASLLPIRDIGXXXXXNRKSKFGLTIEEVCFS* 810
            LRSPTKG+ + DDP+ A   GNNAI  P+GTKEVPAR+CK LS  NP++ DGEYWIDPNGGR SDAV V+C+I+ ++TCIKP+++     SW     K H W +++    E +Y I N QL +LK  S+TATQ+I+  C N  II   +     +   LL DDDTI+ +   K ++ VIKDECQY K SEAE+I+E++  AS LPI+D+G+ I ++R  K G+ + EVC+S*
Sbjct: 1064 LRSPTKGLTFSDDPSVAHSFGNNAIITPRGTKEVPARSCKHLSEHNPDLSDGEYWIDPNGGRVSDAVPVYCRIATQQTCIKPISKIYKTASWFKKYQKDHVWFQTINGIGEFEYDIENYQLNYLKALSETATQQISLNCINQAIILDRQGKMSTVWTSLLGDDDTILSLQHPKRRFKVIKDECQYEKFSEAETILEVRGKASRLPIKDVGLIIDSDRSRKVGIELGEVCYS* 1295          

HSP 3 Score: 96.6709 bits (239), Expect = 1.800e-19
Identity = 123/237 (51.90%), Positives = 144/237 (60.76%), Query Frame = 3
Query: 4596 EQGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXINGKSGKPGLRGRTGPDGN---------NGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRP------GPQGEAGLIGPTG 5261
            E G  G IG EG TGP G  G +G+KGD G  G +G  G+ G  GP G  G  G+ GP G QGP GERG  G DG+ G  GP+G IG PG         + G  G+ G KG +GESG  G  G SG PGL G+TGP G+         +G+ G +GE G VG  GP G  G RG RG SGNPG SGPKGSQG++GPIG+EGK G  GPKGQKGD GRP      GP+GE G++GP G
Sbjct:  107 EPGLAGSIGPEGGTGPQGNRGLTGDKGDIGLAGLKGSNGEPGLQGPQGLRGPAGRVGPAGIQGPTGERGKQGTDGVPGSLGPQGAIGPPG---------QSGIPGEIGNKGIRGESGIKGAKGDSGNPGLAGKTGPSGSLGPPGYPGVDGRPGVRGEAGIVGPQGPVGKVGQRGQRGPSGNPGLSGPKGSQGEEGPIGIEGKQGSAGPKGQKGDPGRPGETGDEGPRGERGVVGPAG 334          

HSP 4 Score: 96.6709 bits (239), Expect = 1.800e-19
Identity = 123/237 (51.90%), Positives = 144/237 (60.76%), Query Frame = -3
Query: 3010 EQGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPVGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXGINGKSGKPGLRGRTGPDGN---------NGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRP------GPQGEAGLIGPTG 3675
            E G  G IG EG TGP G  G +G+KGD G  G +G  G+ G  GP G  G  G+ GP G QGP GERG  G DG+ G  GP+G IG PG         + G  G+ G KG +GESG  G  G SG PGL G+TGP G+         +G+ G +GE G VG  GP G  G RG RG SGNPG SGPKGSQG++GPIG+EGK G  GPKGQKGD GRP      GP+GE G++GP G
Sbjct:  107 EPGLAGSIGPEGGTGPQGNRGLTGDKGDIGLAGLKGSNGEPGLQGPQGLRGPAGRVGPAGIQGPTGERGKQGTDGVPGSLGPQGAIGPPG---------QSGIPGEIGNKGIRGESGIKGAKGDSGNPGLAGKTGPSGSLGPPGYPGVDGRPGVRGEAGIVGPQGPVGKVGQRGQRGPSGNPGLSGPKGSQGEEGPIGIEGKQGSAGPKGQKGDPGRPGETGDEGPRGERGVVGPAG 334          

HSP 5 Score: 56.225 bits (134), Expect = 3.744e-7
Identity = 57/122 (46.72%), Positives = 70/122 (57.38%), Query Frame = 3
Query: 4959 NGKSGKPGLRGRTGPDGNNGKQGR------KGEIGDVGLLGPQGLTGPRGLRGSSGNPGD---SGPKGSQGD---QGPIGLE---GKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDG 5279
             GKSGKPG  G  G DG  G  G       +GE G  G +GP+G TGP+G RG +G+ GD   +G KGS G+   QGP GL    G+ GP G +G  G+RG+ G  G  G +GP G IGP G
Sbjct:   75 RGKSGKPGASGIPGKDGTPGSHGVVGPIGPRGEPGLAGSIGPEGGTGPQGNRGLTGDKGDIGLAGLKGSNGEPGLQGPQGLRGPAGRVGPAGIQGPTGERGKQGTDGVPGSLGPQGAIGPPG 196          

HSP 6 Score: 56.225 bits (134), Expect = 3.744e-7
Identity = 57/122 (46.72%), Positives = 70/122 (57.38%), Query Frame = -3
Query: 2992 NGKSGKPGLRGRTGPDGNNGKQGR------KGEIGDVGLLGPQGLTGPRGLRGSSGNPGD---SGPKGSQGD---QGPIGLE---GKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDG 3312
             GKSGKPG  G  G DG  G  G       +GE G  G +GP+G TGP+G RG +G+ GD   +G KGS G+   QGP GL    G+ GP G +G  G+RG+ G  G  G +GP G IGP G
Sbjct:   75 RGKSGKPGASGIPGKDGTPGSHGVVGPIGPRGEPGLAGSIGPEGGTGPQGNRGLTGDKGDIGLAGLKGSNGEPGLQGPQGLRGPAGRVGPAGIQGPTGERGKQGTDGVPGSLGPQGAIGPPG 196          

HSP 7 Score: 53.5286 bits (127), Expect = 1.951e-6
Identity = 48/107 (44.86%), Positives = 61/107 (57.01%), Query Frame = 3
Query: 4959 NGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDG 5279
            +GK G+PG  G  GP G  G +G +G  G+ G  GPQGL G        G+PG  GP+GS G QG    +G +GP GP G++G +GR G  G+ GL G  GP G  G
Sbjct:  699 DGKPGEPGTPGIDGPPGQVGPEGPRGPSGETGEQGPQGLPG------KPGDPGAEGPRGSSGKQG---FQGPTGPIGPTGKQGKQGRAGKSGKNGLTGRKGPAGQRG 796          

HSP 8 Score: 53.5286 bits (127), Expect = 1.951e-6
Identity = 48/107 (44.86%), Positives = 61/107 (57.01%), Query Frame = -3
Query: 2992 NGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDG 3312
            +GK G+PG  G  GP G  G +G +G  G+ G  GPQGL G        G+PG  GP+GS G QG    +G +GP GP G++G +GR G  G+ GL G  GP G  G
Sbjct:  699 DGKPGEPGTPGIDGPPGQVGPEGPRGPSGETGEQGPQGLPG------KPGDPGAEGPRGSSGKQG---FQGPTGPIGPTGKQGKQGRAGKSGKNGLTGRKGPAGQRG 796          

HSP 9 Score: 52.7582 bits (125), Expect = 3.607e-6
Identity = 46/99 (46.46%), Positives = 52/99 (52.53%), Query Frame = 3
Query: 4989 GRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDGVK 5285
            GR G  G +GK G  G  G  G  G  G+ GP G RG  G  G  GP+G  G QG  GL G  G  G  G KG  G PG QG  GL GP G +GP G++
Sbjct:   70 GRDGARGKSGKPGASGIPGKDGTPGSHGVVGPIGPRGEPGLAGSIGPEGGTGPQGNRGLTGDKGDIGLAGLKGSNGEPGLQGPQGLRGPAGRVGPAGIQ 168          

HSP 10 Score: 52.7582 bits (125), Expect = 3.607e-6
Identity = 46/99 (46.46%), Positives = 52/99 (52.53%), Query Frame = -3
Query: 2986 GRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDGVK 3282
            GR G  G +GK G  G  G  G  G  G+ GP G RG  G  G  GP+G  G QG  GL G  G  G  G KG  G PG QG  GL GP G +GP G++
Sbjct:   70 GRDGARGKSGKPGASGIPGKDGTPGSHGVVGPIGPRGEPGLAGSIGPEGGTGPQGNRGLTGDKGDIGLAGLKGSNGEPGLQGPQGLRGPAGRVGPAGIQ 168          
BLAST of collagen vs. Smed Unigenes AA
Match: SMU15036469 (dd_smedV4_1070_0_1|m.1593|m.7941)

HSP 1 Score: 168.318 bits (425), Expect = 2.105e-41
Identity = 89/225 (39.56%), Positives = 130/225 (57.78%), Query Frame = 3
Query: 7482 VVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIELKHLASLLPIRDIGXXXXXXR--KSKFGLTIEEVCF 8150
            ++  DDP  A+ LGN+AI  P GT   PAR+C  L+  NP+ PDG YWIDPNGG+  DAV+V+CKI E+KTCIKPL  +I ++           W     ++  I YS+   Q+ FLK+ S+ A+Q IT  C N P+I      N V   R+  D+D I+  +     Y V++D CQ++    + + +E+    + LPI+DI ++ +N R  +S+    IEEVCF
Sbjct: 1078 IIQADDPLIAKYLGNDAISKPLGTSNSPARSCLHLAEMNPSFPDGIYWIDPNGGKIDDAVQVYCKIKEKKTCIKPLVFKIQLQKPK------FNWFSQSNDNKFISYSLDQQQMTFLKMISNKASQFITINCRNMPVI-----KNSVKPLRIFTDNDIILDSSDQIFSYKVLEDNCQHNSQDLSSTRLEITSRPTRLPIKDIEVDTVNVRSERSQIEYNIEEVCF 1291          

HSP 2 Score: 168.318 bits (425), Expect = 2.105e-41
Identity = 89/225 (39.56%), Positives = 130/225 (57.78%), Query Frame = -3
Query:  121 VVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIELKHLASLLPIRDIGXXXXXNR--KSKFGLTIEEVCF 789
            ++  DDP  A+ LGN+AI  P GT   PAR+C  L+  NP+ PDG YWIDPNGG+  DAV+V+CKI E+KTCIKPL  +I ++           W     ++  I YS+   Q+ FLK+ S+ A+Q IT  C N P+I      N V   R+  D+D I+  +     Y V++D CQ++    + + +E+    + LPI+DI ++ +N R  +S+    IEEVCF
Sbjct: 1078 IIQADDPLIAKYLGNDAISKPLGTSNSPARSCLHLAEMNPSFPDGIYWIDPNGGKIDDAVQVYCKIKEKKTCIKPLVFKIQLQKPK------FNWFSQSNDNKFISYSLDQQQMTFLKMISNKASQFITINCRNMPVI-----KNSVKPLRIFTDNDIILDSSDQIFSYKVLEDNCQHNSQDLSSTRLEITSRPTRLPIKDIEVDTVNVRSERSQIEYNIEEVCF 1291          

HSP 3 Score: 61.6178 bits (148), Expect = 7.450e-9
Identity = 54/111 (48.65%), Positives = 64/111 (57.66%), Query Frame = 3
Query: 4956 INGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGK------SGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIG 5270
            +NG  G  G  G  G  G +G  G KGE G +GLLGPQGL+GP GL+GS G PG  G KG++G  GP+G  G        G  G +G+ G  G PGPQG  GL G  GP G
Sbjct:  237 VNGAPGPIGQPGIMGSRGKDGPIGIKGENGPLGLLGPQGLSGPPGLQGSLGPPGPQGSKGNEGKIGPVGPAGSPGSPGLIGEIGERGENGPFGNPGPQGPRGLRGSAGPKG 347          

HSP 4 Score: 61.6178 bits (148), Expect = 7.450e-9
Identity = 54/111 (48.65%), Positives = 64/111 (57.66%), Query Frame = -3
Query: 3001 INGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGK------SGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIG 3315
            +NG  G  G  G  G  G +G  G KGE G +GLLGPQGL+GP GL+GS G PG  G KG++G  GP+G  G        G  G +G+ G  G PGPQG  GL G  GP G
Sbjct:  237 VNGAPGPIGQPGIMGSRGKDGPIGIKGENGPLGLLGPQGLSGPPGLQGSLGPPGPQGSKGNEGKIGPVGPAGSPGSPGLIGEIGERGENGPFGNPGPQGPRGLRGSAGPKG 347          
BLAST of collagen vs. Smed Unigenes AA
Match: SMU15029208 (SmedSxlregen_c102983_g1_i1|m.43148|m.11576)

HSP 1 Score: 153.295 bits (386), Expect = 9.712e-37
Identity = 104/299 (34.78%), Positives = 159/299 (53.18%), Query Frame = 3
Query: 7293 QGPVGITGPRGDPG---------IPGAIGPTGLSGKKGSNXXXXXXXXXXXXXXXXXXXXIMAMQLRSPTKGVVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWN-SPITKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIELKHLASLLPIRDIGXXXXXXR-KSKFGLTIEEVCFS* 8156
            QG VG TG +G PG         +PG  GP GL                    P  P    +   +R+  + ++  DD  A + LG++ +K P GTK++PARTCKQL   NPN+ DG Y+IDPNGG+  DA +V C+   +++CI+P +    ++ W+ S +T+  +W   +  + +  Y I   QL FLK++S  A QRIT  C    ++G+ E  + V+   L +D D  +       KY+VI+D C+ S      + +E+   A  LPIRDI +N  +++ + +FGL I +VCFS*
Sbjct: 1038 QGSVGPTGTKGFPGENGSPGPVGMPGRDGPAGLP-----------GPVGNTGPPGPPGPPAVFFPVRTVRRDLLN-DDALAVKYLGSDVVKKPLGTKDIPARTCKQLLDANPNLQDGFYYIDPNGGKADDAFRVLCRSQRKESCIEPKSPSYKLKHWDASDVTEYRSWFGEITGTFKFDYQIEASQLMFLKLFSTNARQRITINCSRLSVVGNSE--HPVI---LYSDHDEEVLRNGDLFKYSVIRDGCKNSAEIIDSTELEMDTEAIRLPIRDIALNTGSSKEQQQFGLDIGQVCFS* 1319          

HSP 2 Score: 153.295 bits (386), Expect = 9.712e-37
Identity = 104/299 (34.78%), Positives = 159/299 (53.18%), Query Frame = -3
Query:  115 QGPVGITGPRGDPG---------IPGAIGPTGLSGKKGSNXXXXXXXXXXXXXXXXXXXGIMAMQLRSPTKGVVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWN-SPITKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIELKHLASLLPIRDIGXXXXXNR-KSKFGLTIEEVCFS* 978
            QG VG TG +G PG         +PG  GP GL                    P  P    +   +R+  + ++  DD  A + LG++ +K P GTK++PARTCKQL   NPN+ DG Y+IDPNGG+  DA +V C+   +++CI+P +    ++ W+ S +T+  +W   +  + +  Y I   QL FLK++S  A QRIT  C    ++G+ E  + V+   L +D D  +       KY+VI+D C+ S      + +E+   A  LPIRDI +N  +++ + +FGL I +VCFS*
Sbjct: 1038 QGSVGPTGTKGFPGENGSPGPVGMPGRDGPAGLP-----------GPVGNTGPPGPPGPPAVFFPVRTVRRDLLN-DDALAVKYLGSDVVKKPLGTKDIPARTCKQLLDANPNLQDGFYYIDPNGGKADDAFRVLCRSQRKESCIEPKSPSYKLKHWDASDVTEYRSWFGEITGTFKFDYQIEASQLMFLKLFSTNARQRITINCSRLSVVGNSE--HPVI---LYSDHDEEVLRNGDLFKYSVIRDGCKNSAEIIDSTELEMDTEAIRLPIRDIALNTGSSKEQQQFGLDIGQVCFS* 1319          

HSP 3 Score: 52.373 bits (124), Expect = 4.820e-6
Identity = 70/198 (35.35%), Positives = 82/198 (41.41%), Query Frame = 3
Query: 4788 VGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXINGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQG-LTGPRGLRGSSGNPGDSGPKGS------------------------QGDQGPIGLEGKSGP---------FGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDG 5279
            +G  G  G  GIVGPPGPEG  G  GLP                                 G PG  G  GP G  GK+G+ G +G V   G  G  +GPRG  G SG PG SGPKG                         +G++GP G++G  GP          GP+G  G  G PG  G  G IGP G IGP+G
Sbjct:  611 IGVPGIPGKPGIVGPPGPEGFKGDKGLP---------------------------------GNPGTPGVIGPQGLRGKRGKAGGLGKVSFTGRSGGQSGPRGKPGPSGKPGTSGPKGVAGPPGPMGEPGPTGPTGPIGSIGLKGERGPNGIDGSIGPPGRNGAPGAVGPQGLIGLPGTPGTTGSVGEIGPPGQIGPNG 775          

HSP 4 Score: 52.373 bits (124), Expect = 4.820e-6
Identity = 70/198 (35.35%), Positives = 82/198 (41.41%), Query Frame = -3
Query: 2992 VGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXGINGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQG-LTGPRGLRGSSGNPGDSGPKGS------------------------QGDQGPIGLEGKSGP---------FGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDG 3483
            +G  G  G  GIVGPPGPEG  G  GLP                                 G PG  G  GP G  GK+G+ G +G V   G  G  +GPRG  G SG PG SGPKG                         +G++GP G++G  GP          GP+G  G  G PG  G  G IGP G IGP+G
Sbjct:  611 IGVPGIPGKPGIVGPPGPEGFKGDKGLP---------------------------------GNPGTPGVIGPQGLRGKRGKAGGLGKVSFTGRSGGQSGPRGKPGPSGKPGTSGPKGVAGPPGPMGEPGPTGPTGPIGSIGLKGERGPNGIDGSIGPPGRNGAPGAVGPQGLIGLPGTPGTTGSVGEIGPPGQIGPNG 775          
BLAST of collagen vs. Smed Unigenes AA
Match: SMU15040144 (dd_smedV4_740_0_1|m.36260|m.3408)

HSP 1 Score: 147.517 bits (371), Expect = 4.887e-35
Identity = 114/289 (39.45%), Positives = 166/289 (57.44%), Query Frame = 3
Query: 7296 GPVGITGPRGDPGIPGAIGPTGLSGKKGSNXXXXXXXXXXXXXXXXXXXXIMAMQLRSPTKGVVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWN-SPITKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLL-ADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIELKHLASLLPIRDIGXXXXXXRKSKFGLTIEEVCFS* 8156
            GP G TGP G+ G+   +GP G++G+ G +GP G +G  GPPGPPGPP  ++ + +  P +  +Y DD  AA +LG++ I  P GTK++PAR+C  L S + ++ DG Y+IDPNGG+ +DA +VFCK+   +TCI P     S  S++ +PI K +     L       Y I N QL FLK+ S  A Q I  AC N  ++       K   P ++  D++  +      L Y VIKDECQ   S EAE+++ +   +  LPIRD+     ++  S+F L + +VCFS*
Sbjct: 1043 GPKGPTGPNGEMGL---MGPMGVTGRDGPSGPHGLMGNAGPPGPPGPPAMMLPI-IYDPNR-PMYSDDANAANILGSDTISVPLGTKDLPARSCNHLKSTSSHLKDGTYFIDPNGGKMNDAFEVFCKMETGETCISPKQSSFSKISYSENPINK-YISYGELSGIQRFDYVIDNTQLMFLKMVSTRANQEIKIACNNMAVV------EKTEYPAIIFTDNNRELTKDDHHLSYKVIKDECQNMSSEEAETVLLVSGDSKRLPIRDL-TLGSDSDISEFRLKLSKVCFS* 1318          

HSP 2 Score: 147.517 bits (371), Expect = 4.887e-35
Identity = 114/289 (39.45%), Positives = 166/289 (57.44%), Query Frame = -3
Query:  115 GPVGITGPRGDPGIPGAIGPTGLSGKKGSNXXXXXXXXXXXXXXXXXXXGIMAMQLRSPTKGVVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWN-SPITKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLL-ADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIELKHLASLLPIRDIGXXXXXNRKSKFGLTIEEVCFS* 975
            GP G TGP G+ G+   +GP G++G+ G +GP G +G  GPPGPPGPP  ++ + +  P +  +Y DD  AA +LG++ I  P GTK++PAR+C  L S + ++ DG Y+IDPNGG+ +DA +VFCK+   +TCI P     S  S++ +PI K +     L       Y I N QL FLK+ S  A Q I  AC N  ++       K   P ++  D++  +      L Y VIKDECQ   S EAE+++ +   +  LPIRD+     ++  S+F L + +VCFS*
Sbjct: 1043 GPKGPTGPNGEMGL---MGPMGVTGRDGPSGPHGLMGNAGPPGPPGPPAMMLPI-IYDPNR-PMYSDDANAANILGSDTISVPLGTKDLPARSCNHLKSTSSHLKDGTYFIDPNGGKMNDAFEVFCKMETGETCISPKQSSFSKISYSENPINK-YISYGELSGIQRFDYVIDNTQLMFLKMVSTRANQEIKIACNNMAVV------EKTEYPAIIFTDNNRELTKDDHHLSYKVIKDECQNMSSEEAETVLLVSGDSKRLPIRDL-TLGSDSDISEFRLKLSKVCFS* 1318          

HSP 3 Score: 59.3066 bits (142), Expect = 3.677e-8
Identity = 49/100 (49.00%), Positives = 58/100 (58.00%), Query Frame = 3
Query: 4971 GKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIG 5270
            GKPG  G+ G  G +GK G  GE G  G +GPQGL GP G +G  G PGD G  G +G  G +G +G  GP GP G  G+ G  GP G  G  GP+GP G
Sbjct:  974 GKPGKPGKEGKPGKDGKTGPVGEPGHPGWMGPQGLLGPPGPQGDRGKPGDPGSPGIEGSPGDVGDQGVPGPKGPTGPNGEMGLMGPMGVTGRDGPSGPHG 1073          

HSP 4 Score: 59.3066 bits (142), Expect = 3.677e-8
Identity = 49/100 (49.00%), Positives = 58/100 (58.00%), Query Frame = -3
Query: 3001 GKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIG 3300
            GKPG  G+ G  G +GK G  GE G  G +GPQGL GP G +G  G PGD G  G +G  G +G +G  GP GP G  G+ G  GP G  G  GP+GP G
Sbjct:  974 GKPGKPGKEGKPGKDGKTGPVGEPGHPGWMGPQGLLGPPGPQGDRGKPGDPGSPGIEGSPGDVGDQGVPGPKGPTGPNGEMGLMGPMGVTGRDGPSGPHG 1073          
The following BLAST results are available for this feature:
BLAST of collagen vs. RefSeq Human
Analysis Date: 2016-03-08 (Girardia Sp. BLASTX Human)
Total hits: 5
Match Name                          E-value     Identity (%)  Description
gi|767973237|ref|XP_011536230.1|    4.849e-25   32.42         PREDICTED: collagen alpha-1(II) chain isoform X1 [...
gi|767973239|ref|XP_011536231.1|    4.849e-25   32.42         PREDICTED: collagen alpha-1(II) chain isoform X1 [...
gi|767973241|ref|XP_011536232.1|    4.849e-25   32.42         PREDICTED: collagen alpha-1(II) chain isoform X1 [...
gi|767973243|ref|XP_011536233.1|    4.849e-25   32.42         PREDICTED: collagen alpha-1(II) chain isoform X1 [...
gi|767973245|ref|XP_011536234.1|    4.849e-25   32.42         PREDICTED: collagen alpha-1(II) chain isoform X1 [...
BLAST of collagen vs. uniprot
Analysis Date: 2016-03-08 (Girardia Sp. BLASTX Swissprot Uniprot)
Total hits: 5
Match Name                             E-value     Identity (%)  Description
gi|18202526|sp|Q28668|CO1A2_RABIT      1.693e-29   36.51         RecName: Full=Collagen alpha-2(I) chain; AltName: ...
gi|115286|sp|P02460|CO2A1_CHICK        3.468e-26   32.73         RecName: Full=Collagen alpha-1(II) chain; AltName:...
gi|82202407|sp|Q6P4Z2|CO2A1_XENTR      4.162e-26   35.29         RecName: Full=Collagen alpha-1(II) chain; AltName:...
gi|146286085|sp|Q91717|CO2A1_XENLA     1.212e-25   35.40         RecName: Full=Collagen alpha-1(II) chain; AltName:...
gi|8039779|sp|P02465|CO1A2_BOVIN       1.361e-25   34.67         RecName: Full=Collagen alpha-2(I) chain; AltName: ...
BLAST of collagen vs. RefSeq Drosophila melanogaster
Analysis Date: 2016-03-09 (Girardia Sp. BLASTX Drosophila melanogaster)
Total hits: 5
Match Name                          E-value     Identity (%)  Description
gi|24581820|ref|NP_723044.1|        4.632e-7    41.43         collagen type IV, isoform A [Drosophila melanogast...
gi|24581822|ref|NP_723045.1|        4.632e-7    41.43         collagen type IV, isoform B [Drosophila melanogast...
gi|24581824|ref|NP_723046.1|        4.632e-7    41.43         collagen type IV, isoform C [Drosophila melanogast...
gi|442619464|ref|NP_001262641.1|    2.133e-6    47.52         CG42342, isoform T [Drosophila melanogaster]
gi|442619462|ref|NP_001247141.2|    2.564e-6    47.52         CG42342, isoform S [Drosophila melanogaster]
BLAST of collagen vs. Smed Unigenes AA
Analysis Date: 2016-03-09 (Girardia Sp. BLASTX Schmidtea mediterranea)
Total hits: 5
Match Name     E-value      Identity (%)  Description
SMU15040033    6.448e-129   81.60         dd_smedV4_702_0_1|m.35199|m.6295
SMU15002271    4.276e-65    50.43         Asxlregen_comp67208_c0_seq1|m.27270|m.10319
SMU15036469    2.105e-41    39.56         dd_smedV4_1070_0_1|m.1593|m.7941
SMU15029208    9.712e-37    34.78         SmedSxlregen_c102983_g1_i1|m.43148|m.11576
SMU15040144    4.887e-35    39.45         dd_smedV4_740_0_1|m.36260|m.3408
Sequences
The following sequences are available for this feature:

mRNA sequence

>Gsp_012135 ID=Gsp_012135|Name=collagen|organism=Girardia sp.|type=mRNA|length=8270bp
TTTTTTTTGAAAAAAATCAATACGTTTATTATTGATTAATTACAGAACAA
AATAAATTTCATTTAATATACAACTATTAGCAAAAATGTAAATTGATTTT
TTTAAAGTCAGAAATTATGAAAAGCAGACTTCTTCTATTGTCAATCCAAA
TTTACTCTTTCTATTATTGATAATATTGATGCCAATATCACGAATCGGTA
GTAAGCTCGCCAAATGTTTCAATTCTATTATACTTTCAGCTTCACTGGAT
TTGGAATATTGACACTCATCTTTGATAACTGTGTATTTCAATTTTGATTG
TGTCATTTTAATTATGGTATCATCATCAGCTAATAAACGGGGTGCCATTA
CTTTATTTAATTTTTCTTCACTGCCAATTATCGGATGATTTTCACAAGCA
AATGTGATTCTTTGAGTGGCTGTGTCACTATAAATTTTGAGAAATGCGAG
TTGACCATTTGGTATTGAATATTGGATCTCGTCTGAATTTAATAATGATT
TCAACCATGTGTGCCCTTTGGTAATCGGACTATTCCATGATCGAATACTA
ATTTCTTGATTTAATGGTTTAATACAGGTTTTTTCTTCAGATATTTTACA
AAAGACTTTGACAGCATCACTTGTCCTACCACCATTTGGATCTATCCAAT
ATTCGCCATCAGGAATATTTGGATTTTCTGAACTCAACTGTTTACATGTT
CTAGCTGGAACTTCTTTGGTGCCTTGAGGATTTTTAATGGCATTATTTCC
AAGAAGTTCAGCGGCAGCAGGATCATCTCCATAAACAACACCCTTTGTAG
GAGAACGTAGTTGCATAGCCATAATACCACCTGGTGGTCCAGGAGGACCT
GGAGGTCCTAATGGTCCCAATTGGCCTAAAGGACCATTTGAACCTTTCTT
ACCCGAAAGACCAGTTGGACCAATAGCTCCTGGTATTCCAGGATCACCAC
GGGGACCAGTTATTCCAACTGGACCTTGTATTCCATCAGGACCCTTAGGT
CCAACAGGCCCATCTGGACCAGGAAAACCTGGTGGTCCTGGATCTCCTTC
TATCCCTATTAATCCTGGATATCCAGGGACTCCTTTTGGTCCTTTTACAC
CTTTTGCACCTCTAGGACCCGGTCTTCCTTCACGTCCTGGTACTCCATCT
TTTCCTTTTTTTCCTTTTGCTCCTCTCAAACCAACATCTCCAAGTACACC
GCCTGGTCCAGGCTTACCATCTGCACCTTTATCACCAGGAGTTCCTAACT
TTCCTGATTTTCCGCTAGTTCCTCCTTGGCCATCATATCCTTTCCCACCA
CTTGTTCCATCTTTACCTGGACTGCCTGCAGCTCCTTTACTACCAGGAGG
TCCAGGGTCACCTCGTGGACCTACTGGACCAGTAATTCCAGGATCTCCAA
CCACACCGACCGGACCAGGTTTGCCCATTTCTCCATTATCTCCTATTGGA
CCACGATTTCCTGGAGGACCTTTTTTTCCTGGTTTTCCTGTTATCCCCTG
TGGACCCGAGGGTCCTGGTGGACCCAAATATCCAGGATAGCCGTCAGGAC
CAGATGTACCAACTTTTCCATCAGTACCAGGAGCTCCGGGTTTACCTTTT
GGACCTACTGGACCTCTTAAACCTAAAGGACCTTTTCGACCACCTGATCC
ATCTAAACCTGTAACTCCTTGAGGACCAAGAGGTCCAGGTTTTCCAGCAG
GCCCAGGTTTACCTTGAATACCAGATTTTCCTTGTTTCCCTGGACCTCCT
GGACTACCTTGTCTGCCAGGACCACCGCGCGGCCCACCTGGGCCAGATTT
TCCAGGATCACCAACATCTCCGGGTGGACCAGAAGCACCTTCTGGGCCAC
TTAATCCTTCAGGTCCTTGTTTTCCTGGTGCTCCAGGCTTGCCTGGCTTT
CCGTCTGGTCCAGAATCACCAGAAGCACCAATAGGTCCCATATCTCCAGG
ATCCCCAGGGTCGCCAGGATTACCTTCTGGACCTTGTTTTCCAGGTGGTC
CCAGATCACCGGGTAATCCATCCATACCTGGCTTTCCAACTGGTCCTCTA
ATACCAGCAGCTCCAACACTGCGAGGTTTGTGTTTTTCTACTCCTTCTCC
TCCTGCAGATCCTGGTGAACCATTGTCCCCTGGAGGACCTCTTGGCCCAT
CTGGTCCTGGACTGCCAAGCAAACCTGGTGCACCAACCGGACCTTCGGGT
CCTTGTTTTCCCAATGGACCGAGAGCACCTGGTGGGCCATTCGGCCCATA
ATTTCCTGGAATTCCAACTATACCAGGAGCTCCTTTTAATCCATTTGGTC
CAGGAGGACCAGGATTTCCACTGTCACCAATAGGACCAGGAACTCCCGCT
TTACCAACAGGACCTACCGGTCCAGATTTTCCAGGATTTCCTTCAGTTCC
AGGTGCACCATTTTTACCGCTTATACCTTGATTTCCAGGATATCCAGGAA
TTCCAATAGCACCTGCTGGACCAGGAGGTCCACTGGCTCCAGGATCTCCA
TTTGGTCCTGGGAAACCAGGACTACCAATATTTCCACTCAGGCCTTCACT
GCCATCAGCCCCATTCTTTCCAGCTGGTCCAGGTTTTCCAGAATTTCCAG
TTTTGCCTTTTGGTCCTGGTTCACCGATGTTTCCTTTATTTCCAGGTGCT
CCAACATTGCCTGGTGGTCCTCTTAATCCTGGAAGTCCTAGAGGGCCTAT
CATACCATCTTTCCCGGGATTGCCTGGGGGTCCACGTAATCCAGAAGGTC
CTATCGGTCCTAAGGCACCTTTATTTCCAGATTTTCCATCAGGACCTGGG
GGTCCAATTTCTCCTCTGGGTCCAGGAGCTCCATTCGGACCAGGTCTTCC
TAAAGCACCTGTCGAACCACCAGGACCAGAGTTTCCAGGATGACCTGGAT
TTCCAGGTGGACCATCAGCTCCTGGTTTCCCATTTGGTCCTGGACTTCCG
GGACTACCATTAGGTCCGCTTTCACCTCTTGATCCTTTAACTCCATCTGG
GCCTATTGGACCAGTTGGTCCAATCAGTCCGGCTTCTCCCTGGGGTCCTG
GACGACCTCTATCTCCTTTTTGTCCTTTCGGACCAAATGGTCCGGATTTT
CCTTCCAATCCGATTGGACCTTGATCTCCTTGAGAACCTTTAGGACCAGA
ATCTCCTGGATTACCTGATGATCCACGCAACCCTCGTGGTCCTGTTAATC
CTTGTGGACCAAGCAATCCAACATCACCAATTTCACCTTTTCTTCCTTGT
TTCCCATTATTACCATCAGGACCTGTTCGTCCCCTTAAACCTGGCTTTCC
AGATTTACCATTTATTCCAGTTTTACCGCTTTCTCCTTTCCCACCTTTTG
CTCCTTCATCTCCTCCTTCACCCTTTTTGCCTATTTCACCAGGTCGGCCT
GGTAATCCTGGAGAACCAATTCTACCTTCTGGACCAGGTGGACCTACAAT
TCCATCAGGTCCATGAGTACCTCTTTCTCCGACTGGACCTTGTGGACCAG
TAGGACCATTTTGACCAGGTTGACCAATTGGTCCAGGAGGACCTGGTGGA
CCAATATCTCCTGTATCACCCTGTTCACCAGTACCCCCACTGTCTCCTTT
TTCTCCAGATGGTCCTGGATATCCTACTGGACCTGTTAAACCTTCTTTAC
CTATAGGCCCAGGTGGACCTTGTTCACCTCTTGATCCAGGCGGCCCTGAA
ACACCACCAGCTCCAGATTTACCTTCAGGCCCAACAGAACCAGGTGGACC
CTGCAACCCAGGTGCACCTCTTTGTCCATCATTTCCAGGAGGACCTGTTG
GACCCTGTGCACCAGGATCTCCATCAGGACCTGGTTTCCCCCTCTTTCCA
GGATTTCCTCTGATTCCCATGGGACCAGTAGCTTCATTTAAAGTTCTGAA
TTGACCATGTACACATTCAATATAAATTAAAACCAATAAAATAGCTCCAG
AAATAATTGAGATTTTGAGCATTTTAAGTTAAGCAGTTCAAGATAGAACT
GTTAAAAGAAATATTTCCAATTTGTCTTAATTGATATCGGTTTTTGAATA
ATAATTCCTTTAATTTTTGTTAAAAATAATTATAAAGAAAATTTAATTCT
ACCGAATAATTTTGAAAATATATGCAAAAGATTTGCAAATCTTTTGCATA
TATTTTCAAAATTATTCGGTAGAATTAAATTTTCTTTATAATTATTTTTA
ACAAAAATTAAAGGAATTATTATTCAAAAACCGATATCAATTAAGACAAA
TTGGAAATATTTCTTTTAACAGTTCTATCTTGAACTGCTTAACTTAAAAT
GCTCAAAATCTCAATTATTTCTGGAGCTATTTTATTGGTTTTAATTTATA
TTGAATGTGTACATGGTCAATTCAGAACTTTAAATGAAGCTACTGGTCCC
ATGGGAATCAGAGGAAATCCTGGAAAGAGGGGGAAACCAGGTCCTGATGG
AGATCCTGGTGCACAGGGTCCAACAGGTCCTCCTGGAAATGATGGACAAA
GAGGTGCACCTGGGTTGCAGGGTCCACCTGGTTCTGTTGGGCCTGAAGGT
AAATCTGGAGCTGGTGGTGTTTCAGGGCCGCCTGGATCAAGAGGTGAACA
AGGTCCACCTGGGCCTATAGGTAAAGAAGGTTTAACAGGTCCAGTAGGAT
ATCCAGGACCATCTGGAGAAAAAGGAGACAGTGGGGGTACTGGTGAACAG
GGTGATACAGGAGATATTGGTCCACCAGGTCCTCCTGGACCAATTGGTCA
ACCTGGTCAAAATGGTCCTACTGGTCCACAAGGTCCAGTCGGAGAAAGAG
GTACTCATGGACCTGATGGAATTGTAGGTCCACCTGGTCCAGAAGGTAGA
ATTGGTTCTCCAGGATTACCAGGCCGACCTGGTGAAATAGGCAAAAAGGG
TGAAGGAGGAGATGAAGGAGCAAAAGGTGGGAAAGGAGAAAGCGGTAAAA
CTGGAATAAATGGTAAATCTGGAAAGCCAGGTTTAAGGGGACGAACAGGT
CCTGATGGTAATAATGGGAAACAAGGAAGAAAAGGTGAAATTGGTGATGT
TGGATTGCTTGGTCCACAAGGATTAACAGGACCACGAGGGTTGCGTGGAT
CATCAGGTAATCCAGGAGATTCTGGTCCTAAAGGTTCTCAAGGAGATCAA
GGTCCAATCGGATTGGAAGGAAAATCCGGACCATTTGGTCCGAAAGGACA
AAAAGGAGATAGAGGTCGTCCAGGACCCCAGGGAGAAGCCGGACTGATTG
GACCAACTGGTCCAATAGGCCCAGATGGAGTTAAAGGATCAAGAGGTGAA
AGCGGACCTAATGGTAGTCCCGGAAGTCCAGGACCAAATGGGAAACCAGG
AGCTGATGGTCCACCTGGAAATCCAGGTCATCCTGGAAACTCTGGTCCTG
GTGGTTCGACAGGTGCTTTAGGAAGACCTGGTCCGAATGGAGCTCCTGGA
CCCAGAGGAGAAATTGGACCCCCAGGTCCTGATGGAAAATCTGGAAATAA
AGGTGCCTTAGGACCGATAGGACCTTCTGGATTACGTGGACCCCCAGGCA
ATCCCGGGAAAGATGGTATGATAGGCCCTCTAGGACTTCCAGGATTAAGA
GGACCACCAGGCAATGTTGGAGCACCTGGAAATAAAGGAAACATCGGTGA
ACCAGGACCAAAAGGCAAAACTGGAAATTCTGGAAAACCTGGACCAGCTG
GAAAGAATGGGGCTGATGGCAGTGAAGGCCTGAGTGGAAATATTGGTAGT
CCTGGTTTCCCAGGACCAAATGGAGATCCTGGAGCCAGTGGACCTCCTGG
TCCAGCAGGTGCTATTGGAATTCCTGGATATCCTGGAAATCAAGGTATAA
GCGGTAAAAATGGTGCACCTGGAACTGAAGGAAATCCTGGAAAATCTGGA
CCGGTAGGTCCTGTTGGTAAAGCGGGAGTTCCTGGTCCTATTGGTGACAG
TGGAAATCCTGGTCCTCCTGGACCAAATGGATTAAAAGGAGCTCCTGGTA
TAGTTGGAATTCCAGGAAATTATGGGCCGAATGGCCCACCAGGTGCTCTC
GGTCCATTGGGAAAACAAGGACCCGAAGGTCCGGTTGGTGCACCAGGTTT
GCTTGGCAGTCCAGGACCAGATGGGCCAAGAGGTCCTCCAGGGGACAATG
GTTCACCAGGATCTGCAGGAGGAGAAGGAGTAGAAAAACACAAACCTCGC
AGTGTTGGAGCTGCTGGTATTAGAGGACCAGTTGGAAAGCCAGGTATGGA
TGGATTACCCGGTGATCTGGGACCACCTGGAAAACAAGGTCCAGAAGGTA
ATCCTGGCGACCCTGGGGATCCTGGAGATATGGGACCTATTGGTGCTTCT
GGTGATTCTGGACCAGACGGAAAGCCAGGCAAGCCTGGAGCACCAGGAAA
ACAAGGACCTGAAGGATTAAGTGGCCCAGAAGGTGCTTCTGGTCCACCCG
GAGATGTTGGTGATCCTGGAAAATCTGGCCCAGGTGGGCCGCGCGGTGGT
CCTGGCAGACAAGGTAGTCCAGGAGGTCCAGGGAAACAAGGAAAATCTGG
TATTCAAGGTAAACCTGGGCCTGCTGGAAAACCTGGACCTCTTGGTCCTC
AAGGAGTTACAGGTTTAGATGGATCAGGTGGTCGAAAAGGTCCTTTAGGT
TTAAGAGGTCCAGTAGGTCCAAAAGGTAAACCCGGAGCTCCTGGTACTGA
TGGAAAAGTTGGTACATCTGGTCCTGACGGCTATCCTGGATATTTGGGTC
CACCAGGACCCTCGGGTCCACAGGGGATAACAGGAAAACCAGGAAAAAAA
GGTCCTCCAGGAAATCGTGGTCCAATAGGAGATAATGGAGAAATGGGCAA
ACCTGGTCCGGTCGGTGTGGTTGGAGATCCTGGAATTACTGGTCCAGTAG
GTCCACGAGGTGACCCTGGACCTCCTGGTAGTAAAGGAGCTGCAGGCAGT
CCAGGTAAAGATGGAACAAGTGGTGGGAAAGGATATGATGGCCAAGGAGG
AACTAGCGGAAAATCAGGAAAGTTAGGAACTCCTGGTGATAAAGGTGCAG
ATGGTAAGCCTGGACCAGGCGGTGTACTTGGAGATGTTGGTTTGAGAGGA
GCAAAAGGAAAAAAAGGAAAAGATGGAGTACCAGGACGTGAAGGAAGACC
GGGTCCTAGAGGTGCAAAAGGTGTAAAAGGACCAAAAGGAGTCCCTGGAT
ATCCAGGATTAATAGGGATAGAAGGAGATCCAGGACCACCAGGTTTTCCT
GGTCCAGATGGGCCTGTTGGACCTAAGGGTCCTGATGGAATACAAGGTCC
AGTTGGAATAACTGGTCCCCGTGGTGATCCTGGAATACCAGGAGCTATTG
GTCCAACTGGTCTTTCGGGTAAGAAAGGTTCAAATGGTCCTTTAGGCCAA
TTGGGACCATTAGGACCTCCAGGTCCTCCTGGACCACCAGGTGGTATTAT
GGCTATGCAACTACGTTCTCCTACAAAGGGTGTTGTTTATGGAGATGATC
CTGCTGCCGCTGAACTTCTTGGAAATAATGCCATTAAAAATCCTCAAGGC
ACCAAAGAAGTTCCAGCTAGAACATGTAAACAGTTGAGTTCAGAAAATCC
AAATATTCCTGATGGCGAATATTGGATAGATCCAAATGGTGGTAGGACAA
GTGATGCTGTCAAAGTCTTTTGTAAAATATCTGAAGAAAAAACCTGTATT
AAACCATTAAATCAAGAAATTAGTATTCGATCATGGAATAGTCCGATTAC
CAAAGGGCACACATGGTTGAAATCATTATTAAATTCAGACGAGATCCAAT
ATTCAATACCAAATGGTCAACTCGCATTTCTCAAAATTTATAGTGACACA
GCCACTCAAAGAATCACATTTGCTTGTGAAAATCATCCGATAATTGGCAG
TGAAGAAAAATTAAATAAAGTAATGGCACCCCGTTTATTAGCTGATGATG
ATACCATAATTAAAATGACACAATCAAAATTGAAATACACAGTTATCAAA
GATGAGTGTCAATATTCCAAATCCAGTGAAGCTGAAAGTATAATAGAATT
GAAACATTTGGCGAGCTTACTACCGATTCGTGATATTGGCATCAATATTA
TCAATAATAGAAAGAGTAAATTTGGATTGACAATAGAAGAAGTCTGCTTT
TCATAATTTCTGACTTTAAAAAAATCAATTTACATTTTTGCTAATAGTTG
TATATTAAATGAAATTTATTTTGTTCTGTAATTAATCAATAATAAACGTA
TTGATTTTTTTCAAAAAAAA

protein sequence

>Gsp_012135-protein ID=Gsp_012135-protein|Name=collagen|organism=Girardia sp.|type=polypeptide|length=1286bp
MLKISIISGAILLVLIYIECVHGQFRTLNEATGPMGIRGNPGKRGKPGPD
GDPGAQGPTGPPGNDGQRGAPGLQGPPGSVGPEGKSGAGGVSGPPGSRGE
QGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEQGDTGDIGPPGPPGPIG
QPGQNGPTGPQGPVGERGTHGPDGIVGPPGPEGRIGSPGLPGRPGEIGKK
GEGGDEGAKGGKGESGKTGINGKSGKPGLRGRTGPDGNNGKQGRKGEIGD
VGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKG
QKGDRGRPGPQGEAGLIGPTGPIGPDGVKGSRGESGPNGSPGSPGPNGKP
GADGPPGNPGHPGNSGPGGSTGALGRPGPNGAPGPRGEIGPPGPDGKSGN
KGALGPIGPSGLRGPPGNPGKDGMIGPLGLPGLRGPPGNVGAPGNKGNIG
EPGPKGKTGNSGKPGPAGKNGADGSEGLSGNIGSPGFPGPNGDPGASGPP
GPAGAIGIPGYPGNQGISGKNGAPGTEGNPGKSGPVGPVGKAGVPGPIGD
SGNPGPPGPNGLKGAPGIVGIPGNYGPNGPPGALGPLGKQGPEGPVGAPG
LLGSPGPDGPRGPPGDNGSPGSAGGEGVEKHKPRSVGAAGIRGPVGKPGM
DGLPGDLGPPGKQGPEGNPGDPGDPGDMGPIGASGDSGPDGKPGKPGAPG
KQGPEGLSGPEGASGPPGDVGDPGKSGPGGPRGGPGRQGSPGGPGKQGKS
GIQGKPGPAGKPGPLGPQGVTGLDGSGGRKGPLGLRGPVGPKGKPGAPGT
DGKVGTSGPDGYPGYLGPPGPSGPQGITGKPGKKGPPGNRGPIGDNGEMG
KPGPVGVVGDPGITGPVGPRGDPGPPGSKGAAGSPGKDGTSGGKGYDGQG
GTSGKSGKLGTPGDKGADGKPGPGGVLGDVGLRGAKGKKGKDGVPGREGR
PGPRGAKGVKGPKGVPGYPGLIGIEGDPGPPGFPGPDGPVGPKGPDGIQG
PVGITGPRGDPGIPGAIGPTGLSGKKGSNGPLGQLGPLGPPGPPGPPGGI
MAMQLRSPTKGVVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSEN
PNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPI
TKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIG
SEEKLNKVMAPRLLADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIE
LKHLASLLPIRDIGINIINNRKSKFGLTIEEVCFS*

coding sequence

>Gsp_012135.4299.8156 ID=Gsp_012135.4299.8156|Name=Gsp_012135.4299.8156|organism=Girardia sp.|type=CDS|length=3858bp
ATGCTCAAAATCTCAATTATTTCTGGAGCTATTTTATTGGTTTTAATTTA
TATTGAATGTGTACATGGTCAATTCAGAACTTTAAATGAAGCTACTGGTC
CCATGGGAATCAGAGGAAATCCTGGAAAGAGGGGGAAACCAGGTCCTGAT
GGAGATCCTGGTGCACAGGGTCCAACAGGTCCTCCTGGAAATGATGGACA
AAGAGGTGCACCTGGGTTGCAGGGTCCACCTGGTTCTGTTGGGCCTGAAG
GTAAATCTGGAGCTGGTGGTGTTTCAGGGCCGCCTGGATCAAGAGGTGAA
CAAGGTCCACCTGGGCCTATAGGTAAAGAAGGTTTAACAGGTCCAGTAGG
ATATCCAGGACCATCTGGAGAAAAAGGAGACAGTGGGGGTACTGGTGAAC
AGGGTGATACAGGAGATATTGGTCCACCAGGTCCTCCTGGACCAATTGGT
CAACCTGGTCAAAATGGTCCTACTGGTCCACAAGGTCCAGTCGGAGAAAG
AGGTACTCATGGACCTGATGGAATTGTAGGTCCACCTGGTCCAGAAGGTA
GAATTGGTTCTCCAGGATTACCAGGCCGACCTGGTGAAATAGGCAAAAAG
GGTGAAGGAGGAGATGAAGGAGCAAAAGGTGGGAAAGGAGAAAGCGGTAA
AACTGGAATAAATGGTAAATCTGGAAAGCCAGGTTTAAGGGGACGAACAG
GTCCTGATGGTAATAATGGGAAACAAGGAAGAAAAGGTGAAATTGGTGAT
GTTGGATTGCTTGGTCCACAAGGATTAACAGGACCACGAGGGTTGCGTGG
ATCATCAGGTAATCCAGGAGATTCTGGTCCTAAAGGTTCTCAAGGAGATC
AAGGTCCAATCGGATTGGAAGGAAAATCCGGACCATTTGGTCCGAAAGGA
CAAAAAGGAGATAGAGGTCGTCCAGGACCCCAGGGAGAAGCCGGACTGAT
TGGACCAACTGGTCCAATAGGCCCAGATGGAGTTAAAGGATCAAGAGGTG
AAAGCGGACCTAATGGTAGTCCCGGAAGTCCAGGACCAAATGGGAAACCA
GGAGCTGATGGTCCACCTGGAAATCCAGGTCATCCTGGAAACTCTGGTCC
TGGTGGTTCGACAGGTGCTTTAGGAAGACCTGGTCCGAATGGAGCTCCTG
GACCCAGAGGAGAAATTGGACCCCCAGGTCCTGATGGAAAATCTGGAAAT
AAAGGTGCCTTAGGACCGATAGGACCTTCTGGATTACGTGGACCCCCAGG
CAATCCCGGGAAAGATGGTATGATAGGCCCTCTAGGACTTCCAGGATTAA
GAGGACCACCAGGCAATGTTGGAGCACCTGGAAATAAAGGAAACATCGGT
GAACCAGGACCAAAAGGCAAAACTGGAAATTCTGGAAAACCTGGACCAGC
TGGAAAGAATGGGGCTGATGGCAGTGAAGGCCTGAGTGGAAATATTGGTA
GTCCTGGTTTCCCAGGACCAAATGGAGATCCTGGAGCCAGTGGACCTCCT
GGTCCAGCAGGTGCTATTGGAATTCCTGGATATCCTGGAAATCAAGGTAT
AAGCGGTAAAAATGGTGCACCTGGAACTGAAGGAAATCCTGGAAAATCTG
GACCGGTAGGTCCTGTTGGTAAAGCGGGAGTTCCTGGTCCTATTGGTGAC
AGTGGAAATCCTGGTCCTCCTGGACCAAATGGATTAAAAGGAGCTCCTGG
TATAGTTGGAATTCCAGGAAATTATGGGCCGAATGGCCCACCAGGTGCTC
TCGGTCCATTGGGAAAACAAGGACCCGAAGGTCCGGTTGGTGCACCAGGT
TTGCTTGGCAGTCCAGGACCAGATGGGCCAAGAGGTCCTCCAGGGGACAA
TGGTTCACCAGGATCTGCAGGAGGAGAAGGAGTAGAAAAACACAAACCTC
GCAGTGTTGGAGCTGCTGGTATTAGAGGACCAGTTGGAAAGCCAGGTATG
GATGGATTACCCGGTGATCTGGGACCACCTGGAAAACAAGGTCCAGAAGG
TAATCCTGGCGACCCTGGGGATCCTGGAGATATGGGACCTATTGGTGCTT
CTGGTGATTCTGGACCAGACGGAAAGCCAGGCAAGCCTGGAGCACCAGGA
AAACAAGGACCTGAAGGATTAAGTGGCCCAGAAGGTGCTTCTGGTCCACC
CGGAGATGTTGGTGATCCTGGAAAATCTGGCCCAGGTGGGCCGCGCGGTG
GTCCTGGCAGACAAGGTAGTCCAGGAGGTCCAGGGAAACAAGGAAAATCT
GGTATTCAAGGTAAACCTGGGCCTGCTGGAAAACCTGGACCTCTTGGTCC
TCAAGGAGTTACAGGTTTAGATGGATCAGGTGGTCGAAAAGGTCCTTTAG
GTTTAAGAGGTCCAGTAGGTCCAAAAGGTAAACCCGGAGCTCCTGGTACT
GATGGAAAAGTTGGTACATCTGGTCCTGACGGCTATCCTGGATATTTGGG
TCCACCAGGACCCTCGGGTCCACAGGGGATAACAGGAAAACCAGGAAAAA
AAGGTCCTCCAGGAAATCGTGGTCCAATAGGAGATAATGGAGAAATGGGC
AAACCTGGTCCGGTCGGTGTGGTTGGAGATCCTGGAATTACTGGTCCAGT
AGGTCCACGAGGTGACCCTGGACCTCCTGGTAGTAAAGGAGCTGCAGGCA
GTCCAGGTAAAGATGGAACAAGTGGTGGGAAAGGATATGATGGCCAAGGA
GGAACTAGCGGAAAATCAGGAAAGTTAGGAACTCCTGGTGATAAAGGTGC
AGATGGTAAGCCTGGACCAGGCGGTGTACTTGGAGATGTTGGTTTGAGAG
GAGCAAAAGGAAAAAAAGGAAAAGATGGAGTACCAGGACGTGAAGGAAGA
CCGGGTCCTAGAGGTGCAAAAGGTGTAAAAGGACCAAAAGGAGTCCCTGG
ATATCCAGGATTAATAGGGATAGAAGGAGATCCAGGACCACCAGGTTTTC
CTGGTCCAGATGGGCCTGTTGGACCTAAGGGTCCTGATGGAATACAAGGT
CCAGTTGGAATAACTGGTCCCCGTGGTGATCCTGGAATACCAGGAGCTAT
TGGTCCAACTGGTCTTTCGGGTAAGAAAGGTTCAAATGGTCCTTTAGGCC
AATTGGGACCATTAGGACCTCCAGGTCCTCCTGGACCACCAGGTGGTATT
ATGGCTATGCAACTACGTTCTCCTACAAAGGGTGTTGTTTATGGAGATGA
TCCTGCTGCCGCTGAACTTCTTGGAAATAATGCCATTAAAAATCCTCAAG
GCACCAAAGAAGTTCCAGCTAGAACATGTAAACAGTTGAGTTCAGAAAAT
CCAAATATTCCTGATGGCGAATATTGGATAGATCCAAATGGTGGTAGGAC
AAGTGATGCTGTCAAAGTCTTTTGTAAAATATCTGAAGAAAAAACCTGTA
TTAAACCATTAAATCAAGAAATTAGTATTCGATCATGGAATAGTCCGATT
ACCAAAGGGCACACATGGTTGAAATCATTATTAAATTCAGACGAGATCCA
ATATTCAATACCAAATGGTCAACTCGCATTTCTCAAAATTTATAGTGACA
CAGCCACTCAAAGAATCACATTTGCTTGTGAAAATCATCCGATAATTGGC
AGTGAAGAAAAATTAAATAAAGTAATGGCACCCCGTTTATTAGCTGATGA
TGATACCATAATTAAAATGACACAATCAAAATTGAAATACACAGTTATCA
AAGATGAGTGTCAATATTCCAAATCCAGTGAAGCTGAAAGTATAATAGAA
TTGAAACATTTGGCGAGCTTACTACCGATTCGTGATATTGGCATCAATAT
TATCAATAATAGAAAGAGTAAATTTGGATTGACAATAGAAGAAGTCTGCT
TTTCATAA
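
The CDS record above can be checked against the protein record with a short script. This is a minimal sketch, not the database's own translation pipeline. It assumes the standard genetic code (the page does not say which table was used) and assumes that the two numbers in the ID `Gsp_012135.4299.8156` are 1-based, inclusive coordinates on the mRNA. The function names `translate` and `span_length` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: the page does not state which translation table was used.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; codons not in the table become 'X'."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


def span_length(feature_id: str) -> int:
    """Length implied by an ID of the form '<name>.<start>.<end>'.

    Assumption: start and end are 1-based, inclusive mRNA coordinates.
    """
    _, start, end = feature_id.rsplit(".", 2)
    return int(end) - int(start) + 1


if __name__ == "__main__":
    print(translate("ATGCTCAAAATCTCAATT"))      # first 6 codons  -> MLKISI
    print(translate("GAAGAAGTCTGCTTTTCATAA"))   # last 7 codons   -> EEVCFS*
    print(span_length("Gsp_012135.4299.8156"))  # -> 3858
```

Under these assumptions the coordinate span gives 3858 nt, which agrees with the `length=3858bp` in the CDS header, and 3858 / 3 = 1286 codons, which agrees with the length in the protein header if that count includes the terminal `*`.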
Gene Groups
collagen is similar in sequence to the genes of this group: GG1008
Gene Name      Gene ID
SMU15002271    SMU15002271
SMU15040033    SMU15040033
Ddo_001537     Ddo_001537
collagen       Ddo_024746
collagen       Gsp_012135
Gsp_016934     Gsp_016934
Pgr_004907     Pgr_004907
collagen       Pgr_010667
Pmo_000089     Pmo_000089
collagen       Pmo_027332

Gene Group Protein Sequences

>SMU15002271
MWNSIFFSLLFVLCVSINARAEEKLNRLKRQATSPNQAKGSQGPRGDPGP
MGKPGPPGDPGALGPIGPPGRDGARGKSGKPGASGIPGKDGTPGSHGVVG
PIGPRGEPGLAGSIGPEGGTGPQGNRGLTGDKGDIGLAGLKGSNGEPGLQ
GPQGLRGPAGRVGPAGIQGPTGERGKQGTDGVPGSLGPQGAIGPPGQSGI
PGEIGNKGIRGESGIKGAKGDSGNPGLAGKTGPSGSLGPPGYPGVDGRPG
VRGEAGIVGPQGPVGKVGQRGQRGPSGNPGLSGPKGSQGEEGPIGIEGKQ
GSAGPKGQKGDPGRPGETGDEGPRGERGVVGPAGNKGSRGESGPDGSPGN
PGTDGIPGKDGLHGNPGSQGEVGSRGSPGAMGKLGLNGAPGPRGENGVFG
TNGHPGAKGAVGPKGNMGSPGVRGLPGNTGTMGVMGPGGIRGPLGPVGSP
GDKGSKGRTGITGVAGDSGDLGEQGPPGEDGSEGPSGSPGPMGFPGIAGK
IGQPGPIGPDGPPGPAGFEGTPGTNGKDGKAGKDGKPGPPGEVGPPGVKG
SIGPVGETGLMGKIGLRGPKGLDGIMGPPGTFGMNGPPGPSGEAGPSGAP
GKLGPVGITGSRGRPGPIGSSGEPGEKGPQGLPEVEKPLAGRSVGGVSGP
RGERGSPGDDGLPGPNGDPGPPGPIGMDGGRGDTGDRGLPGPAGDPGKDG
KPGEPGTPGIDGPPGQVGPEGPRGPSGETGEQGPQGLPGKPGDPGAEGPR
GSSGKQGFQGPTGPIGPTGKQGKQGRAGKSGKNGLTGRKGPAGQRGSIGA
RGKDGQTGENGRAGSAGADGFPGFPGPNGPPGPTGPDGKMGPIGPPGEIG
EHGEMGDPGPTGKEGLVGPQGNVGPPGPIGSPGNPGIAGPPGPQGKRGNR
GNTGFVGPAGPRGKRGPVGPGGEKGIPGSPGLEGQRGQIGVSGARGNNGN
NGKPGRTGRAGPAGTMGQKGVRGVKGFTGPNGLKGPQGPSGYPGEDGSPG
PMGLIGERGPQGITGKRGDRGDPGLLGPLGPPGNDGEFGRPGPQGPMGPP
GPPGPPGSSMPMGLRSPTKGLTFSDDPSVAHSFGNNAIITPRGTKEVPAR
SCKHLSEHNPDLSDGEYWIDPNGGRVSDAVPVYCRIATQQTCIKPISKIY
KTASWFKKYQKDHVWFQTINGIGEFEYDIENYQLNYLKALSETATQQISL
NCINQAIILDRQGKMSTVWTSLLGDDDTILSLQHPKRRFKVIKDECQYEK
FSEAETILEVRGKASRLPIKDVGLIIDSDRSRKVGIELGEVCYS*
>SMU15040033
MFKNSIFSGAILLILIYVDFSYGQFRTLNEATGPIGIRGNPGKRGKIGPD
GDPGSSGPPGPPGKDGLRGAPGPNGPAGGAGPDGKSGVTGNTGPPGSRGE
QGPPGPVGKEGLTGPNGYSGPSGEKGDSGSIGEQGDPGDIGPQGPAGPLG
PPGQSGPTGPQGTVGERGPHGPDGVVGPPGPEGRMGSPGSPGRPGELGKK
GEGGDEGLKGGKGENGKTGINGKSGKPGIRGPIGPVGINGKQGRKGELGD
IGLTGPQGLIGPRGVRGTVGNPGDNGPKGSQGDQGPIGLEGKPGPFGPKG
QKGDRGRPGPQGETGPLGPGGPIGPDGAKGSRGEIGPNGSPGTPGPNGKP
GATGPPGTPGHPGNAGPGGQAGPIGRPGPNGAPGPRGEIGPNGPDGKSGR
KGSLGPTGLTGLRGPQGNPGKDGTLGPLGTPGLRGPPGSIGTPGLKGNIG
PPGSKGKVGNAGKPGPLGKNGIDGSEGPIGNAGSPGFPGPNGDPGPNGPP
GSLGLAGLVGYPGNQGLAGKNGNPGVEGKPGKAGTPGSPGKPGVPGPVGD
VGNIGPPGPNGLKGAPGIYGVPGNYGPNGSPGDLGPLGKQGPEGLVGAPG
LAGSPGPDGPRGPPGANGSPGSAGGEGVEKHKPRSVGSAGIRGPEGKPGM
DGFPGDVGPPGKQGPEGGPGDPGDPGDMGPIGNSGDPGPDGKPGKNGAPG
KQGPEGLPGSEGASGPPGDVGDPGKSGPTGPRGGPGRIGSPGGQGKQGKS
GNQGKLGPSGKPGPVGSPGVTGLDGTIGRKGPLGLRGPSGPKGKSGAPGV
DGKVGTQGVNGYPGYLGPPGPPGPQGPNGKPGKPGPPGNVGQIGDHGEMG
NPGPQGSVGPPGATGPVGPRGDAGEPGRKGPIGPQGKNGTSGGKGYDGQS
GTSGKSGKVGTPGDKGGDGKPGPSGVLGDVGLRGAKGKKGKDGVPGREGR
PGARGSKGVKGPKGVPGYPGRPGVEGDPGPPGYPGPDGPVGPKGPDGLQG
PVGIIGPRGDPGIPGPIGPTGLHGKKGGIGIMGPVGPLGPPGPPGPPGGI
MAMQMRSPTKGVTYGDDPLAAELLGNNAIKNPEGTKEVPAITCKQLSVKH
PNLPDGEYWIDPNGGRVNDAVKVYCRISEQKTCIKPINNEISLRSWKSHS
ANGHTWLKSILNKEEIQYSIPNGQIAFLKVNSDSAVQRVTFTCENHPIIG
NEEKLNKVTAPRLLADDDTIIKMTHSHLKYTVIKDECQYSKSSEAESIIE
VRNYANLLPIRDIGVSIINNRKSKFGVTIEEVCFS*
>Ddo_001537 Ddo_001537
MHKFTFLFAIGVLCFSVFAENEKLHRLKRQATSPNQAKGSQGPRGDPGPM
GKPGPEGDSGMMGPVGPPGRDGARGKSGKPGLPGIPGKDGTPGSHGVSGP
MGPPGEIGVSGPVGPEGANGKQGNRGPTGDKGDMGLTGLKGSSGEPGLQG
PQGLRGPIGRVGPTGPTGPTGDRGKQGPDGIPGSVGPQGAIGPPGQPGIA
GEIGNKGVRGEAGIKGSKGDHGNPGMGGKLGSIGQPGPPGLPGVDGRPGI
RGEVGVPGLQGPEGKAGQRGQRGAPGNPGSQGPKGSQGEEGSPGIEGKPG
PPGIKGQKGDNGRPGENGDEGPRGERGLNGPSGNKGSQGESGPDGSPGNP
GTDGIPGKDGIAGSPGNEGEAGPKGPPGPMGKPGLNGAPGPRGENGVYGV
NGHPGAKGSVGPKGSIGSPGPRGLPGNNGPMGSMGPGGIRGPLGPVGSPG
EKGGKGRNGIPGASGDAGDIGELGPPGEDGSEGSSGSPGPAGFPGVSGKM
GQIGPIGVDGPPGPPGFEGTPGINGKDGKSGKDGKPGPPGEIGPPGINGS
TGPVGEIGPTGKIGLKGLKGADGIMGPPGTFGINGPPGLSGPMGPVGING
KRGPPGTPGKEGVPGAIGPQGIPGKRGSKGGDGVDKPLAGRSGGGPPGPK
GERGPQGEDGLSGPNGNPGPVGPPGIDGGRGDTGDRGLPGPIGDVGKDGK
PGEPGTPGQDGPPGPVGPEGPRGPPGETGDQGPQGLPGRPGESGLEGPRG
SPGKQGSQGSSGPVGAVGKPGKPGKPGKPGKDGMIGRKGPVGQRGPTGPR
GKDGPAGENGKPGSPGPDGFSGFPGSSGPPGPIGPDGKGGPPGPPGETGE
VGEMGDIGPLGKEGPQGPQGNDGPPGPIGSPGVPGNAGPPGPPGKRGNRG
NTGFVGGVGPRGKRGPVGADGEKGIPGQPGANGAKGQIGIPGPRGKLGNN
GKPGKIGRVGPPGTTGQKGTRGVKGFTGPNGLRGPQGLSGYPGEDGPPGP
IGLIGERGPQGVTGKRGDRGDPGAAGPNGPQGNDGEYGRPGPQGPVGPPG
PPGPPGSSMPMGLRSPTKGLTLADDPSVALSFGNNAIHTPRGTKDVPARS
CKLLSEINPTLPDGEYWIDPNGGRIDDAVKVYCKMSLAQTCIKPISKSFK
AKQWIRKYSKDHIWFQTTHEAGEFQYDIENYQLTYLKVHSDTATQQISLS
CINQAIVLDKNGKLSNASTSLLGDDDTILSLRHPKRRFKVVKDDCQYEKS
SEAETILEIRGKASRLPIRDVGLLVDDNPERKIGIELSEVCYS*
>Ddo_024746 collagen
MLKISILSGCVFLILLFIGNSYGQFRTLNEAVGPAGIRGNPGKPGKPGPD
GDPGPTGPPGPTGKDGLRGQPGAPGPNGNAGPDGKPGSTGVTGPPGARGE
QGPAGPVGKEGLSGAPGFVGPTGEKGDTGPPGEQGDLGDIGPPGRAGPIG
PPGQSGPTGPQGNVGERGPVGPDGMIGPPGPEGKIGSPGSPGRPGEVGKK
GEGGDEGIKGGKGEAGKTGLDGKSGKPGTRGPIGPVGINGRPGKKGELGD
IGLTGPQGISGPRGKRGRVGNPGEGGPKGSQGDQGPIGLEGRAGVLGPKG
QKGDRGRPGELGEAGPLGPIGPIGPDGVKGSRGESGPNGAPGDPGLNGKP
GAIGPPGIPGTPGDAGPGGQTGPMGRPGPNGAPGPRGEVGPNGPDGKSGN
KGAIGPTGQPGVRGSIGNPGKDGALGPMGTSGMRGPPGNIGVEGSKGNKG
PPGSRGKVGSAGKVGPPGKPGVNGSEGPVGNMGNPGFPGSSGDPGAPGPK
GPVGLIGPVGYAGSQGVNGKNGDPGNVGKPGKVGPPGAPGKPGIPGPDGD
RGSLGPPGPTGLKGSPGIIGVPGNYGPNGPPGSLGVIGKQGLEGELGPPG
TPGSPGGTGPRGPPGPAGTPGNAGGEGVDKQKPRSVGSAGIRGPIGKPGA
DGIPGDVGPVGKQGSEGPPGDPGDPGDMGSVGAIGDSGKDGNPGKPGAAG
KPGPSGVDGPEGPAGLPGDIGEPGKSGPSGPRGDPGRVGSAGGPGKQGKS
GNQGKQGPSGKPGPTGPPGLAGLDAANGRKGPLGLRGPPGPKGKSGGPGI
DGKVGTPGTDGFPGYLGPPGQPGPQGIAGKPGKPGPTGGIGLQGDHGEMG
KPGPQGPLGVPGLTGPVGPRGDPGPPGSKGAAGKPGKDGTSGGKGYDGQT
GTSGKSGKIGAPGEKGADGKPGPNGLLGDTGLRGAKGKKGTDGTPGREGK
PGPRGAKGTKGPKGVPGYPGRPGIEGDPGPPGYSGPDGPTGPKGPDGLQG
PVGIFGPRGDPGIPGAIGPTGPPGKPGGTGPAGQRGPLGPPGPPGPPGGV
MAMQMRSPTKGVVYGDDPKAAELLGNNAIKNPQGTKEVPARTCKHLSSVN
PQAPDGEYWIDPNGGRIEDAVKVYCKISEQKTCIKSIDNNLSIRSWNSPI
LNGPTWLQKLINKNEIQYSVPNNQLAFLKVYSDNAVQRVTFNCENHPIIG
NEKIFNKVTAPRLLADDDSIIKMTHPKLKYSVIKDECQYSKSSEAESVIE
VKDVANLLPIRDVGIHIINNRKSKFGLTIEDVCFS*
>Gsp_012135 collagen
MLKISIISGAILLVLIYIECVHGQFRTLNEATGPMGIRGNPGKRGKPGPD
GDPGAQGPTGPPGNDGQRGAPGLQGPPGSVGPEGKSGAGGVSGPPGSRGE
QGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEQGDTGDIGPPGPPGPIG
QPGQNGPTGPQGPVGERGTHGPDGIVGPPGPEGRIGSPGLPGRPGEIGKK
GEGGDEGAKGGKGESGKTGINGKSGKPGLRGRTGPDGNNGKQGRKGEIGD
VGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKG
QKGDRGRPGPQGEAGLIGPTGPIGPDGVKGSRGESGPNGSPGSPGPNGKP
GADGPPGNPGHPGNSGPGGSTGALGRPGPNGAPGPRGEIGPPGPDGKSGN
KGALGPIGPSGLRGPPGNPGKDGMIGPLGLPGLRGPPGNVGAPGNKGNIG
EPGPKGKTGNSGKPGPAGKNGADGSEGLSGNIGSPGFPGPNGDPGASGPP
GPAGAIGIPGYPGNQGISGKNGAPGTEGNPGKSGPVGPVGKAGVPGPIGD
SGNPGPPGPNGLKGAPGIVGIPGNYGPNGPPGALGPLGKQGPEGPVGAPG
LLGSPGPDGPRGPPGDNGSPGSAGGEGVEKHKPRSVGAAGIRGPVGKPGM
DGLPGDLGPPGKQGPEGNPGDPGDPGDMGPIGASGDSGPDGKPGKPGAPG
KQGPEGLSGPEGASGPPGDVGDPGKSGPGGPRGGPGRQGSPGGPGKQGKS
GIQGKPGPAGKPGPLGPQGVTGLDGSGGRKGPLGLRGPVGPKGKPGAPGT
DGKVGTSGPDGYPGYLGPPGPSGPQGITGKPGKKGPPGNRGPIGDNGEMG
KPGPVGVVGDPGITGPVGPRGDPGPPGSKGAAGSPGKDGTSGGKGYDGQG
GTSGKSGKLGTPGDKGADGKPGPGGVLGDVGLRGAKGKKGKDGVPGREGR
PGPRGAKGVKGPKGVPGYPGLIGIEGDPGPPGFPGPDGPVGPKGPDGIQG
PVGITGPRGDPGIPGAIGPTGLSGKKGSNGPLGQLGPLGPPGPPGPPGGI
MAMQLRSPTKGVVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSEN
PNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPI
TKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIG
SEEKLNKVMAPRLLADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIE
LKHLASLLPIRDIGINIINNRKSKFGLTIEEVCFS*
>Gsp_016934 Gsp_016934
MYNLIRLFIAILCVLSVPVIVNGEAKLNRIKRQATSPRQAKGSQGPRGDP
GPMGKPGPQGDPGPLGPVGPPGRDGARGKSGRPGASGVPGKDGTPGSHGV
IGPIGPRGEPGIAGSIGPEGASGPQGNRGYPGDKGDIGLNGLKGSNGEPG
LQGPQGIRGTSGRVGPTGPQGPPGERGKQGPDGIPGSPGPQGTIGPPGQS
GITGEMGNKGIRGEGGIKGSKGDVGNPGLSGKVGPAGPTGPPGFPGVDGR
PGVRGESGVVGQQGPVGKVGQQGQRGKTGNIGPPGPKGAQGDEGPMGIEG
KQGPPGPKGQKGDPGRPGETGDEGPRGERGIVGPSGNKGARGETGQDGSP
GNPGTDGIPGKDGLHGSPGSEGEVGSVGPPGEVGKPGLNGAPGIRGENGA
FGVNGKPGAKGSVGPKGNIGSPGPRGLSGNNGAMGVMGPGGIRGPIGPVG
SPGEKGGKGRVGIPGVSGDAGDIGEQGPPGEDGAEGSSGNPGPIGFPGIA
GKIGQAGPLGVDGPPGPPGFEGLPGVNGKDGKAGKDGKPGPPGEVGPPGV
KGSIGQFGETGPMGKIGIRGPKGMDGIMGPPGVFGMNGPPGPSGEAGPEG
APGKLGPVGIAGRPGRPGAIGPQGIAGEKGPKGLPEVDKPLAGRSSGGPP
GPKGERGPPGDDGLPGPNGNQGPPGPVGMDGGRGDTGDRGLPGPVGDPGK
DGKPGEQGVPGTDGPQGPLGPEGPRGPSGETGEQGPRGLPGKPGEPGGEG
PRGSPGKQGLQGPTGPVGPTGKQGKQGKPGKSGKNGVTGQKGPVGSRGPL
GPRGKDGPLGESGKPGSPGADGFSGFPGPIGPPGAIGPDGKQGPVGQPGE
PGEHGEMGDAGPNGNEGPLGPQGNVGPPGPTGPPGNPGNIGTPGPQGKRG
NRGNTGFVGAAGQRGKRGPLGPPGDKGIPGSPGMAGLKGQTGVQGPRGKN
GNNGKPGRIGRSGPPGSMGQKGVRGVKGFTGPNGLKGPQGSSGYPGEDGP
PGPIGLIGERGPQGITGKRGDRGDPGLLGPPGPLGNDGEYGRPGPQGPMG
PPGPPGPPGSSMPMGLRSPTKGLTMADDPTAAISFGNNAIITPRGTKDVP
ARSCKHLSEHNPDLPDGEYWIDPNGGRIDDAIPVFCRIELQQTCIKPISK
IFKPVQWIRKYARGHIWFQTVNEIGEFEYDIESHQLNFLKALSETATQQI
TFTCINQAIILDKTGKLSNASTKLLGDDDTFLTFHHPKRRYKVVKDECQY
EKFSEAETVLQIQGKASRLPIKDVGLITDSNTSRKVGIELSEVCYS*
>Pgr_004907 Pgr_004907
MLRLVCFFLISSVCLTTVLSKEELINRIKRQATSPNQAIGPQGPRGDPGP
RGKIGQIGDTGAIGPPGPPGRDGAAGKSGKPGPPGLPGKDGRPGTPGNSG
ARGTRGEPGVAGPLGPEGGTGKQGSKGAPGDKGDIGPNGLKGSNGEAGLQ
GIRGPRGSPGPVGPNGPPGSAGDRGKRGPDGRMGGAGPQGPIGPPGIPGL
LGDMGDKGERGDGGPKGVKGEAGNAGVSGKPGPTGPKGPQGSPGVDGRPG
IRGELGNIGPLGPPGVAGPRGQTGAPGNSGKLGPKGAQGEQGPVGLEGKI
GLIGDKGQKGDQGRPGDIGDPGVRGERGVEGIGGNKGPRGEMGPDGSPGN
GGTDGIPGRDGEQGQPGSIGPVGPRGPQGPQGKTGLNGSPGGRGENGPFG
LNGMPGGKGEPGVRGPPGLPGSRGSAGNNGLQGVMGPVGMRGPAGQVGEG
GDKGSPGTAGIAGMPGEAGDVGLQGPPGEEGVEGPSGSPGPAGFSGAVGK
IGQVGPIGINGPPGPPGYEGAPGVNGKDGKSGKDGKPGTPGDTGPVGPKG
ASGPLGERGPLGPLGIKGIKGFEGVMGIPGSFGTNGPPGLSGNIGAEGPV
GKPGTAGLPGLAGAPGQRGPIGDSGEPGSSGAQGVEKPPAGRSSGPPGPK
GNRGPPGDNGTPGAVGTQGPPGPPGIDGDRGDPGDRGLPGPIGEPGKDGK
LGKVGGSGKNGPPGQDGPEGPRGAPGDVGEPGRQGPEGKPGPDGPEGPRG
IPGLQGKRGPSGPIGPIGPPGKQGKTGSPGSDGVIGRKGPPGARGPQGPR
GKDGVAGENGKSGTPGPDGFAGFPGPSGPQGIIGGVGKPGNPGSPGETGE
QGEFGDIGPPGPPGPDGLNGNPGPPGPSGIPGSIGERGPPGPGGKKGPMG
NTGFIGMPGPRGGRGKPGLIGPMGLVGGVGLQGPKGPPGEPGDRGKDGVD
GKPGGPGRPGPTGSIGIKGARGFKGFTGPTGIRGNPGPSGYPGENGMPGP
MGLIGERGPQGAPGQRGERGDTGPAGMIGPQGNDGEYGRPGPQGPMGPSG
PPGPPGSSTLMNLRAPKKNLIYGDDPSSALILGNSVIKTPRGTKEVPART
CKHLAEHNPKIPDGEYWIDPNGGRIQDAVKVFCKIDNYQTCIESTTKKFK
LQSWIRKYPKGPIWFQTTNNVGDFEYDVSDDQLSYLKALSDKASQQITIK
CFNKAIIRDKNGALLKTSPSLLGDDDTILTLEHPKRQYTIVRDDCQYEKN
SEAETVLEVNGKASRLPIKDIGLVIDSDKKVGIELSEVCYS*
>Pgr_010667 collagen
MLTNTFLIGVFLGILSSTEFVSGQFKSLQEAVGPPGIRGNPGPQGKPGPD
GDDGAAGPTGFMGKDGFRGPPGPVGPAGKPGTDGKAGNAGVTGSPGSRGE
PGLTGGMGQEGVAGPPGYPGLIGDKGDPGLPGEQGDVGPIGPVGPVGPKG
PIGQPGPTGKQGGTGNRGITGPDGKVGPPGPQGRMGSQGPPGNPGEMGNK
GEKGSEGRKGGKGSVGKPGLNAKPGKLGPTGQAGTPGKDGQQGRRGNPGN
IGVTGSQGPAGQRGPRGLIGNPGEVGPKGSEGDQGPIGVEGKPGNKGPNG
QKGDRGRPGQIGDNGPEGPIGVVGKPGTKGVRGEIGPNGRPGTPGQNGAP
GKNGPPGTPGNPGGTGPAGQPGSLGRPGPNGTPGPRGEPGPSGNDGTPGA
KGSPGPPGPPGQTGLAGNLGKDGDMGKSGPAGIRGPPGELGPDGAKGKTG
SPGLRGIPGDNGPVGKQGPPGLKGNDGNEGGQGSPGFAGPSGEVGKLGPV
GIPGAAGQRGYPGELGVHGKNGAPGSNGKPGKPGADGSVGKIGAPGPDGD
RGDSGPIGARGVTGQVGILGVPGSYGPNGASGPDGILGKPGPSGKPGEDG
APGIPGSPGPKGPPGEGGDPGNPGMPRISKPQPRGAGPAGPKGEAGKPGL
DGIPGEIGPSGKPGAPGPDGDPGDPGETGAPGQDGEEGKEGAPGAPGKPG
LLGPQGLEGNIGSPGPSGEAGDPGATGAPGPVGAPGRDGSLGLPGRQGKP
GEPGKEGAVGKVGSPGVIGAPGNNGADGRKGPLGPRGPPGDKGKPGGPGT
EGKPGSDGSPGDPGFMGSPGPSGPQGSLGKPGKPGPAGSDGLPGDYGEMG
VPGPAGPDGAPGNSGTAGARGPSGLPGGKGKPGPAGKAGKPGNGGFAGIP
GARGASGTPGPAGDAGPLGAPGSNGAIGETGLAGDKGVSGANGKPGQAGR
PGPRGPAGTKGPKGSPGFDGPTGADGDRGAIGYPGPNGPPGPSGPAGLTG
PTGVIGPRGDPGVLGPVGVAGLPGRQGESGPIGQRGPIGPPGPPGPPGGV
MAMQMRSPTKGIMYGDDSKTAKLLGGNAIKNPQGTKDIPAKTCKHLSVSY
PNKPDGEYWIDPNSGRIDDAVKVFCKISAQQTCIRPINGETPLKTWPSSS
VNGHSWIQALTMNNEVQYSIPNSQLSFLKAYSDSAIQRVTFSCDNHPIIG
INGKINKVTSPRLLADDDSVMRINHQKLKYTVISDDCQYGKASEAETIIE
VSNDASLFPIRDVGVTIISKSNSKFGLKFEEVCFS*
>Pmo_000089 Pmo_000089
MLRIVSLILICAIYIIAVMANEELVQRVKRQATSPNQAIGPQGPRGDPGP
RGKAGIPGDTGVLGPPGPPGRDGAAGKSGQSGPSGLPGKDGRPGTPGNVG
SRGPRGEAGVPGPIGPEGGTGKQGSKGPPGEKGDIGPNGLKGSNGESGLQ
GIRGPRGPPGPIGPNGPSGSAGERGKPGPDGRVGGAGPQGPIGPPGVPGL
PGDMGDKGERGDNGPKGVKGEAGNAGLSGKSGPIGPKGPQGSPGVDGRPG
IRGELGNVGPTGPPGSGGPRGQTGAPGNSGHMGPKGAQGEQGPVGLEGKL
GPAGMKGQKGDQGRPGDIGDPGLRGERGVGGIGGNKGPRGEMGPEGSPGN
SGTDGIPGRDGEPGQPGSAGPPGPRGPQGSQGNTGLNGSPGARGENGPFG
SNGIPGNKGEPGVKGPPGLPGSRGSTGNNGVQGIIGPTGIRGPAGQVGEG
GDKGVPGSTGIGGVPGEAGDAGIQGPPGEEGNEGPSGSPGPAGFSGSVGK
IGQTGPVGIDGPPGPPGYEGAPGPNGKDGKSGKDGKPGVPGDTGPSGQKG
TSGPVGERGPSGPSGVKGVKGFDGIMGIPGSFGANGPPGLSGNFGAEGPV
GKSGAPGLPGLPGTQGQRGPIGDSGDPGSSGAQGVEKPPAGRSAGPPGPK
GNRGPPGDNGQPGATGVQGPPGPPGIDGDRGDPGDRGLPGPIGEPGKDGK
AGKVGVAGKDGPPGQDGPEGPRGAPGDVGEPGRQGPEGKPGLDGPEGPRG
QSGLQGKRGPSGNVGPIGKPGKQGKTGSPGTDGSLGRKGPAGARGPQGPR
GKDGPVGESGKTGSPGSDGFAGFPGPSGPQGAIGPEGKPGISGSPGEAGE
QGEFGDIGVPGPPGNDGLPGNAGPPGPMGVPGSLGERGPPGPIGKKGPMG
NTGFVGMPGPRGGRGKPGLLGPMGLVGGVGLQGPKGPPGEPGERGKDGVD
GKAGGAGRPGPPGNLGIKGARGLKGFTGPTGMRGNSGPSGYPGEDGMQGP
IGLIGERGPQGIAGARGERGDTGPSGLIGPTGNDGEYGRPGPQGPMGPVG
PPGPPGSATLMNLRTPKKNLIYGDDPSAALILGNSVIKTPRGTKEVPART
CKHLAEHNPKLTDGEYWIDPNGGRIADAVKVFCKIDKYQTCIESSTKKFK
LQTWVRKYPKGPIWFQTTNGIGEFEYDVPDDQLSYLKALSDKASQQITVK
CMAQAIIRDKNGQLLKSTPSLLGDDDTILTLDHPKRQYTIVKDDCQYEKN
SDAETILEMSGKSSRLPIKDIGLVIDSDKKVGIELSEVCFS*
>Pmo_027332 collagen
MGKDGVRGPPGLIGPAGKPGSDGKAGVAGVAGSPGPRGEPGLTGSMGQEG
VAGPPGYPGLIGEKGDPGLPGEMGDVGNIGPTGIVGPKGPIGQPGPSGKQ
GITGNRGITGPDGKVGPPGPQGKMGPKGPPGNPGEMGNKGEKGSEGRKGG
KGSSGKPGLNAKPGKLGPIGIAGTPGKDGQQGRRGNPGNIGVMGSQGPSG
QRGSRGIIGNPGEVGPKGSEGDQGPIGLEGKPGNKGPNGLKGDRGRPGQI
GENGPEGPLGMEGKPGTKGVRGEIGPNGSPGTPGQNGVLGKNGPPGTSGN
PGGTGPTGQPGPLGRPGPNGTPGPRGEPGPGGNDGMPGAKGSPGPPGPPG
QPGLAGILGKDGDLGKSGPAGIRGPPGELGPAGGKGKIGSPGLRGIPGDT
GPVGKQGLAGQKGNDGHEGGQGTPGFVGPSGEPGKSGPVGSPGAGGQRGF
PGELGVHGKNGAPGNNGKPGKPGAPGPVGIIGAPGPDGDRGDNGPIGARG
ITGPSGVLGVPGTYGPNGAPGPEGTLGKQGPIGKSGLDGIPGLPGLPGPK
GPPGESGDPGPPGNPRISKPQPRGAGPAGPKGETGKPGLDGVPGEVGPSG
KPGATGPDGDAGDPGDAGAPGPDGEEGKSGALGAPGKPGLLGPMGLEGSV
GAPGPAGEAGDSGISGAPGPVGLPGRDGNLGLPGKQGKQGDPGKEGPVGK
PGGLGVPGVPGMNGANGRKGPLGLRGPQGDKGKTGAPGTDGKPGPDGTTG
DPGVMGSPGPSGPQGSLGKPGKPGLPGSEGLTGDYGELGVPGPVGMDGPP
GNPGTVGAPGPPGGPGGKGKPGAKGKPGKAGSGGFPGVPGSRGALGKPGP
SGDPGDAGELGPNGALGEAGLVGDKGVSGANGKPGQAGRPGSRGLPGVKG
PKGSPGFDGPTGSDGDRGPTGFSGPDGPPGPSGTEGPTGPTGVIGPRGDP
GVLGPIGVPGLPGRQGESGPIGQRGPIGPPGPPGPPGGVMAMQMRSPTKG
IMYGDDSMTAKLLGGNVIKNPQGTKDVPAKTCKHLSVSNPNKPDGEYWID
PNSGRIDDAVKVFCKISKQQTCIKPIDGDIAIKKWHTTNFNGHSWIHTLT
KKNEVKYSIPNSQLSFLKAYSEKAIQRVTFTCDNHPIIGNDGKINKVTSP
RLLADDDSILRINHPKLKYTVITDDCQYGKSSEAETVIEVSNDANIFPIR
DVGVTIISKNDSQFGLKFEEVCFS*
