collagen, Gsp_012135 (mRNA) Girardia sp.

Overview
Namecollagen
Unique NameGsp_012135
TypemRNA
OrganismGirardia sp. (Girardia sp.)
Sequence length8270
Analyses
This mRNA is derived from or has results from the following analyses
Analysis NameDate Performed
Girardia sp. Transcriptome2016-03-03
Girardia Sp Translation2016-04-28
Girardia Sp. BLASTX Human2016-03-08
Girardia Sp. BLASTX Swissprot Uniprot2016-03-08
Girardia Sp. BLASTX Drosophila melanogaster2016-03-09
Girardia Sp. BLASTX Schmidtea mediterranea2016-03-09
Girardia Sp CDS2016-05-06
Homology
BLAST of collagen vs. RefSeq Human
Match: gi|767973237|ref|XP_011536230.1| (PREDICTED: collagen alpha-1(II) chain isoform X1 [Homo sapiens])

HSP 1 Score: 117.472 bits (293), Expect = 4.84939e-25
Identity = 71/219 (32.42%), Postives = 110/219 (50.23%), Query Frame = 3
Query: 7530 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPIT--KGHTWLKSLLNS--------DEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXXRKSKFGLTIEEVCF 8150
+I++P+G+++ PARTC+ L +P G+YWIDPN G T DA+KVFC + +TC+ P + ++W S + K H W +N D + + N Q+ FL++ S +Q IT+ C+N E N A + +D I+ S+ YT +KD C +++IE + S LPI DI I + +FG+ I VCF
Sbjct: 1316 SIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1534

HSP 2 Score: 117.472 bits (293), Expect = 4.84939e-25
Identity = 71/219 (32.42%), Postives = 110/219 (50.23%), Query Frame = -3
Query:  121 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPIT--KGHTWLKSLLNS--------DEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXNRKSKFGLTIEEVCF 741
+I++P+G+++ PARTC+ L +P G+YWIDPN G T DA+KVFC + +TC+ P + ++W S + K H W +N D + + N Q+ FL++ S +Q IT+ C+N E N A + +D I+ S+ YT +KD C +++IE + S LPI DI I + +FG+ I VCF
Sbjct: 1316 SIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1534
BLAST of collagen vs. RefSeq Human
Match: gi|767973239|ref|XP_011536231.1| (PREDICTED: collagen alpha-1(II) chain isoform X1 [Homo sapiens])

HSP 1 Score: 117.472 bits (293), Expect = 4.84939e-25
Identity = 71/219 (32.42%), Postives = 110/219 (50.23%), Query Frame = 3
Query: 7530 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPIT--KGHTWLKSLLNS--------DEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXXRKSKFGLTIEEVCF 8150
+I++P+G+++ PARTC+ L +P G+YWIDPN G T DA+KVFC + +TC+ P + ++W S + K H W +N D + + N Q+ FL++ S +Q IT+ C+N E N A + +D I+ S+ YT +KD C +++IE + S LPI DI I + +FG+ I VCF
Sbjct: 1316 SIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1534

HSP 2 Score: 117.472 bits (293), Expect = 4.84939e-25
Identity = 71/219 (32.42%), Postives = 110/219 (50.23%), Query Frame = -3
Query:  121 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPIT--KGHTWLKSLLNS--------DEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXNRKSKFGLTIEEVCF 741
+I++P+G+++ PARTC+ L +P G+YWIDPN G T DA+KVFC + +TC+ P + ++W S + K H W +N D + + N Q+ FL++ S +Q IT+ C+N E N A + +D I+ S+ YT +KD C +++IE + S LPI DI I + +FG+ I VCF
Sbjct: 1316 SIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1534
BLAST of collagen vs. RefSeq Human
Match: gi|767973241|ref|XP_011536232.1| (PREDICTED: collagen alpha-1(II) chain isoform X1 [Homo sapiens])

HSP 1 Score: 117.472 bits (293), Expect = 4.84939e-25
Identity = 71/219 (32.42%), Postives = 110/219 (50.23%), Query Frame = 3
Query: 7530 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPIT--KGHTWLKSLLNS--------DEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXXRKSKFGLTIEEVCF 8150
+I++P+G+++ PARTC+ L +P G+YWIDPN G T DA+KVFC + +TC+ P + ++W S + K H W +N D + + N Q+ FL++ S +Q IT+ C+N E N A + +D I+ S+ YT +KD C +++IE + S LPI DI I + +FG+ I VCF
Sbjct: 1316 SIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1534

HSP 2 Score: 117.472 bits (293), Expect = 4.84939e-25
Identity = 71/219 (32.42%), Postives = 110/219 (50.23%), Query Frame = -3
Query:  121 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPIT--KGHTWLKSLLNS--------DEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXNRKSKFGLTIEEVCF 741
+I++P+G+++ PARTC+ L +P G+YWIDPN G T DA+KVFC + +TC+ P + ++W S + K H W +N D + + N Q+ FL++ S +Q IT+ C+N E N A + +D I+ S+ YT +KD C +++IE + S LPI DI I + +FG+ I VCF
Sbjct: 1316 SIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1534
BLAST of collagen vs. RefSeq Human
Match: gi|767973243|ref|XP_011536233.1| (PREDICTED: collagen alpha-1(II) chain isoform X1 [Homo sapiens])

HSP 1 Score: 117.472 bits (293), Expect = 4.84939e-25
Identity = 71/219 (32.42%), Postives = 110/219 (50.23%), Query Frame = 3
Query: 7530 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPIT--KGHTWLKSLLNS--------DEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXXRKSKFGLTIEEVCF 8150
+I++P+G+++ PARTC+ L +P G+YWIDPN G T DA+KVFC + +TC+ P + ++W S + K H W +N D + + N Q+ FL++ S +Q IT+ C+N E N A + +D I+ S+ YT +KD C +++IE + S LPI DI I + +FG+ I VCF
Sbjct: 1316 SIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1534

HSP 2 Score: 117.472 bits (293), Expect = 4.84939e-25
Identity = 71/219 (32.42%), Postives = 110/219 (50.23%), Query Frame = -3
Query:  121 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPIT--KGHTWLKSLLNS--------DEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXNRKSKFGLTIEEVCF 741
+I++P+G+++ PARTC+ L +P G+YWIDPN G T DA+KVFC + +TC+ P + ++W S + K H W +N D + + N Q+ FL++ S +Q IT+ C+N E N A + +D I+ S+ YT +KD C +++IE + S LPI DI I + +FG+ I VCF
Sbjct: 1316 SIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1534
BLAST of collagen vs. RefSeq Human
Match: gi|767973245|ref|XP_011536234.1| (PREDICTED: collagen alpha-1(II) chain isoform X1 [Homo sapiens])

HSP 1 Score: 117.472 bits (293), Expect = 4.84939e-25
Identity = 71/219 (32.42%), Postives = 110/219 (50.23%), Query Frame = 3
Query: 7530 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPIT--KGHTWLKSLLNS--------DEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXXRKSKFGLTIEEVCF 8150
+I++P+G+++ PARTC+ L +P G+YWIDPN G T DA+KVFC + +TC+ P + ++W S + K H W +N D + + N Q+ FL++ S +Q IT+ C+N E N A + +D I+ S+ YT +KD C +++IE + S LPI DI I + +FG+ I VCF
Sbjct: 1316 SIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1534

HSP 2 Score: 117.472 bits (293), Expect = 4.84939e-25
Identity = 71/219 (32.42%), Postives = 110/219 (50.23%), Query Frame = -3
Query:  121 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPIT--KGHTWLKSLLNS--------DEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXNRKSKFGLTIEEVCF 741
+I++P+G+++ PARTC+ L +P G+YWIDPN G T DA+KVFC + +TC+ P + ++W S + K H W +N D + + N Q+ FL++ S +Q IT+ C+N E N A + +D I+ S+ YT +KD C +++IE + S LPI DI I + +FG+ I VCF
Sbjct: 1316 SIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1534
BLAST of collagen vs. uniprot
Match: gi|18202526|sp|Q28668|CO1A2_RABIT (RecName: Full=Collagen alpha-2(I) chain; AltName: Full=Alpha-2 type I collagen; Flags: Precursor)

HSP 1 Score: 129.798 bits (325), Expect = 1.69303e-29
Identity = 115/315 (36.51%), Postives = 159/315 (50.48%), Query Frame = 3
Query: 7293 QGPVGITGPRGD---PGIPGAIGPTGLSGKKGSNXXXXXXXXXXXXXXXXXXXXIM----------AMQLRSPT--KGVVYGDDPAAAELLGNNAIKN---PQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSW-NSPITKGHTWLKSLLNSD-EIQYSIPN-------GQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDT-IIKMTQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXXRKSKFGLTIEEVCF 8150
+GP G TGP G G PG +GP GL G +GS GP G GP GPPGPPG GG A Q RSP + Y D L NN I+ P+G+++ PARTC+ L +P G YWIDPN G T DA+KV+C S +TCI+ + IS+++W S K H WL +N + +Y++ QLAF+++ ++ A+Q IT+ C+N EE N A L +D ++ S+ YTV+ D C + ++IIE K + S LP DI I +F + + VCF
Sbjct: 213 RGPAGPTGPAGKDGRSGHPGTVGPAGLRGSQGSQGPAGPPGPPGPPGPPGASGGGYDFGYDGDFYRADQPRSPPSLRPKDYEVDATLKSL--NNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTGETCIRAQPENISVKNWYKSSKAKKHVWLGETINGGTQFEYNVEGVTSKEMATQLAFMRLLANHASQNITYHCKNSIAYMDEETGNLNKAVILQGSNDVELVAEGNSRFTYTVLVDGCTKKTNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFYVDVGPVCF 525

HSP 2 Score: 129.798 bits (325), Expect = 1.69303e-29
Identity = 115/315 (36.51%), Postives = 159/315 (50.48%), Query Frame = -3
Query:  121 QGPVGITGPRGD---PGIPGAIGPTGLSGKKGSNXXXXXXXXXXXXXXXXXXXGIM----------AMQLRSPT--KGVVYGDDPAAAELLGNNAIKN---PQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSW-NSPITKGHTWLKSLLNSD-EIQYSIPN-------GQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDT-IIKMTQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXNRKSKFGLTIEEVCF 978
+GP G TGP G G PG +GP GL G +GS GP G GP GPPGPPG GG A Q RSP + Y D L NN I+ P+G+++ PARTC+ L +P G YWIDPN G T DA+KV+C S +TCI+ + IS+++W S K H WL +N + +Y++ QLAF+++ ++ A+Q IT+ C+N EE N A L +D ++ S+ YTV+ D C + ++IIE K + S LP DI I +F + + VCF
Sbjct: 213 RGPAGPTGPAGKDGRSGHPGTVGPAGLRGSQGSQGPAGPPGPPGPPGPPGASGGGYDFGYDGDFYRADQPRSPPSLRPKDYEVDATLKSL--NNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTGETCIRAQPENISVKNWYKSSKAKKHVWLGETINGGTQFEYNVEGVTSKEMATQLAFMRLLANHASQNITYHCKNSIAYMDEETGNLNKAVILQGSNDVELVAEGNSRFTYTVLVDGCTKKTNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFYVDVGPVCF 525
BLAST of collagen vs. uniprot
Match: gi|115286|sp|P02460|CO2A1_CHICK (RecName: Full=Collagen alpha-1(II) chain; AltName: Full=Alpha-1 type II collagen; Flags: Precursor, partial [Gallus gallus])

HSP 1 Score: 118.242 bits (295), Expect = 3.46806e-26
Identity = 72/220 (32.73%), Postives = 111/220 (50.45%), Query Frame = 3
Query: 7530 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKG--HTWLKSLLNSDEIQYSI------PNG---QLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKMT-QSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXXRKSKFGLTIEEVCF 8150
+I++P+G+K+ PARTC+ + +P G+YWIDPN G T DA+KVFC + +TC+ P I ++W + TK H W +N +S PN Q+ FL++ S +Q +T+ C+N EE N A + +D I+ S+ Y+V++D C +++IE + S LPI DI I +FG+ I VCF
Sbjct: 150 SIRSPEGSKKNPARTCRDIKLCHPEWKSGDYWIDPNQGCTLDAIKVFCNMETGETCVYPTPSSIPRKNWWTSKTKDKKHVWFAETINGG-FHFSYGDENLSPNTASIQMTFLRLLSTEGSQNVTYHCKNSIAYMDEETGNLKKAILIQGSNDVEIRAEGNSRFTYSVLEDGCTKHTGKWGKTVIEYRSQKTSRLPIVDIAPMDIGGADQEFGVDIGPVCF 368

HSP 2 Score: 118.242 bits (295), Expect = 3.46806e-26
Identity = 72/220 (32.73%), Postives = 111/220 (50.45%), Query Frame = -3
Query:  121 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKG--HTWLKSLLNSDEIQYSI------PNG---QLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKMT-QSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXNRKSKFGLTIEEVCF 741
+I++P+G+K+ PARTC+ + +P G+YWIDPN G T DA+KVFC + +TC+ P I ++W + TK H W +N +S PN Q+ FL++ S +Q +T+ C+N EE N A + +D I+ S+ Y+V++D C +++IE + S LPI DI I +FG+ I VCF
Sbjct: 150 SIRSPEGSKKNPARTCRDIKLCHPEWKSGDYWIDPNQGCTLDAIKVFCNMETGETCVYPTPSSIPRKNWWTSKTKDKKHVWFAETINGG-FHFSYGDENLSPNTASIQMTFLRLLSTEGSQNVTYHCKNSIAYMDEETGNLKKAILIQGSNDVEIRAEGNSRFTYSVLEDGCTKHTGKWGKTVIEYRSQKTSRLPIVDIAPMDIGGADQEFGVDIGPVCF 368
BLAST of collagen vs. uniprot
Match: gi|82202407|sp|Q6P4Z2|CO2A1_XENTR (RecName: Full=Collagen alpha-1(II) chain; AltName: Full=Alpha-1 type II collagen; Flags: Precursor)

HSP 1 Score: 122.094 bits (305), Expect = 4.16151e-26
Identity = 78/221 (35.29%), Postives = 112/221 (50.68%), Query Frame = 3
Query: 7530 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKG----HTWLKSLLNSD-EIQY----SIPNG---QLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXXRKSKFGLTIEEVCF 8150
+I++P GTK+ PARTC+ L +P G+YWIDPN G T DA+KVFC + +TC+ P +I ++W S KG H W +N + Y S PN Q+ FL++ S ATQ IT+ C+N E N A L +D I+ S+ Y ++D C+ ++++IE + S LPI DI I +FG+ I VCF
Sbjct: 1273 SIRSPDGTKKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTVDAIKVFCNMETGETCVYPNPSKIPKKNWWS--AKGKEKKHIWFGETINGGFQFSYGDDSSAPNTANIQMTFLRLLSTDATQNITYHCKNSIAFMDEASGNLKKAVLLQGSNDVEIRAEGNSRFTYNALEDGCKKHTGKWSKTVIEYRTQKTSRLPIVDIAPMDIGGADQEFGVDIGPVCF 1491

HSP 2 Score: 122.094 bits (305), Expect = 4.16151e-26
Identity = 78/221 (35.29%), Postives = 112/221 (50.68%), Query Frame = -3
Query:  121 AIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKG----HTWLKSLLNSD-EIQY----SIPNG---QLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXNRKSKFGLTIEEVCF 741
+I++P GTK+ PARTC+ L +P G+YWIDPN G T DA+KVFC + +TC+ P +I ++W S KG H W +N + Y S PN Q+ FL++ S ATQ IT+ C+N E N A L +D I+ S+ Y ++D C+ ++++IE + S LPI DI I +FG+ I VCF
Sbjct: 1273 SIRSPDGTKKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTVDAIKVFCNMETGETCVYPNPSKIPKKNWWS--AKGKEKKHIWFGETINGGFQFSYGDDSSAPNTANIQMTFLRLLSTDATQNITYHCKNSIAFMDEASGNLKKAVLLQGSNDVEIRAEGNSRFTYNALEDGCKKHTGKWSKTVIEYRTQKTSRLPIVDIAPMDIGGADQEFGVDIGPVCF 1491
BLAST of collagen vs. uniprot
Match: gi|146286085|sp|Q91717|CO2A1_XENLA (RecName: Full=Collagen alpha-1(II) chain; AltName: Full=Alpha-1 type II collagen; Flags: Precursor [Xenopus laevis])

HSP 1 Score: 120.939 bits (302), Expect = 1.21158e-25
Identity = 80/226 (35.40%), Postives = 113/226 (50.00%), Query Frame = 3
Query: 7524 NNAIKN---PQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKG----HTWLKSLLNSD-EIQY----SIPNG---QLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXXRKSKFGLTIEEVCF 8150
NN I+N P GTK+ PARTC+ L +P G+YWIDPN G T DA+KVFC + +TC+ P +I ++W S KG H W +N + Y S PN Q+ FL++ S A+Q IT+ C+N E N A L +D I+ S+ Y ++D C+ ++++IE + S LPI DI I +FG+ I VCF
Sbjct: 1262 NNQIENIRSPDGTKKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTVDAIKVFCDMETGETCVYPNPSKIPKKNWWS--AKGKEKKHIWFGETINGGFQFSYGDDSSAPNTANIQMTFLRLLSTDASQNITYHCKNSIAFMDEASGNLKKAVLLQGSNDVEIRAEGNSRFTYNALEDGCKKHTGKWSKTVIEYRTQKTSRLPIVDIAPMDIGGADQEFGVDIGPVCF 1485

HSP 2 Score: 120.939 bits (302), Expect = 1.21158e-25
Identity = 80/226 (35.40%), Postives = 113/226 (50.00%), Query Frame = -3
Query:  121 NNAIKN---PQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKG----HTWLKSLLNSD-EIQY----SIPNG---QLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKM-TQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXNRKSKFGLTIEEVCF 747
NN I+N P GTK+ PARTC+ L +P G+YWIDPN G T DA+KVFC + +TC+ P +I ++W S KG H W +N + Y S PN Q+ FL++ S A+Q IT+ C+N E N A L +D I+ S+ Y ++D C+ ++++IE + S LPI DI I +FG+ I VCF
Sbjct: 1262 NNQIENIRSPDGTKKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTVDAIKVFCDMETGETCVYPNPSKIPKKNWWS--AKGKEKKHIWFGETINGGFQFSYGDDSSAPNTANIQMTFLRLLSTDASQNITYHCKNSIAFMDEASGNLKKAVLLQGSNDVEIRAEGNSRFTYNALEDGCKKHTGKWSKTVIEYRTQKTSRLPIVDIAPMDIGGADQEFGVDIGPVCF 1485
BLAST of collagen vs. uniprot
Match: gi|8039779|sp|P02465|CO1A2_BOVIN (RecName: Full=Collagen alpha-2(I) chain; AltName: Full=Alpha-2 type I collagen; Flags: Precursor)

HSP 1 Score: 120.553 bits (301), Expect = 1.36103e-25
Identity = 78/225 (34.67%), Postives = 118/225 (52.44%), Query Frame = 3
Query: 7524 NNAIKN---PQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSW--NSPITKGHTWLKSLLNSD-EIQYSIPNG--------QLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDT-IIKMTQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXXRKSKFGLTIEEVCF 8150
NN I+ P+G+++ PARTC+ L +P G YWIDPN G T DA+KV+C S +TCI+ ++I +++W NS K H W+ +N + +Y++ G QLAF+++ ++ A+Q IT+ C+N EE N A L +D ++ S+ YTV+ D C + ++IIE K + S LPI DI I + L I VCF
Sbjct: 1141 NNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTGETCIRAQPEDIPVKNWYRNSKAKK-HVWVGETINGGTQFEYNV-EGVTTKEMATQLAFMRLLANHASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKKTNEWQKTIIEYKTNKPSRLPILDIAPLDIGGADQEIRLNIGPVCF 1363

HSP 2 Score: 120.553 bits (301), Expect = 1.36103e-25
Identity = 78/225 (34.67%), Postives = 118/225 (52.44%), Query Frame = -3
Query:  121 NNAIKN---PQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSW--NSPITKGHTWLKSLLNSD-EIQYSIPNG--------QLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDT-IIKMTQSKLKYTVIKDECQYSKSSEAESIIELK-HLASLLPIRDIGXXXXXNRKSKFGLTIEEVCF 747
NN I+ P+G+++ PARTC+ L +P G YWIDPN G T DA+KV+C S +TCI+ ++I +++W NS K H W+ +N + +Y++ G QLAF+++ ++ A+Q IT+ C+N EE N A L +D ++ S+ YTV+ D C + ++IIE K + S LPI DI I + L I VCF
Sbjct: 1141 NNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTGETCIRAQPEDIPVKNWYRNSKAKK-HVWVGETINGGTQFEYNV-EGVTTKEMATQLAFMRLLANHASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKKTNEWQKTIIEYKTNKPSRLPILDIAPLDIGGADQEIRLNIGPVCF 1363

HSP 3 Score: 59.3066 bits (142), Expect = 5.08168e-07
Identity = 50/109 (45.87%), Postives = 64/109 (58.72%), Query Frame = 3
Query: 4962 GKSGKPGLRGRTGP---DGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDG 5279
G +G PG +G GP GN G+ G G +G G +GP+G +GP+G+RG G PGD GP+G G +G GL+G G G G +G G GP G G GP+GP G DG
Sbjct: 956 GAAGAPGPQGPVGPVGKHGNRGEPGPAGAVGPAGAVGPRGPSGPQGIRGDKGEPGDKGPRGLPGLKGHNGLQGLPGLAGHHGDQGAPGAVGPAGPRGPAGPSGPAGKDG 1064

HSP 4 Score: 59.3066 bits (142), Expect = 5.08168e-07
Identity = 50/109 (45.87%), Postives = 64/109 (58.72%), Query Frame = -3
Query: 2992 GKSGKPGLRGRTGP---DGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDG 3309
G +G PG +G GP GN G+ G G +G G +GP+G +GP+G+RG G PGD GP+G G +G GL+G G G G +G G GP G G GP+GP G DG
Sbjct: 956 GAAGAPGPQGPVGPVGKHGNRGEPGPAGAVGPAGAVGPRGPSGPQGIRGDKGEPGDKGPRGLPGLKGHNGLQGLPGLAGHHGDQGAPGAVGPAGPRGPAGPSGPAGKDG 1064
BLAST of collagen vs. RefSeq Drosophila melanogaster
Match: gi|24581820|ref|NP_723044.1| (collagen type IV, isoform A [Drosophila melanogaster])

HSP 1 Score: 56.6102 bits (135), Expect = 4.63167e-07
Identity = 104/251 (41.43%), Postives = 122/251 (48.61%), Query Frame = 3
Query: 4596 EQGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXINGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPG-------DSGPKGSQGDQGPIGLEGK------SGPFGPKGQKGDRGRPGPQGEAGL---------IGPTGPIGPDGV 5282
++G PGP G G TGP G PG GEKG G G GPPG G G++G G G GE+G G G+ G PG +G +G+PG+PG PG G G G G G PG RG G G +G G KGE G VGL G G GP+G RG G PG G KGSQG++G G +G+ +GP G G KGDRG GP G +GL IGP G IG GV
Sbjct: 1065 QKGEPGPSGLRGDTGPAGTPGWPGEKGLPG-------LAVHGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLPGAAGA---------------PGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGIRGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGV 1293

HSP 2 Score: 56.6102 bits (135), Expect = 4.63167e-07
Identity = 104/251 (41.43%), Postives = 122/251 (48.61%), Query Frame = -3
Query: 2989 EQGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPVGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXGINGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPG-------DSGPKGSQGDQGPIGLEGK------SGPFGPKGQKGDRGRPGPQGEAGL---------IGPTGPIGPDGV 3675
++G PGP G G TGP G PG GEKG G G GPPG G G++G G G GE+G G G+ G PG +G +G+PG+PG PG G G G G G PG RG G G +G G KGE G VGL G G GP+G RG G PG G KGSQG++G G +G+ +GP G G KGDRG GP G +GL IGP G IG GV
Sbjct: 1065 QKGEPGPSGLRGDTGPAGTPGWPGEKGLPG-------LAVHGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLPGAAGA---------------PGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGIRGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGV 1293

HSP 3 Score: 55.4546 bits (132), Expect = 1.05551e-06
Identity = 50/104 (48.08%), Postives = 58/104 (55.77%), Query Frame = 3
Query: 4977 PGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGP--KGQKGDRGRPGPQGEAGLIGPTGPIGPDGV 5282
P +RG G G G G KGE G+ GL GP G+ G +G RG G PG SG G G +G IG G+ G G KG+KG GRPG G GLIG G IG G+
Sbjct: 1222 PDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGL 1325

HSP 4 Score: 55.4546 bits (132), Expect = 1.05551e-06
Identity = 50/104 (48.08%), Postives = 58/104 (55.77%), Query Frame = -3
Query: 2989 PGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGP--KGQKGDRGRPGPQGEAGLIGPTGPIGPDGV 3294
P +RG G G G G KGE G+ GL GP G+ G +G RG G PG SG G G +G IG G+ G G KG+KG GRPG G GLIG G IG G+
Sbjct: 1222 PDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGL 1325

HSP 5 Score: 54.6842 bits (130), Expect = 1.70422e-06
Identity = 44/95 (46.32%), Postives = 55/95 (57.89%), Query Frame = 3
Query: 4959 NGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAG 5243
+G G PGL+G TGP G G++G G G G G +GL GP GL G G GD+G G G+ GP+G G+ G GPKG+ G G PG G+ G
Sbjct: 1362 DGFPGAPGLKGDTGPQGFKGERGLNGFEGQKGDKGDRGLQGPSGLPGLVGQKGDTGYPGLNGNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKG 1456

HSP 6 Score: 54.6842 bits (130), Expect = 1.70422e-06
Identity = 44/95 (46.32%), Postives = 55/95 (57.89%), Query Frame = -3
Query: 3028 NGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAG 3312
+G G PGL+G TGP G G++G G G G G +GL GP GL G G GD+G G G+ GP+G G+ G GPKG+ G G PG G+ G
Sbjct: 1362 DGFPGAPGLKGDTGPQGFKGERGLNGFEGQKGDKGDRGLQGPSGLPGLVGQKGDTGYPGLNGNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKG 1456
BLAST of collagen vs. RefSeq Drosophila melanogaster
Match: gi|24581822|ref|NP_723045.1| (collagen type IV, isoform B [Drosophila melanogaster])

HSP 1 Score: 56.6102 bits (135), Expect = 4.63167e-07
Identity = 104/251 (41.43%), Postives = 122/251 (48.61%), Query Frame = 3
Query: 4596 EQGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXINGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPG-------DSGPKGSQGDQGPIGLEGK------SGPFGPKGQKGDRGRPGPQGEAGL---------IGPTGPIGPDGV 5282
++G PGP G G TGP G PG GEKG G G GPPG G G++G G G GE+G G G+ G PG +G +G+PG+PG PG G G G G G PG RG G G +G G KGE G VGL G G GP+G RG G PG G KGSQG++G G +G+ +GP G G KGDRG GP G +GL IGP G IG GV
Sbjct: 1065 QKGEPGPSGLRGDTGPAGTPGWPGEKGLPG-------LAVHGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLPGAAGA---------------PGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGIRGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGV 1293

HSP 2 Score: 56.6102 bits (135), Expect = 4.63167e-07
Identity = 104/251 (41.43%), Postives = 122/251 (48.61%), Query Frame = -3
Query: 2989 EQGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPVGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXGINGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPG-------DSGPKGSQGDQGPIGLEGK------SGPFGPKGQKGDRGRPGPQGEAGL---------IGPTGPIGPDGV 3675
++G PGP G G TGP G PG GEKG G G GPPG G G++G G G GE+G G G+ G PG +G +G+PG+PG PG G G G G G PG RG G G +G G KGE G VGL G G GP+G RG G PG G KGSQG++G G +G+ +GP G G KGDRG GP G +GL IGP G IG GV
Sbjct: 1065 QKGEPGPSGLRGDTGPAGTPGWPGEKGLPG-------LAVHGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLPGAAGA---------------PGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGIRGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGV 1293

HSP 3 Score: 55.4546 bits (132), Expect = 1.05551e-06
Identity = 50/104 (48.08%), Postives = 58/104 (55.77%), Query Frame = 3
Query: 4977 PGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGP--KGQKGDRGRPGPQGEAGLIGPTGPIGPDGV 5282
P +RG G G G G KGE G+ GL GP G+ G +G RG G PG SG G G +G IG G+ G G KG+KG GRPG G GLIG G IG G+
Sbjct: 1222 PDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGL 1325

HSP 4 Score: 55.4546 bits (132), Expect = 1.05551e-06
Identity = 50/104 (48.08%), Postives = 58/104 (55.77%), Query Frame = -3
Query: 2989 PGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGP--KGQKGDRGRPGPQGEAGLIGPTGPIGPDGV 3294
P +RG G G G G KGE G+ GL GP G+ G +G RG G PG SG G G +G IG G+ G G KG+KG GRPG G GLIG G IG G+
Sbjct: 1222 PDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGL 1325

HSP 5 Score: 54.6842 bits (130), Expect = 1.70422e-06
Identity = 44/95 (46.32%), Postives = 55/95 (57.89%), Query Frame = 3
Query: 4959 NGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAG 5243
+G G PGL+G TGP G G++G G G G G +GL GP GL G G GD+G G G+ GP+G G+ G GPKG+ G G PG G+ G
Sbjct: 1362 DGFPGAPGLKGDTGPQGFKGERGLNGFEGQKGDKGDRGLQGPSGLPGLVGQKGDTGYPGLNGNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKG 1456

HSP 6 Score: 54.6842 bits (130), Expect = 1.70422e-06
Identity = 44/95 (46.32%), Postives = 55/95 (57.89%), Query Frame = -3
Query: 3028 NGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAG 3312
+G G PGL+G TGP G G++G G G G G +GL GP GL G G GD+G G G+ GP+G G+ G GPKG+ G G PG G+ G
Sbjct: 1362 DGFPGAPGLKGDTGPQGFKGERGLNGFEGQKGDKGDRGLQGPSGLPGLVGQKGDTGYPGLNGNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKG 1456
BLAST of collagen vs. RefSeq Drosophila melanogaster
Match: gi|24581824|ref|NP_723046.1| (collagen type IV, isoform C [Drosophila melanogaster])

HSP 1 Score: 56.6102 bits (135), Expect = 4.63167e-07
Identity = 104/251 (41.43%), Postives = 122/251 (48.61%), Query Frame = 3
Query: 4596 EQGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXINGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPG-------DSGPKGSQGDQGPIGLEGK------SGPFGPKGQKGDRGRPGPQGEAGL---------IGPTGPIGPDGV 5282
++G PGP G G TGP G PG GEKG G G GPPG G G++G G G GE+G G G+ G PG +G +G+PG+PG PG G G G G G PG RG G G +G G KGE G VGL G G GP+G RG G PG G KGSQG++G G +G+ +GP G G KGDRG GP G +GL IGP G IG GV
Sbjct: 1065 QKGEPGPSGLRGDTGPAGTPGWPGEKGLPG-------LAVHGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLPGAAGA---------------PGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGIRGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGV 1293

HSP 2 Score: 56.6102 bits (135), Expect = 4.63167e-07
Identity = 104/251 (41.43%), Postives = 122/251 (48.61%), Query Frame = -3
Query: 2989 EQGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPVGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXGINGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPG-------DSGPKGSQGDQGPIGLEGK------SGPFGPKGQKGDRGRPGPQGEAGL---------IGPTGPIGPDGV 3675
++G PGP G G TGP G PG GEKG G G GPPG G G++G G G GE+G G G+ G PG +G +G+PG+PG PG G G G G G PG RG G G +G G KGE G VGL G G GP+G RG G PG G KGSQG++G G +G+ +GP G G KGDRG GP G +GL IGP G IG GV
Sbjct: 1065 QKGEPGPSGLRGDTGPAGTPGWPGEKGLPG-------LAVHGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLPGAAGA---------------PGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGIRGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGV 1293

HSP 3 Score: 55.4546 bits (132), Expect = 1.05551e-06
Identity = 50/104 (48.08%), Postives = 58/104 (55.77%), Query Frame = 3
Query: 4977 PGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGP--KGQKGDRGRPGPQGEAGLIGPTGPIGPDGV 5282
P +RG G G G G KGE G+ GL GP G+ G +G RG G PG SG G G +G IG G+ G G KG+KG GRPG G GLIG G IG G+
Sbjct: 1222 PDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGL 1325

HSP 4 Score: 55.4546 bits (132), Expect = 1.05551e-06
Identity = 50/104 (48.08%), Postives = 58/104 (55.77%), Query Frame = -3
Query: 2989 PGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGP--KGQKGDRGRPGPQGEAGLIGPTGPIGPDGV 3294
P +RG G G G G KGE G+ GL GP G+ G +G RG G PG SG G G +G IG G+ G G KG+KG GRPG G GLIG G IG G+
Sbjct: 1222 PDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGL 1325

HSP 5 Score: 54.6842 bits (130), Expect = 1.70422e-06
Identity = 44/95 (46.32%), Postives = 55/95 (57.89%), Query Frame = 3
Query: 4959 NGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAG 5243
+G G PGL+G TGP G G++G G G G G +GL GP GL G G GD+G G G+ GP+G G+ G GPKG+ G G PG G+ G
Sbjct: 1362 DGFPGAPGLKGDTGPQGFKGERGLNGFEGQKGDKGDRGLQGPSGLPGLVGQKGDTGYPGLNGNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKG 1456

HSP 6 Score: 54.6842 bits (130), Expect = 1.70422e-06
Identity = 44/95 (46.32%), Postives = 55/95 (57.89%), Query Frame = -3
Query: 3028 NGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAG 3312
+G G PGL+G TGP G G++G G G G G +GL GP GL G G GD+G G G+ GP+G G+ G GPKG+ G G PG G+ G
Sbjct: 1362 DGFPGAPGLKGDTGPQGFKGERGLNGFEGQKGDKGDRGLQGPSGLPGLVGQKGDTGYPGLNGNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKG 1456
BLAST of collagen vs. RefSeq Drosophila melanogaster
Match: gi|442619464|ref|NP_001262641.1| (CG42342, isoform T [Drosophila melanogaster])

HSP 1 Score: 54.299 bits (129), Expect = 2.13288e-06
Identity = 48/101 (47.52%), Postives = 60/101 (59.41%), Query Frame = 3
Query: 4980 GLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDGV 5282
G+RG +GP G +GK G G G G+ G QG TG +G RG G PG G G +G +G G G +GP G +G+KGDRG G QG GL P P+G DG+
Sbjct: 648 GMRGESGPSGPSGKAGIPGPPGLDGMKGAQGETGHKGERGDPGLPGTDGIPGQEGPRGEQGSRGDAGPPGKRGRKGDRGDKGEQGVPGLDAPC-PLGADGL 747

HSP 2 Score: 54.299 bits (129), Expect = 2.13288e-06
Identity = 48/101 (47.52%), Postives = 60/101 (59.41%), Query Frame = -3
Query: 2989 GLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDGV 3291
G+RG +GP G +GK G G G G+ G QG TG +G RG G PG G G +G +G G G +GP G +G+KGDRG G QG GL P P+G DG+
Sbjct: 648 GMRGESGPSGPSGKAGIPGPPGLDGMKGAQGETGHKGERGDPGLPGTDGIPGQEGPRGEQGSRGDAGPPGKRGRKGDRGDKGEQGVPGLDAPC-PLGADGL 747
BLAST of collagen vs. RefSeq Drosophila melanogaster
Match: gi|442619462|ref|NP_001247141.2| (CG42342, isoform S [Drosophila melanogaster])

HSP 1 Score: 53.9138 bits (128), Expect = 2.56381e-06
Identity = 48/101 (47.52%), Postives = 60/101 (59.41%), Query Frame = 3
Query: 4980 GLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDGV 5282
G+RG +GP G +GK G G G G+ G QG TG +G RG G PG G G +G +G G G +GP G +G+KGDRG G QG GL P P+G DG+
Sbjct: 643 GMRGESGPSGPSGKAGIPGPPGLDGMKGAQGETGHKGERGDPGLPGTDGIPGQEGPRGEQGSRGDAGPPGKRGRKGDRGDKGEQGVPGLDAPC-PLGADGL 742

HSP 2 Score: 53.9138 bits (128), Expect = 2.56381e-06
Identity = 48/101 (47.52%), Postives = 60/101 (59.41%), Query Frame = -3
Query: 2989 GLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDGV 3291
G+RG +GP G +GK G G G G+ G QG TG +G RG G PG G G +G +G G G +GP G +G+KGDRG G QG GL P P+G DG+
Sbjct: 643 GMRGESGPSGPSGKAGIPGPPGLDGMKGAQGETGHKGERGDPGLPGTDGIPGQEGPRGEQGSRGDAGPPGKRGRKGDRGDKGEQGVPGLDAPC-PLGADGL 742
BLAST of collagen vs. Smed Unigenes AA
Match: SMU15040033 (dd_smedV4_702_0_1|m.35199|m.6295)

HSP 1 Score: 443.736 bits (1140), Expect = 6.44777e-129
Identity = 235/288 (81.60%), Postives = 262/288 (90.97%), Query Frame = 3
Query: 7293 QGPVGITGPRGDPGIPGAIGPTGLSGKKGSNXXXXXXXXXXXXXXXXXXXXIMAMQLRSPTKGVVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIELKHLASLLPIRDIGXXXXXXRKSKFGLTIEEVCFS* 8156
QGPVGI GPRGDPGIPG IGPTGL GKKG G +G +GPLGPPGPPGPPGGIMAMQ+RSPTKGV YGDDP AAELLGNNAIKNP+GTKEVPA TCKQLS ++PN+PDGEYWIDPNGGR +DAVKV+C+ISE+KTCIKP+N EIS+RSW S GHTWLKS+LN +EIQYSIPNGQ+AFLK+ SD+A QR+TF CENHPIIG+EEKLNKV APRLLADDDTIIKMT S LKYTVIKDECQYSKSSEAESIIE+++ A+LLPIRDIG++IINNRKSKFG+TIEEVCFS*
Sbjct: 999 QGPVGIIGPRGDPGIPGPIGPTGLHGKKGGIGIMGPVGPLGPPGPPGPPGGIMAMQMRSPTKGVTYGDDPLAAELLGNNAIKNPEGTKEVPAITCKQLSVKHPNLPDGEYWIDPNGGRVNDAVKVYCRISEQKTCIKPINNEISLRSWKSHSANGHTWLKSILNKEEIQYSIPNGQIAFLKVNSDSAVQRVTFTCENHPIIGNEEKLNKVTAPRLLADDDTIIKMTHSHLKYTVIKDECQYSKSSEAESIIEVRNYANLLPIRDIGVSIINNRKSKFGVTIEEVCFS* 1286

HSP 2 Score: 443.736 bits (1140), Expect = 6.44777e-129
Identity = 235/288 (81.60%), Postives = 262/288 (90.97%), Query Frame = -3
Query:  115 QGPVGITGPRGDPGIPGAIGPTGLSGKKGSNXXXXXXXXXXXXXXXXXXXGIMAMQLRSPTKGVVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIELKHLASLLPIRDIGXXXXXNRKSKFGLTIEEVCFS* 978
QGPVGI GPRGDPGIPG IGPTGL GKKG G +G +GPLGPPGPPGPPGGIMAMQ+RSPTKGV YGDDP AAELLGNNAIKNP+GTKEVPA TCKQLS ++PN+PDGEYWIDPNGGR +DAVKV+C+ISE+KTCIKP+N EIS+RSW S GHTWLKS+LN +EIQYSIPNGQ+AFLK+ SD+A QR+TF CENHPIIG+EEKLNKV APRLLADDDTIIKMT S LKYTVIKDECQYSKSSEAESIIE+++ A+LLPIRDIG++IINNRKSKFG+TIEEVCFS*
Sbjct: 999 QGPVGIIGPRGDPGIPGPIGPTGLHGKKGGIGIMGPVGPLGPPGPPGPPGGIMAMQMRSPTKGVTYGDDPLAAELLGNNAIKNPEGTKEVPAITCKQLSVKHPNLPDGEYWIDPNGGRVNDAVKVYCRISEQKTCIKPINNEISLRSWKSHSANGHTWLKSILNKEEIQYSIPNGQIAFLKVNSDSAVQRVTFTCENHPIIGNEEKLNKVTAPRLLADDDTIIKMTHSHLKYTVIKDECQYSKSSEAESIIEVRNYANLLPIRDIGVSIINNRKSKFGVTIEEVCFS* 1286

HSP 3 Score: 240.736 bits (613), Expect = 1.04208e-63
Identity = 252/311 (81.03%), Postives = 273/311 (87.78%), Query Frame = 3
Query: 4299 MLKISIISGAILLVLIYIECVHGQFRTLNEATGPMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDGQRGAPGLQGPPXXXXXXXXXXXXXXXXXXXXXXEQGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXINGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQ 5231
M K SI SGAILL+LIY++ +GQFRTLNEATGP+GIRGNPGKRGK GPDGDPG+ GP GPPG DG RGAPG GP G GP+GKSG G +GPPGSRGEQGPPGP+GKEGLTGP GY GPSGEKGDSG GEQGD GDIGP GP GP+G PGQ+GPTGPQG VGERG HGPDG+VGPPGPEGR+GSPG PGRPGE+GKKGEGGDEG KGGKGE+GKTGINGKSGKPG+RG GP G NGKQGRKGE+GD+GL GPQGL GPRG+RG+ GNPGD+GPKGSQGDQGPIGLEGK GPFGPKGQKGDRGRPGPQ
Sbjct: 1 MFKNSIFSGAILLILIYVDFSYGQFRTLNEATGPIGIRGNPGKRGKIGPDGDPGSSGPPGPPGKDGLRGAPGPNGPAGGAGPDGKSGVTGNTGPPGSRGEQGPPGPVGKEGLTGPNGYSGPSGEKGDSGSIGEQGDPGDIGPQGPAGPLGPPGQSGPTGPQGTVGERGPHGPDGVVGPPGPEGRMGSPGSPGRPGELGKKGEGGDEGLKGGKGENGKTGINGKSGKPGIRGPIGPVGINGKQGRKGELGDIGLTGPQGLIGPRGVRGTVGNPGDNGPKGSQGDQGPIGLEGKPGPFGPKGQKGDRGRPGPQ 311

HSP 4 Score: 240.736 bits (613), Expect = 1.04208e-63
Identity = 252/311 (81.03%), Postives = 273/311 (87.78%), Query Frame = -3
Query: 3040 MLKISIISGAILLVLIYIECVHGQFRTLNEATGPMXXXXXXXXXXXXXXXXXXXXXXXXXXXXNDGQRGAPGLQGPPXXXXXXXXXXXXXXXXXXXXXGEQGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPVGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXGINGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQ 3972
M K SI SGAILL+LIY++ +GQFRTLNEATGP+GIRGNPGKRGK GPDGDPG+ GP GPPG DG RGAPG GP G GP+GKSG G +GPPGSRGEQGPPGP+GKEGLTGP GY GPSGEKGDSG GEQGD GDIGP GP GP+G PGQ+GPTGPQG VGERG HGPDG+VGPPGPEGR+GSPG PGRPGE+GKKGEGGDEG KGGKGE+GKTGINGKSGKPG+RG GP G NGKQGRKGE+GD+GL GPQGL GPRG+RG+ GNPGD+GPKGSQGDQGPIGLEGK GPFGPKGQKGDRGRPGPQ
Sbjct: 1 MFKNSIFSGAILLILIYVDFSYGQFRTLNEATGPIGIRGNPGKRGKIGPDGDPGSSGPPGPPGKDGLRGAPGPNGPAGGAGPDGKSGVTGNTGPPGSRGEQGPPGPVGKEGLTGPNGYSGPSGEKGDSGSIGEQGDPGDIGPQGPAGPLGPPGQSGPTGPQGTVGERGPHGPDGVVGPPGPEGRMGSPGSPGRPGELGKKGEGGDEGLKGGKGENGKTGINGKSGKPGIRGPIGPVGINGKQGRKGELGDIGLTGPQGLIGPRGVRGTVGNPGDNGPKGSQGDQGPIGLEGKPGPFGPKGQKGDRGRPGPQ 311

HSP 5 Score: 87.0409 bits (214), Expect = 1.58173e-16
Identity = 97/131 (74.05%), Postives = 107/131 (81.68%), Query Frame = 3
Query: 5487 KSGNKGALGPIGPSGLRGPPGNPGKDGMIXXXXXXXXXXXXXNVGAPGNKGNIGEPGPKGKTGNSGKPGPAGKNGADGSEGLSGNIGSPGFXXXXXXXXXXXXXXXXXXXXXXXXXXNQGISGKNGAPGTE 5879
KSG KG+LGP G +GLRGP GNPGKDG +GPLG PGLRGPPG++G PG KGNIG PG KGK GN+GKPGP GKNG DGSEG GN GSPGFPGPNGDPG +GPPG G G+ GYPGNQG++GKNG PG E
Sbjct: 397 KSGRKGSLGPTGLTGLRGPQGNPGKDGTLGPLGTPGLRGPPGSIGTPGLKGNIGPPGSKGKVGNAGKPGPLGKNGIDGSEGPIGNAGSPGFPGPNGDPGPNGPPGSLGLAGLVGYPGNQGLAGKNGNPGVE 527

HSP 6 Score: 87.0409 bits (214), Expect = 1.58173e-16
Identity = 97/131 (74.05%), Postives = 107/131 (81.68%), Query Frame = -3
Query: 2392 KSGNKGALGPIGPSGLRGPPGNPGKDGMIXXXXXXXXXXXXGNVGAPGNKGNIGEPGPKGKTGNSGKPGPAGKNGADGSEGLSGNIGSPGFXXXXXXXXXXXXXXXXXXXXXXXXXGNQGISGKNGAPGTE 2784
KSG KG+LGP G +GLRGP GNPGKDG +GPLG PGLRGPPG++G PG KGNIG PG KGK GN+GKPGP GKNG DGSEG GN GSPGFPGPNGDPG +GPPG G G+ GYPGNQG++GKNG PG E
Sbjct: 397 KSGRKGSLGPTGLTGLRGPQGNPGKDGTLGPLGTPGLRGPPGSIGTPGLKGNIGPPGSKGKVGNAGKPGPLGKNGIDGSEGPIGNAGSPGFPGPNGDPGPNGPPGSLGLAGLVGYPGNQGLAGKNGNPGVE 527

HSP 7 Score: 60.8474 bits (146), Expect = 1.24375e-08
Identity = 53/109 (48.62%), Postives = 66/109 (60.55%), Query Frame = 3
Query: 4962 GKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGP------IGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIG 5270
G +G PG RG GP+G +GK GRKG LGP GLTG RG +G+ G G GP G+ G +GP GL+G GP G KG+ G+ G+PGP G+ G+ G GPIG
Sbjct: 378 GPNGAPGPRGEIGPNGPDGKSGRKGS------LGPTGLTGLRGPQGNPGKDGTLGPLGTPGLRGPPGSIGTPGLKGNIGPPGSKGKVGNAGKPGPLGKNGIDGSEGPIG 480

HSP 8 Score: 60.8474 bits (146), Expect = 1.24375e-08
Identity = 53/109 (48.62%), Postives = 66/109 (60.55%), Query Frame = -3
Query: 3001 GKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGP------IGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIG 3309
G +G PG RG GP+G +GK GRKG LGP GLTG RG +G+ G G GP G+ G +GP GL+G GP G KG+ G+ G+PGP G+ G+ G GPIG
Sbjct: 378 GPNGAPGPRGEIGPNGPDGKSGRKGS------LGPTGLTGLRGPQGNPGKDGTLGPLGTPGLRGPPGSIGTPGLKGNIGPPGSKGKVGNAGKPGPLGKNGIDGSEGPIG 480
BLAST of collagen vs. Smed Unigenes AA
Match: SMU15002271 (Asxlregen_comp67208_c0_seq1|m.27270|m.10319)

HSP 1 Score: 245.358 bits (625), Expect = 4.2761e-65
Identity = 117/232 (50.43%), Postives = 158/232 (68.10%), Query Frame = 3
Query: 7461 LRSPTKGVVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIELKHLASLLPIRDIGXXXXXXRKSKFGLTIEEVCFS* 8156
LRSPTKG+ + DDP+ A GNNAI P+GTKEVPAR+CK LS NP++ DGEYWIDPNGGR SDAV V+C+I+ ++TCIKP+++ SW K H W +++ E +Y I N QL +LK S+TATQ+I+ C N II + + LL DDDTI+ + K ++ VIKDECQY K SEAE+I+E++ AS LPI+D+G+ I ++R K G+ + EVC+S*
Sbjct: 1064 LRSPTKGLTFSDDPSVAHSFGNNAIITPRGTKEVPARSCKHLSEHNPDLSDGEYWIDPNGGRVSDAVPVYCRIATQQTCIKPISKIYKTASWFKKYQKDHVWFQTINGIGEFEYDIENYQLNYLKALSETATQQISLNCINQAIILDRQGKMSTVWTSLLGDDDTILSLQHPKRRFKVIKDECQYEKFSEAETILEVRGKASRLPIKDVGLIIDSDRSRKVGIELGEVCYS* 1295

HSP 2 Score: 245.358 bits (625), Expect = 4.2761e-65
Identity = 117/232 (50.43%), Postives = 158/232 (68.10%), Query Frame = -3
Query:  115 LRSPTKGVVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIELKHLASLLPIRDIGXXXXXNRKSKFGLTIEEVCFS* 810
LRSPTKG+ + DDP+ A GNNAI P+GTKEVPAR+CK LS NP++ DGEYWIDPNGGR SDAV V+C+I+ ++TCIKP+++ SW K H W +++ E +Y I N QL +LK S+TATQ+I+ C N II + + LL DDDTI+ + K ++ VIKDECQY K SEAE+I+E++ AS LPI+D+G+ I ++R K G+ + EVC+S*
Sbjct: 1064 LRSPTKGLTFSDDPSVAHSFGNNAIITPRGTKEVPARSCKHLSEHNPDLSDGEYWIDPNGGRVSDAVPVYCRIATQQTCIKPISKIYKTASWFKKYQKDHVWFQTINGIGEFEYDIENYQLNYLKALSETATQQISLNCINQAIILDRQGKMSTVWTSLLGDDDTILSLQHPKRRFKVIKDECQYEKFSEAETILEVRGKASRLPIKDVGLIIDSDRSRKVGIELGEVCYS* 1295

HSP 3 Score: 96.6709 bits (239), Expect = 1.79997e-19
Identity = 123/237 (51.90%), Postives = 144/237 (60.76%), Query Frame = 3
Query: 4596 EQGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXINGKSGKPGLRGRTGPDGN---------NGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRP------GPQGEAGLIGPTG 5261
E G G IG EG TGP G G +G+KGD G G +G G+ G GP G G G+ GP G QGP GERG G DG+ G GP+G IG PG + G G+ G KG +GESG G G SG PGL G+TGP G+ +G+ G +GE G VG GP G G RG RG SGNPG SGPKGSQG++GPIG+EGK G GPKGQKGD GRP GP+GE G++GP G
Sbjct: 107 EPGLAGSIGPEGGTGPQGNRGLTGDKGDIGLAGLKGSNGEPGLQGPQGLRGPAGRVGPAGIQGPTGERGKQGTDGVPGSLGPQGAIGPPG---------QSGIPGEIGNKGIRGESGIKGAKGDSGNPGLAGKTGPSGSLGPPGYPGVDGRPGVRGEAGIVGPQGPVGKVGQRGQRGPSGNPGLSGPKGSQGEEGPIGIEGKQGSAGPKGQKGDPGRPGETGDEGPRGERGVVGPAG 334

HSP 4 Score: 96.6709 bits (239), Expect = 1.79997e-19
Identity = 123/237 (51.90%), Postives = 144/237 (60.76%), Query Frame = -3
Query: 3010 EQGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPVGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXGINGKSGKPGLRGRTGPDGN---------NGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRP------GPQGEAGLIGPTG 3675
E G G IG EG TGP G G +G+KGD G G +G G+ G GP G G G+ GP G QGP GERG G DG+ G GP+G IG PG + G G+ G KG +GESG G G SG PGL G+TGP G+ +G+ G +GE G VG GP G G RG RG SGNPG SGPKGSQG++GPIG+EGK G GPKGQKGD GRP GP+GE G++GP G
Sbjct: 107 EPGLAGSIGPEGGTGPQGNRGLTGDKGDIGLAGLKGSNGEPGLQGPQGLRGPAGRVGPAGIQGPTGERGKQGTDGVPGSLGPQGAIGPPG---------QSGIPGEIGNKGIRGESGIKGAKGDSGNPGLAGKTGPSGSLGPPGYPGVDGRPGVRGEAGIVGPQGPVGKVGQRGQRGPSGNPGLSGPKGSQGEEGPIGIEGKQGSAGPKGQKGDPGRPGETGDEGPRGERGVVGPAG 334

HSP 5 Score: 56.225 bits (134), Expect = 3.74368e-07
Identity = 57/122 (46.72%), Postives = 70/122 (57.38%), Query Frame = 3
Query: 4959 NGKSGKPGLRGRTGPDGNNGKQGR------KGEIGDVGLLGPQGLTGPRGLRGSSGNPGD---SGPKGSQGD---QGPIGLE---GKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDG 5279
GKSGKPG G G DG G G +GE G G +GP+G TGP+G RG +G+ GD +G KGS G+ QGP GL G+ GP G +G G+RG+ G G G +GP G IGP G
Sbjct: 75 RGKSGKPGASGIPGKDGTPGSHGVVGPIGPRGEPGLAGSIGPEGGTGPQGNRGLTGDKGDIGLAGLKGSNGEPGLQGPQGLRGPAGRVGPAGIQGPTGERGKQGTDGVPGSLGPQGAIGPPG 196

HSP 6 Score: 56.225 bits (134), Expect = 3.74368e-07
Identity = 57/122 (46.72%), Postives = 70/122 (57.38%), Query Frame = -3
Query: 2992 NGKSGKPGLRGRTGPDGNNGKQGR------KGEIGDVGLLGPQGLTGPRGLRGSSGNPGD---SGPKGSQGD---QGPIGLE---GKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDG 3312
GKSGKPG G G DG G G +GE G G +GP+G TGP+G RG +G+ GD +G KGS G+ QGP GL G+ GP G +G G+RG+ G G G +GP G IGP G
Sbjct: 75 RGKSGKPGASGIPGKDGTPGSHGVVGPIGPRGEPGLAGSIGPEGGTGPQGNRGLTGDKGDIGLAGLKGSNGEPGLQGPQGLRGPAGRVGPAGIQGPTGERGKQGTDGVPGSLGPQGAIGPPG 196

HSP 7 Score: 53.5286 bits (127), Expect = 1.95066e-06
Identity = 48/107 (44.86%), Postives = 61/107 (57.01%), Query Frame = 3
Query: 4959 NGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDG 5279
+GK G+PG G GP G G +G +G G+ G GPQGL G G+PG GP+GS G QG +G +GP GP G++G +GR G G+ GL G GP G G
Sbjct: 699 DGKPGEPGTPGIDGPPGQVGPEGPRGPSGETGEQGPQGLPG------KPGDPGAEGPRGSSGKQG---FQGPTGPIGPTGKQGKQGRAGKSGKNGLTGRKGPAGQRG 796

HSP 8 Score: 53.5286 bits (127), Expect = 1.95066e-06
Identity = 48/107 (44.86%), Postives = 61/107 (57.01%), Query Frame = -3
Query: 2992 NGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDG 3312
+GK G+PG G GP G G +G +G G+ G GPQGL G G+PG GP+GS G QG +G +GP GP G++G +GR G G+ GL G GP G G
Sbjct: 699 DGKPGEPGTPGIDGPPGQVGPEGPRGPSGETGEQGPQGLPG------KPGDPGAEGPRGSSGKQG---FQGPTGPIGPTGKQGKQGRAGKSGKNGLTGRKGPAGQRG 796

HSP 9 Score: 52.7582 bits (125), Expect = 3.60718e-06
Identity = 46/99 (46.46%), Postives = 52/99 (52.53%), Query Frame = 3
Query: 4989 GRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDGVK 5285
GR G G +GK G G G G G G+ GP G RG G G GP+G G QG GL G G G G KG G PG QG GL GP G +GP G++
Sbjct: 70 GRDGARGKSGKPGASGIPGKDGTPGSHGVVGPIGPRGEPGLAGSIGPEGGTGPQGNRGLTGDKGDIGLAGLKGSNGEPGLQGPQGLRGPAGRVGPAGIQ 168

HSP 10 Score: 52.7582 bits (125), Expect = 3.60718e-06
Identity = 46/99 (46.46%), Postives = 52/99 (52.53%), Query Frame = -3
Query: 2986 GRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDGVK 3282
GR G G +GK G G G G G G+ GP G RG G G GP+G G QG GL G G G G KG G PG QG GL GP G +GP G++
Sbjct: 70 GRDGARGKSGKPGASGIPGKDGTPGSHGVVGPIGPRGEPGLAGSIGPEGGTGPQGNRGLTGDKGDIGLAGLKGSNGEPGLQGPQGLRGPAGRVGPAGIQ 168
BLAST of collagen vs. Smed Unigenes AA
Match: SMU15036469 (dd_smedV4_1070_0_1|m.1593|m.7941)

HSP 1 Score: 168.318 bits (425), Expect = 2.10492e-41
Identity = 89/225 (39.56%), Postives = 130/225 (57.78%), Query Frame = 3
Query: 7482 VVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIELKHLASLLPIRDIGXXXXXXR--KSKFGLTIEEVCF 8150
++ DDP A+ LGN+AI P GT PAR+C L+ NP+ PDG YWIDPNGG+ DAV+V+CKI E+KTCIKPL +I ++ W ++ I YS+ Q+ FLK+ S+ A+Q IT C N P+I N V R+ D+D I+ + Y V++D CQ++ + + +E+ + LPI+DI ++ +N R +S+ IEEVCF
Sbjct: 1078 IIQADDPLIAKYLGNDAISKPLGTSNSPARSCLHLAEMNPSFPDGIYWIDPNGGKIDDAVQVYCKIKEKKTCIKPLVFKIQLQKPK------FNWFSQSNDNKFISYSLDQQQMTFLKMISNKASQFITINCRNMPVI-----KNSVKPLRIFTDNDIILDSSDQIFSYKVLEDNCQHNSQDLSSTRLEITSRPTRLPIKDIEVDTVNVRSERSQIEYNIEEVCF 1291

HSP 2 Score: 168.318 bits (425), Expect = 2.10492e-41
Identity = 89/225 (39.56%), Postives = 130/225 (57.78%), Query Frame = -3
Query:  121 VVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPITKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIELKHLASLLPIRDIGXXXXXNR--KSKFGLTIEEVCF 789
++ DDP A+ LGN+AI P GT PAR+C L+ NP+ PDG YWIDPNGG+ DAV+V+CKI E+KTCIKPL +I ++ W ++ I YS+ Q+ FLK+ S+ A+Q IT C N P+I N V R+ D+D I+ + Y V++D CQ++ + + +E+ + LPI+DI ++ +N R +S+ IEEVCF
Sbjct: 1078 IIQADDPLIAKYLGNDAISKPLGTSNSPARSCLHLAEMNPSFPDGIYWIDPNGGKIDDAVQVYCKIKEKKTCIKPLVFKIQLQKPK------FNWFSQSNDNKFISYSLDQQQMTFLKMISNKASQFITINCRNMPVI-----KNSVKPLRIFTDNDIILDSSDQIFSYKVLEDNCQHNSQDLSSTRLEITSRPTRLPIKDIEVDTVNVRSERSQIEYNIEEVCF 1291

HSP 3 Score: 61.6178 bits (148), Expect = 7.44958e-09
Identity = 54/111 (48.65%), Postives = 64/111 (57.66%), Query Frame = 3
Query: 4956 INGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGK------SGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIG 5270
+NG G G G G G +G G KGE G +GLLGPQGL+GP GL+GS G PG G KG++G GP+G G G G +G+ G G PGPQG GL G GP G
Sbjct: 237 VNGAPGPIGQPGIMGSRGKDGPIGIKGENGPLGLLGPQGLSGPPGLQGSLGPPGPQGSKGNEGKIGPVGPAGSPGSPGLIGEIGERGENGPFGNPGPQGPRGLRGSAGPKG 347

HSP 4 Score: 61.6178 bits (148), Expect = 7.44958e-09
Identity = 54/111 (48.65%), Postives = 64/111 (57.66%), Query Frame = -3
Query: 3001 INGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGK------SGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIG 3315
+NG G G G G G +G G KGE G +GLLGPQGL+GP GL+GS G PG G KG++G GP+G G G G +G+ G G PGPQG GL G GP G
Sbjct: 237 VNGAPGPIGQPGIMGSRGKDGPIGIKGENGPLGLLGPQGLSGPPGLQGSLGPPGPQGSKGNEGKIGPVGPAGSPGSPGLIGEIGERGENGPFGNPGPQGPRGLRGSAGPKG 347
BLAST of collagen vs. Smed Unigenes AA
Match: SMU15029208 (SmedSxlregen_c102983_g1_i1|m.43148|m.11576)

HSP 1 Score: 153.295 bits (386), Expect = 9.71203e-37
Identity = 104/299 (34.78%), Postives = 159/299 (53.18%), Query Frame = 3
Query: 7293 QGPVGITGPRGDPG---------IPGAIGPTGLSGKKGSNXXXXXXXXXXXXXXXXXXXXIMAMQLRSPTKGVVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWN-SPITKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIELKHLASLLPIRDIGXXXXXXR-KSKFGLTIEEVCFS* 8156
QG VG TG +G PG +PG GP GL P P + +R+ + ++ DD A + LG++ +K P GTK++PARTCKQL NPN+ DG Y+IDPNGG+ DA +V C+ +++CI+P + ++ W+ S +T+ +W + + + Y I QL FLK++S A QRIT C ++G+ E + V+ L +D D + KY+VI+D C+ S + +E+ A LPIRDI +N +++ + +FGL I +VCFS*
Sbjct: 1038 QGSVGPTGTKGFPGENGSPGPVGMPGRDGPAGLP-----------GPVGNTGPPGPPGPPAVFFPVRTVRRDLLN-DDALAVKYLGSDVVKKPLGTKDIPARTCKQLLDANPNLQDGFYYIDPNGGKADDAFRVLCRSQRKESCIEPKSPSYKLKHWDASDVTEYRSWFGEITGTFKFDYQIEASQLMFLKLFSTNARQRITINCSRLSVVGNSE--HPVI---LYSDHDEEVLRNGDLFKYSVIRDGCKNSAEIIDSTELEMDTEAIRLPIRDIALNTGSSKEQQQFGLDIGQVCFS* 1319

HSP 2 Score: 153.295 bits (386), Expect = 9.71203e-37
Identity = 104/299 (34.78%), Postives = 159/299 (53.18%), Query Frame = -3
Query:  115 QGPVGITGPRGDPG---------IPGAIGPTGLSGKKGSNXXXXXXXXXXXXXXXXXXXGIMAMQLRSPTKGVVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWN-SPITKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLLADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIELKHLASLLPIRDIGXXXXXNR-KSKFGLTIEEVCFS* 978
QG VG TG +G PG +PG GP GL P P + +R+ + ++ DD A + LG++ +K P GTK++PARTCKQL NPN+ DG Y+IDPNGG+ DA +V C+ +++CI+P + ++ W+ S +T+ +W + + + Y I QL FLK++S A QRIT C ++G+ E + V+ L +D D + KY+VI+D C+ S + +E+ A LPIRDI +N +++ + +FGL I +VCFS*
Sbjct: 1038 QGSVGPTGTKGFPGENGSPGPVGMPGRDGPAGLP-----------GPVGNTGPPGPPGPPAVFFPVRTVRRDLLN-DDALAVKYLGSDVVKKPLGTKDIPARTCKQLLDANPNLQDGFYYIDPNGGKADDAFRVLCRSQRKESCIEPKSPSYKLKHWDASDVTEYRSWFGEITGTFKFDYQIEASQLMFLKLFSTNARQRITINCSRLSVVGNSE--HPVI---LYSDHDEEVLRNGDLFKYSVIRDGCKNSAEIIDSTELEMDTEAIRLPIRDIALNTGSSKEQQQFGLDIGQVCFS* 1319

HSP 3 Score: 52.373 bits (124), Expect = 4.8197e-06
Identity = 70/198 (35.35%), Postives = 82/198 (41.41%), Query Frame = 3
Query: 4788 VGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXINGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQG-LTGPRGLRGSSGNPGDSGPKGS------------------------QGDQGPIGLEGKSGP---------FGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDG 5279
+G G G GIVGPPGPEG G GLP G PG G GP G GK+G+ G +G V G G +GPRG G SG PG SGPKG +G++GP G++G GP GP+G G G PG G G IGP G IGP+G
Sbjct: 611 IGVPGIPGKPGIVGPPGPEGFKGDKGLP---------------------------------GNPGTPGVIGPQGLRGKRGKAGGLGKVSFTGRSGGQSGPRGKPGPSGKPGTSGPKGVAGPPGPMGEPGPTGPTGPIGSIGLKGERGPNGIDGSIGPPGRNGAPGAVGPQGLIGLPGTPGTTGSVGEIGPPGQIGPNG 775

HSP 4 Score: 52.373 bits (124), Expect = 4.8197e-06
Identity = 70/198 (35.35%), Postives = 82/198 (41.41%), Query Frame = -3
Query: 2992 VGERGTHGPDGIVGPPGPEGRIGSPGLPXXXXXXXXXXXXXXXXXXXXXXXXXXXGINGKSGKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQG-LTGPRGLRGSSGNPGDSGPKGS------------------------QGDQGPIGLEGKSGP---------FGPKGQKGDRGRPGPQGEAGLIGPTGPIGPDG 3483
+G G G GIVGPPGPEG G GLP G PG G GP G GK+G+ G +G V G G +GPRG G SG PG SGPKG +G++GP G++G GP GP+G G G PG G G IGP G IGP+G
Sbjct: 611 IGVPGIPGKPGIVGPPGPEGFKGDKGLP---------------------------------GNPGTPGVIGPQGLRGKRGKAGGLGKVSFTGRSGGQSGPRGKPGPSGKPGTSGPKGVAGPPGPMGEPGPTGPTGPIGSIGLKGERGPNGIDGSIGPPGRNGAPGAVGPQGLIGLPGTPGTTGSVGEIGPPGQIGPNG 775
BLAST of collagen vs. Smed Unigenes AA
Match: SMU15040144 (dd_smedV4_740_0_1|m.36260|m.3408)

HSP 1 Score: 147.517 bits (371), Expect = 4.88724e-35
Identity = 114/289 (39.45%), Postives = 166/289 (57.44%), Query Frame = 3
Query: 7296 GPVGITGPRGDPGIPGAIGPTGLSGKKGSNXXXXXXXXXXXXXXXXXXXXIMAMQLRSPTKGVVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWN-SPITKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLL-ADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIELKHLASLLPIRDIGXXXXXXRKSKFGLTIEEVCFS* 8156
GP G TGP G+ G+ +GP G++G+ G +GP G +G GPPGPPGPP ++ + + P + +Y DD AA +LG++ I P GTK++PAR+C L S + ++ DG Y+IDPNGG+ +DA +VFCK+ +TCI P S S++ +PI K + L Y I N QL FLK+ S A Q I AC N ++ K P ++ D++ + L Y VIKDECQ S EAE+++ + + LPIRD+ ++ S+F L + +VCFS*
Sbjct: 1043 GPKGPTGPNGEMGL---MGPMGVTGRDGPSGPHGLMGNAGPPGPPGPPAMMLPI-IYDPNR-PMYSDDANAANILGSDTISVPLGTKDLPARSCNHLKSTSSHLKDGTYFIDPNGGKMNDAFEVFCKMETGETCISPKQSSFSKISYSENPINK-YISYGELSGIQRFDYVIDNTQLMFLKMVSTRANQEIKIACNNMAVV------EKTEYPAIIFTDNNRELTKDDHHLSYKVIKDECQNMSSEEAETVLLVSGDSKRLPIRDL-TLGSDSDISEFRLKLSKVCFS* 1318

HSP 2 Score: 147.517 bits (371), Expect = 4.88724e-35
Identity = 114/289 (39.45%), Postives = 166/289 (57.44%), Query Frame = -3
Query:  115 GPVGITGPRGDPGIPGAIGPTGLSGKKGSNXXXXXXXXXXXXXXXXXXXGIMAMQLRSPTKGVVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSENPNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWN-SPITKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIGSEEKLNKVMAPRLL-ADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIELKHLASLLPIRDIGXXXXXNRKSKFGLTIEEVCFS* 975
GP G TGP G+ G+ +GP G++G+ G +GP G +G GPPGPPGPP ++ + + P + +Y DD AA +LG++ I P GTK++PAR+C L S + ++ DG Y+IDPNGG+ +DA +VFCK+ +TCI P S S++ +PI K + L Y I N QL FLK+ S A Q I AC N ++ K P ++ D++ + L Y VIKDECQ S EAE+++ + + LPIRD+ ++ S+F L + +VCFS*
Sbjct: 1043 GPKGPTGPNGEMGL---MGPMGVTGRDGPSGPHGLMGNAGPPGPPGPPAMMLPI-IYDPNR-PMYSDDANAANILGSDTISVPLGTKDLPARSCNHLKSTSSHLKDGTYFIDPNGGKMNDAFEVFCKMETGETCISPKQSSFSKISYSENPINK-YISYGELSGIQRFDYVIDNTQLMFLKMVSTRANQEIKIACNNMAVV------EKTEYPAIIFTDNNRELTKDDHHLSYKVIKDECQNMSSEEAETVLLVSGDSKRLPIRDL-TLGSDSDISEFRLKLSKVCFS* 1318

HSP 3 Score: 59.3066 bits (142), Expect = 3.67659e-08
Identity = 49/100 (49.00%), Postives = 58/100 (58.00%), Query Frame = 3
Query: 4971 GKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIG 5270
GKPG G+ G G +GK G GE G G +GPQGL GP G +G G PGD G G +G G +G +G GP GP G G+ G GP G G GP+GP G
Sbjct: 974 GKPGKPGKEGKPGKDGKTGPVGEPGHPGWMGPQGLLGPPGPQGDRGKPGDPGSPGIEGSPGDVGDQGVPGPKGPTGPNGEMGLMGPMGVTGRDGPSGPHG 1073

HSP 4 Score: 59.3066 bits (142), Expect = 3.67659e-08
Identity = 49/100 (49.00%), Postives = 58/100 (58.00%), Query Frame = -3
Query: 3001 GKPGLRGRTGPDGNNGKQGRKGEIGDVGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKGQKGDRGRPGPQGEAGLIGPTGPIG 3300
GKPG G+ G G +GK G GE G G +GPQGL GP G +G G PGD G G +G G +G +G GP GP G G+ G GP G G GP+GP G
Sbjct: 974 GKPGKPGKEGKPGKDGKTGPVGEPGHPGWMGPQGLLGPPGPQGDRGKPGDPGSPGIEGSPGDVGDQGVPGPKGPTGPNGEMGLMGPMGVTGRDGPSGPHG 1073
The following BLAST results are available for this feature:
BLAST of collagen vs. RefSeq Human
Analysis Date: 2016-03-08 (Girardia Sp. BLASTX Human)
Total hits: 5
Match NameE-valueIdentityDescription
gi|767973237|ref|XP_011536230.1|4.849390e-2532.42PREDICTED: collagen alpha-1(II) chain isoform X1 [... [more]
gi|767973239|ref|XP_011536231.1|4.849390e-2532.42PREDICTED: collagen alpha-1(II) chain isoform X1 [... [more]
gi|767973241|ref|XP_011536232.1|4.849390e-2532.42PREDICTED: collagen alpha-1(II) chain isoform X1 [... [more]
gi|767973243|ref|XP_011536233.1|4.849390e-2532.42PREDICTED: collagen alpha-1(II) chain isoform X1 [... [more]
gi|767973245|ref|XP_011536234.1|4.849390e-2532.42PREDICTED: collagen alpha-1(II) chain isoform X1 [... [more]
back to top
BLAST of collagen vs. uniprot
Analysis Date: 2016-03-08 (Girardia Sp. BLASTX Swissprot Uniprot)
Total hits: 5
Match NameE-valueIdentityDescription
gi|18202526|sp|Q28668|CO1A2_RABIT1.693030e-2936.51RecName: Full=Collagen alpha-2(I) chain; AltName: ... [more]
gi|115286|sp|P02460|CO2A1_CHICK3.468060e-2632.73RecName: Full=Collagen alpha-1(II) chain; AltName:... [more]
gi|82202407|sp|Q6P4Z2|CO2A1_XENTR4.161510e-2635.29RecName: Full=Collagen alpha-1(II) chain; AltName:... [more]
gi|146286085|sp|Q91717|CO2A1_XENLA1.211580e-2535.40RecName: Full=Collagen alpha-1(II) chain; AltName:... [more]
gi|8039779|sp|P02465|CO1A2_BOVIN1.361030e-2534.67RecName: Full=Collagen alpha-2(I) chain; AltName: ... [more]
back to top
BLAST of collagen vs. RefSeq Drosophila melanogaster
Analysis Date: 2016-03-09 (Girardia Sp. BLASTX Drosophila melanogaster)
Total hits: 5
Match NameE-valueIdentityDescription
gi|24581820|ref|NP_723044.1|4.631670e-741.43collagen type IV, isoform A [Drosophila melanogast... [more]
gi|24581822|ref|NP_723045.1|4.631670e-741.43collagen type IV, isoform B [Drosophila melanogast... [more]
gi|24581824|ref|NP_723046.1|4.631670e-741.43collagen type IV, isoform C [Drosophila melanogast... [more]
gi|442619464|ref|NP_001262641.1|2.132880e-647.52CG42342, isoform T [Drosophila melanogaster][more]
gi|442619462|ref|NP_001247141.2|2.563810e-647.52CG42342, isoform S [Drosophila melanogaster][more]
back to top
BLAST of collagen vs. Smed Unigenes AA
Analysis Date: 2016-03-09 (Girardia Sp. BLASTX Schmidtea mediterranea)
Total hits: 5
Match NameE-valueIdentityDescription
SMU150400336.447770e-12981.60dd_smedV4_702_0_1|m.35199|m.6295[more]
SMU150022714.276100e-6550.43Asxlregen_comp67208_c0_seq1|m.27270|m.10319[more]
SMU150364692.104920e-4139.56dd_smedV4_1070_0_1|m.1593|m.7941[more]
SMU150292089.712030e-3734.78SmedSxlregen_c102983_g1_i1|m.43148|m.11576[more]
SMU150401444.887240e-3539.45dd_smedV4_740_0_1|m.36260|m.3408[more]
back to top
Sequences
The following sequences are available for this feature:

mRNA sequence

>Gsp_012135 ID=Gsp_012135|Name=collagen|organism=Girardia sp.|type=mRNA|length=8270bp
TTTTTTTTGAAAAAAATCAATACGTTTATTATTGATTAATTACAGAACAA
AATAAATTTCATTTAATATACAACTATTAGCAAAAATGTAAATTGATTTT
TTTAAAGTCAGAAATTATGAAAAGCAGACTTCTTCTATTGTCAATCCAAA
TTTACTCTTTCTATTATTGATAATATTGATGCCAATATCACGAATCGGTA
GTAAGCTCGCCAAATGTTTCAATTCTATTATACTTTCAGCTTCACTGGAT
TTGGAATATTGACACTCATCTTTGATAACTGTGTATTTCAATTTTGATTG
TGTCATTTTAATTATGGTATCATCATCAGCTAATAAACGGGGTGCCATTA
CTTTATTTAATTTTTCTTCACTGCCAATTATCGGATGATTTTCACAAGCA
AATGTGATTCTTTGAGTGGCTGTGTCACTATAAATTTTGAGAAATGCGAG
TTGACCATTTGGTATTGAATATTGGATCTCGTCTGAATTTAATAATGATT
TCAACCATGTGTGCCCTTTGGTAATCGGACTATTCCATGATCGAATACTA
ATTTCTTGATTTAATGGTTTAATACAGGTTTTTTCTTCAGATATTTTACA
AAAGACTTTGACAGCATCACTTGTCCTACCACCATTTGGATCTATCCAAT
ATTCGCCATCAGGAATATTTGGATTTTCTGAACTCAACTGTTTACATGTT
CTAGCTGGAACTTCTTTGGTGCCTTGAGGATTTTTAATGGCATTATTTCC
AAGAAGTTCAGCGGCAGCAGGATCATCTCCATAAACAACACCCTTTGTAG
GAGAACGTAGTTGCATAGCCATAATACCACCTGGTGGTCCAGGAGGACCT
GGAGGTCCTAATGGTCCCAATTGGCCTAAAGGACCATTTGAACCTTTCTT
ACCCGAAAGACCAGTTGGACCAATAGCTCCTGGTATTCCAGGATCACCAC
GGGGACCAGTTATTCCAACTGGACCTTGTATTCCATCAGGACCCTTAGGT
CCAACAGGCCCATCTGGACCAGGAAAACCTGGTGGTCCTGGATCTCCTTC
TATCCCTATTAATCCTGGATATCCAGGGACTCCTTTTGGTCCTTTTACAC
CTTTTGCACCTCTAGGACCCGGTCTTCCTTCACGTCCTGGTACTCCATCT
TTTCCTTTTTTTCCTTTTGCTCCTCTCAAACCAACATCTCCAAGTACACC
GCCTGGTCCAGGCTTACCATCTGCACCTTTATCACCAGGAGTTCCTAACT
TTCCTGATTTTCCGCTAGTTCCTCCTTGGCCATCATATCCTTTCCCACCA
CTTGTTCCATCTTTACCTGGACTGCCTGCAGCTCCTTTACTACCAGGAGG
TCCAGGGTCACCTCGTGGACCTACTGGACCAGTAATTCCAGGATCTCCAA
CCACACCGACCGGACCAGGTTTGCCCATTTCTCCATTATCTCCTATTGGA
CCACGATTTCCTGGAGGACCTTTTTTTCCTGGTTTTCCTGTTATCCCCTG
TGGACCCGAGGGTCCTGGTGGACCCAAATATCCAGGATAGCCGTCAGGAC
CAGATGTACCAACTTTTCCATCAGTACCAGGAGCTCCGGGTTTACCTTTT
GGACCTACTGGACCTCTTAAACCTAAAGGACCTTTTCGACCACCTGATCC
ATCTAAACCTGTAACTCCTTGAGGACCAAGAGGTCCAGGTTTTCCAGCAG
GCCCAGGTTTACCTTGAATACCAGATTTTCCTTGTTTCCCTGGACCTCCT
GGACTACCTTGTCTGCCAGGACCACCGCGCGGCCCACCTGGGCCAGATTT
TCCAGGATCACCAACATCTCCGGGTGGACCAGAAGCACCTTCTGGGCCAC
TTAATCCTTCAGGTCCTTGTTTTCCTGGTGCTCCAGGCTTGCCTGGCTTT
CCGTCTGGTCCAGAATCACCAGAAGCACCAATAGGTCCCATATCTCCAGG
ATCCCCAGGGTCGCCAGGATTACCTTCTGGACCTTGTTTTCCAGGTGGTC
CCAGATCACCGGGTAATCCATCCATACCTGGCTTTCCAACTGGTCCTCTA
ATACCAGCAGCTCCAACACTGCGAGGTTTGTGTTTTTCTACTCCTTCTCC
TCCTGCAGATCCTGGTGAACCATTGTCCCCTGGAGGACCTCTTGGCCCAT
CTGGTCCTGGACTGCCAAGCAAACCTGGTGCACCAACCGGACCTTCGGGT
CCTTGTTTTCCCAATGGACCGAGAGCACCTGGTGGGCCATTCGGCCCATA
ATTTCCTGGAATTCCAACTATACCAGGAGCTCCTTTTAATCCATTTGGTC
CAGGAGGACCAGGATTTCCACTGTCACCAATAGGACCAGGAACTCCCGCT
TTACCAACAGGACCTACCGGTCCAGATTTTCCAGGATTTCCTTCAGTTCC
AGGTGCACCATTTTTACCGCTTATACCTTGATTTCCAGGATATCCAGGAA
TTCCAATAGCACCTGCTGGACCAGGAGGTCCACTGGCTCCAGGATCTCCA
TTTGGTCCTGGGAAACCAGGACTACCAATATTTCCACTCAGGCCTTCACT
GCCATCAGCCCCATTCTTTCCAGCTGGTCCAGGTTTTCCAGAATTTCCAG
TTTTGCCTTTTGGTCCTGGTTCACCGATGTTTCCTTTATTTCCAGGTGCT
CCAACATTGCCTGGTGGTCCTCTTAATCCTGGAAGTCCTAGAGGGCCTAT
CATACCATCTTTCCCGGGATTGCCTGGGGGTCCACGTAATCCAGAAGGTC
CTATCGGTCCTAAGGCACCTTTATTTCCAGATTTTCCATCAGGACCTGGG
GGTCCAATTTCTCCTCTGGGTCCAGGAGCTCCATTCGGACCAGGTCTTCC
TAAAGCACCTGTCGAACCACCAGGACCAGAGTTTCCAGGATGACCTGGAT
TTCCAGGTGGACCATCAGCTCCTGGTTTCCCATTTGGTCCTGGACTTCCG
GGACTACCATTAGGTCCGCTTTCACCTCTTGATCCTTTAACTCCATCTGG
GCCTATTGGACCAGTTGGTCCAATCAGTCCGGCTTCTCCCTGGGGTCCTG
GACGACCTCTATCTCCTTTTTGTCCTTTCGGACCAAATGGTCCGGATTTT
CCTTCCAATCCGATTGGACCTTGATCTCCTTGAGAACCTTTAGGACCAGA
ATCTCCTGGATTACCTGATGATCCACGCAACCCTCGTGGTCCTGTTAATC
CTTGTGGACCAAGCAATCCAACATCACCAATTTCACCTTTTCTTCCTTGT
TTCCCATTATTACCATCAGGACCTGTTCGTCCCCTTAAACCTGGCTTTCC
AGATTTACCATTTATTCCAGTTTTACCGCTTTCTCCTTTCCCACCTTTTG
CTCCTTCATCTCCTCCTTCACCCTTTTTGCCTATTTCACCAGGTCGGCCT
GGTAATCCTGGAGAACCAATTCTACCTTCTGGACCAGGTGGACCTACAAT
TCCATCAGGTCCATGAGTACCTCTTTCTCCGACTGGACCTTGTGGACCAG
TAGGACCATTTTGACCAGGTTGACCAATTGGTCCAGGAGGACCTGGTGGA
CCAATATCTCCTGTATCACCCTGTTCACCAGTACCCCCACTGTCTCCTTT
TTCTCCAGATGGTCCTGGATATCCTACTGGACCTGTTAAACCTTCTTTAC
CTATAGGCCCAGGTGGACCTTGTTCACCTCTTGATCCAGGCGGCCCTGAA
ACACCACCAGCTCCAGATTTACCTTCAGGCCCAACAGAACCAGGTGGACC
CTGCAACCCAGGTGCACCTCTTTGTCCATCATTTCCAGGAGGACCTGTTG
GACCCTGTGCACCAGGATCTCCATCAGGACCTGGTTTCCCCCTCTTTCCA
GGATTTCCTCTGATTCCCATGGGACCAGTAGCTTCATTTAAAGTTCTGAA
TTGACCATGTACACATTCAATATAAATTAAAACCAATAAAATAGCTCCAG
AAATAATTGAGATTTTGAGCATTTTAAGTTAAGCAGTTCAAGATAGAACT
GTTAAAAGAAATATTTCCAATTTGTCTTAATTGATATCGGTTTTTGAATA
ATAATTCCTTTAATTTTTGTTAAAAATAATTATAAAGAAAATTTAATTCT
ACCGAATAATTTTGAAAATATATGCAAAAGATTTGCAAATCTTTTGCATA
TATTTTCAAAATTATTCGGTAGAATTAAATTTTCTTTATAATTATTTTTA
ACAAAAATTAAAGGAATTATTATTCAAAAACCGATATCAATTAAGACAAA
TTGGAAATATTTCTTTTAACAGTTCTATCTTGAACTGCTTAACTTAAAAT
GCTCAAAATCTCAATTATTTCTGGAGCTATTTTATTGGTTTTAATTTATA
TTGAATGTGTACATGGTCAATTCAGAACTTTAAATGAAGCTACTGGTCCC
ATGGGAATCAGAGGAAATCCTGGAAAGAGGGGGAAACCAGGTCCTGATGG
AGATCCTGGTGCACAGGGTCCAACAGGTCCTCCTGGAAATGATGGACAAA
GAGGTGCACCTGGGTTGCAGGGTCCACCTGGTTCTGTTGGGCCTGAAGGT
AAATCTGGAGCTGGTGGTGTTTCAGGGCCGCCTGGATCAAGAGGTGAACA
AGGTCCACCTGGGCCTATAGGTAAAGAAGGTTTAACAGGTCCAGTAGGAT
ATCCAGGACCATCTGGAGAAAAAGGAGACAGTGGGGGTACTGGTGAACAG
GGTGATACAGGAGATATTGGTCCACCAGGTCCTCCTGGACCAATTGGTCA
ACCTGGTCAAAATGGTCCTACTGGTCCACAAGGTCCAGTCGGAGAAAGAG
GTACTCATGGACCTGATGGAATTGTAGGTCCACCTGGTCCAGAAGGTAGA
ATTGGTTCTCCAGGATTACCAGGCCGACCTGGTGAAATAGGCAAAAAGGG
TGAAGGAGGAGATGAAGGAGCAAAAGGTGGGAAAGGAGAAAGCGGTAAAA
CTGGAATAAATGGTAAATCTGGAAAGCCAGGTTTAAGGGGACGAACAGGT
CCTGATGGTAATAATGGGAAACAAGGAAGAAAAGGTGAAATTGGTGATGT
TGGATTGCTTGGTCCACAAGGATTAACAGGACCACGAGGGTTGCGTGGAT
CATCAGGTAATCCAGGAGATTCTGGTCCTAAAGGTTCTCAAGGAGATCAA
GGTCCAATCGGATTGGAAGGAAAATCCGGACCATTTGGTCCGAAAGGACA
AAAAGGAGATAGAGGTCGTCCAGGACCCCAGGGAGAAGCCGGACTGATTG
GACCAACTGGTCCAATAGGCCCAGATGGAGTTAAAGGATCAAGAGGTGAA
AGCGGACCTAATGGTAGTCCCGGAAGTCCAGGACCAAATGGGAAACCAGG
AGCTGATGGTCCACCTGGAAATCCAGGTCATCCTGGAAACTCTGGTCCTG
GTGGTTCGACAGGTGCTTTAGGAAGACCTGGTCCGAATGGAGCTCCTGGA
CCCAGAGGAGAAATTGGACCCCCAGGTCCTGATGGAAAATCTGGAAATAA
AGGTGCCTTAGGACCGATAGGACCTTCTGGATTACGTGGACCCCCAGGCA
ATCCCGGGAAAGATGGTATGATAGGCCCTCTAGGACTTCCAGGATTAAGA
GGACCACCAGGCAATGTTGGAGCACCTGGAAATAAAGGAAACATCGGTGA
ACCAGGACCAAAAGGCAAAACTGGAAATTCTGGAAAACCTGGACCAGCTG
GAAAGAATGGGGCTGATGGCAGTGAAGGCCTGAGTGGAAATATTGGTAGT
CCTGGTTTCCCAGGACCAAATGGAGATCCTGGAGCCAGTGGACCTCCTGG
TCCAGCAGGTGCTATTGGAATTCCTGGATATCCTGGAAATCAAGGTATAA
GCGGTAAAAATGGTGCACCTGGAACTGAAGGAAATCCTGGAAAATCTGGA
CCGGTAGGTCCTGTTGGTAAAGCGGGAGTTCCTGGTCCTATTGGTGACAG
TGGAAATCCTGGTCCTCCTGGACCAAATGGATTAAAAGGAGCTCCTGGTA
TAGTTGGAATTCCAGGAAATTATGGGCCGAATGGCCCACCAGGTGCTCTC
GGTCCATTGGGAAAACAAGGACCCGAAGGTCCGGTTGGTGCACCAGGTTT
GCTTGGCAGTCCAGGACCAGATGGGCCAAGAGGTCCTCCAGGGGACAATG
GTTCACCAGGATCTGCAGGAGGAGAAGGAGTAGAAAAACACAAACCTCGC
AGTGTTGGAGCTGCTGGTATTAGAGGACCAGTTGGAAAGCCAGGTATGGA
TGGATTACCCGGTGATCTGGGACCACCTGGAAAACAAGGTCCAGAAGGTA
ATCCTGGCGACCCTGGGGATCCTGGAGATATGGGACCTATTGGTGCTTCT
GGTGATTCTGGACCAGACGGAAAGCCAGGCAAGCCTGGAGCACCAGGAAA
ACAAGGACCTGAAGGATTAAGTGGCCCAGAAGGTGCTTCTGGTCCACCCG
GAGATGTTGGTGATCCTGGAAAATCTGGCCCAGGTGGGCCGCGCGGTGGT
CCTGGCAGACAAGGTAGTCCAGGAGGTCCAGGGAAACAAGGAAAATCTGG
TATTCAAGGTAAACCTGGGCCTGCTGGAAAACCTGGACCTCTTGGTCCTC
AAGGAGTTACAGGTTTAGATGGATCAGGTGGTCGAAAAGGTCCTTTAGGT
TTAAGAGGTCCAGTAGGTCCAAAAGGTAAACCCGGAGCTCCTGGTACTGA
TGGAAAAGTTGGTACATCTGGTCCTGACGGCTATCCTGGATATTTGGGTC
CACCAGGACCCTCGGGTCCACAGGGGATAACAGGAAAACCAGGAAAAAAA
GGTCCTCCAGGAAATCGTGGTCCAATAGGAGATAATGGAGAAATGGGCAA
ACCTGGTCCGGTCGGTGTGGTTGGAGATCCTGGAATTACTGGTCCAGTAG
GTCCACGAGGTGACCCTGGACCTCCTGGTAGTAAAGGAGCTGCAGGCAGT
CCAGGTAAAGATGGAACAAGTGGTGGGAAAGGATATGATGGCCAAGGAGG
AACTAGCGGAAAATCAGGAAAGTTAGGAACTCCTGGTGATAAAGGTGCAG
ATGGTAAGCCTGGACCAGGCGGTGTACTTGGAGATGTTGGTTTGAGAGGA
GCAAAAGGAAAAAAAGGAAAAGATGGAGTACCAGGACGTGAAGGAAGACC
GGGTCCTAGAGGTGCAAAAGGTGTAAAAGGACCAAAAGGAGTCCCTGGAT
ATCCAGGATTAATAGGGATAGAAGGAGATCCAGGACCACCAGGTTTTCCT
GGTCCAGATGGGCCTGTTGGACCTAAGGGTCCTGATGGAATACAAGGTCC
AGTTGGAATAACTGGTCCCCGTGGTGATCCTGGAATACCAGGAGCTATTG
GTCCAACTGGTCTTTCGGGTAAGAAAGGTTCAAATGGTCCTTTAGGCCAA
TTGGGACCATTAGGACCTCCAGGTCCTCCTGGACCACCAGGTGGTATTAT
GGCTATGCAACTACGTTCTCCTACAAAGGGTGTTGTTTATGGAGATGATC
CTGCTGCCGCTGAACTTCTTGGAAATAATGCCATTAAAAATCCTCAAGGC
ACCAAAGAAGTTCCAGCTAGAACATGTAAACAGTTGAGTTCAGAAAATCC
AAATATTCCTGATGGCGAATATTGGATAGATCCAAATGGTGGTAGGACAA
GTGATGCTGTCAAAGTCTTTTGTAAAATATCTGAAGAAAAAACCTGTATT
AAACCATTAAATCAAGAAATTAGTATTCGATCATGGAATAGTCCGATTAC
CAAAGGGCACACATGGTTGAAATCATTATTAAATTCAGACGAGATCCAAT
ATTCAATACCAAATGGTCAACTCGCATTTCTCAAAATTTATAGTGACACA
GCCACTCAAAGAATCACATTTGCTTGTGAAAATCATCCGATAATTGGCAG
TGAAGAAAAATTAAATAAAGTAATGGCACCCCGTTTATTAGCTGATGATG
ATACCATAATTAAAATGACACAATCAAAATTGAAATACACAGTTATCAAA
GATGAGTGTCAATATTCCAAATCCAGTGAAGCTGAAAGTATAATAGAATT
GAAACATTTGGCGAGCTTACTACCGATTCGTGATATTGGCATCAATATTA
TCAATAATAGAAAGAGTAAATTTGGATTGACAATAGAAGAAGTCTGCTTT
TCATAATTTCTGACTTTAAAAAAATCAATTTACATTTTTGCTAATAGTTG
TATATTAAATGAAATTTATTTTGTTCTGTAATTAATCAATAATAAACGTA
TTGATTTTTTTCAAAAAAAA

Design Primers for collagen

back to top

protein sequence

>Gsp_012135-protein ID=Gsp_012135-protein|Name=collagen|organism=Girardia sp.|type=polypeptide|length=1286bp
MLKISIISGAILLVLIYIECVHGQFRTLNEATGPMGIRGNPGKRGKPGPD
GDPGAQGPTGPPGNDGQRGAPGLQGPPGSVGPEGKSGAGGVSGPPGSRGE
QGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEQGDTGDIGPPGPPGPIG
QPGQNGPTGPQGPVGERGTHGPDGIVGPPGPEGRIGSPGLPGRPGEIGKK
GEGGDEGAKGGKGESGKTGINGKSGKPGLRGRTGPDGNNGKQGRKGEIGD
VGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKG
QKGDRGRPGPQGEAGLIGPTGPIGPDGVKGSRGESGPNGSPGSPGPNGKP
GADGPPGNPGHPGNSGPGGSTGALGRPGPNGAPGPRGEIGPPGPDGKSGN
KGALGPIGPSGLRGPPGNPGKDGMIGPLGLPGLRGPPGNVGAPGNKGNIG
EPGPKGKTGNSGKPGPAGKNGADGSEGLSGNIGSPGFPGPNGDPGASGPP
GPAGAIGIPGYPGNQGISGKNGAPGTEGNPGKSGPVGPVGKAGVPGPIGD
SGNPGPPGPNGLKGAPGIVGIPGNYGPNGPPGALGPLGKQGPEGPVGAPG
LLGSPGPDGPRGPPGDNGSPGSAGGEGVEKHKPRSVGAAGIRGPVGKPGM
DGLPGDLGPPGKQGPEGNPGDPGDPGDMGPIGASGDSGPDGKPGKPGAPG
KQGPEGLSGPEGASGPPGDVGDPGKSGPGGPRGGPGRQGSPGGPGKQGKS
GIQGKPGPAGKPGPLGPQGVTGLDGSGGRKGPLGLRGPVGPKGKPGAPGT
DGKVGTSGPDGYPGYLGPPGPSGPQGITGKPGKKGPPGNRGPIGDNGEMG
KPGPVGVVGDPGITGPVGPRGDPGPPGSKGAAGSPGKDGTSGGKGYDGQG
GTSGKSGKLGTPGDKGADGKPGPGGVLGDVGLRGAKGKKGKDGVPGREGR
PGPRGAKGVKGPKGVPGYPGLIGIEGDPGPPGFPGPDGPVGPKGPDGIQG
PVGITGPRGDPGIPGAIGPTGLSGKKGSNGPLGQLGPLGPPGPPGPPGGI
MAMQLRSPTKGVVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSEN
PNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPI
TKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIG
SEEKLNKVMAPRLLADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIE
LKHLASLLPIRDIGINIINNRKSKFGLTIEEVCFS*
back to top

coding sequence

>Gsp_012135.4299.8156 ID=Gsp_012135.4299.8156|Name=Gsp_012135.4299.8156|organism=Girardia sp.|type=CDS|length=3858bp
ATGCTCAAAATCTCAATTATTTCTGGAGCTATTTTATTGGTTTTAATTTA
TATTGAATGTGTACATGGTCAATTCAGAACTTTAAATGAAGCTACTGGTC
CCATGGGAATCAGAGGAAATCCTGGAAAGAGGGGGAAACCAGGTCCTGAT
GGAGATCCTGGTGCACAGGGTCCAACAGGTCCTCCTGGAAATGATGGACA
AAGAGGTGCACCTGGGTTGCAGGGTCCACCTGGTTCTGTTGGGCCTGAAG
GTAAATCTGGAGCTGGTGGTGTTTCAGGGCCGCCTGGATCAAGAGGTGAA
CAAGGTCCACCTGGGCCTATAGGTAAAGAAGGTTTAACAGGTCCAGTAGG
ATATCCAGGACCATCTGGAGAAAAAGGAGACAGTGGGGGTACTGGTGAAC
AGGGTGATACAGGAGATATTGGTCCACCAGGTCCTCCTGGACCAATTGGT
CAACCTGGTCAAAATGGTCCTACTGGTCCACAAGGTCCAGTCGGAGAAAG
AGGTACTCATGGACCTGATGGAATTGTAGGTCCACCTGGTCCAGAAGGTA
GAATTGGTTCTCCAGGATTACCAGGCCGACCTGGTGAAATAGGCAAAAAG
GGTGAAGGAGGAGATGAAGGAGCAAAAGGTGGGAAAGGAGAAAGCGGTAA
AACTGGAATAAATGGTAAATCTGGAAAGCCAGGTTTAAGGGGACGAACAG
GTCCTGATGGTAATAATGGGAAACAAGGAAGAAAAGGTGAAATTGGTGAT
GTTGGATTGCTTGGTCCACAAGGATTAACAGGACCACGAGGGTTGCGTGG
ATCATCAGGTAATCCAGGAGATTCTGGTCCTAAAGGTTCTCAAGGAGATC
AAGGTCCAATCGGATTGGAAGGAAAATCCGGACCATTTGGTCCGAAAGGA
CAAAAAGGAGATAGAGGTCGTCCAGGACCCCAGGGAGAAGCCGGACTGAT
TGGACCAACTGGTCCAATAGGCCCAGATGGAGTTAAAGGATCAAGAGGTG
AAAGCGGACCTAATGGTAGTCCCGGAAGTCCAGGACCAAATGGGAAACCA
GGAGCTGATGGTCCACCTGGAAATCCAGGTCATCCTGGAAACTCTGGTCC
TGGTGGTTCGACAGGTGCTTTAGGAAGACCTGGTCCGAATGGAGCTCCTG
GACCCAGAGGAGAAATTGGACCCCCAGGTCCTGATGGAAAATCTGGAAAT
AAAGGTGCCTTAGGACCGATAGGACCTTCTGGATTACGTGGACCCCCAGG
CAATCCCGGGAAAGATGGTATGATAGGCCCTCTAGGACTTCCAGGATTAA
GAGGACCACCAGGCAATGTTGGAGCACCTGGAAATAAAGGAAACATCGGT
GAACCAGGACCAAAAGGCAAAACTGGAAATTCTGGAAAACCTGGACCAGC
TGGAAAGAATGGGGCTGATGGCAGTGAAGGCCTGAGTGGAAATATTGGTA
GTCCTGGTTTCCCAGGACCAAATGGAGATCCTGGAGCCAGTGGACCTCCT
GGTCCAGCAGGTGCTATTGGAATTCCTGGATATCCTGGAAATCAAGGTAT
AAGCGGTAAAAATGGTGCACCTGGAACTGAAGGAAATCCTGGAAAATCTG
GACCGGTAGGTCCTGTTGGTAAAGCGGGAGTTCCTGGTCCTATTGGTGAC
AGTGGAAATCCTGGTCCTCCTGGACCAAATGGATTAAAAGGAGCTCCTGG
TATAGTTGGAATTCCAGGAAATTATGGGCCGAATGGCCCACCAGGTGCTC
TCGGTCCATTGGGAAAACAAGGACCCGAAGGTCCGGTTGGTGCACCAGGT
TTGCTTGGCAGTCCAGGACCAGATGGGCCAAGAGGTCCTCCAGGGGACAA
TGGTTCACCAGGATCTGCAGGAGGAGAAGGAGTAGAAAAACACAAACCTC
GCAGTGTTGGAGCTGCTGGTATTAGAGGACCAGTTGGAAAGCCAGGTATG
GATGGATTACCCGGTGATCTGGGACCACCTGGAAAACAAGGTCCAGAAGG
TAATCCTGGCGACCCTGGGGATCCTGGAGATATGGGACCTATTGGTGCTT
CTGGTGATTCTGGACCAGACGGAAAGCCAGGCAAGCCTGGAGCACCAGGA
AAACAAGGACCTGAAGGATTAAGTGGCCCAGAAGGTGCTTCTGGTCCACC
CGGAGATGTTGGTGATCCTGGAAAATCTGGCCCAGGTGGGCCGCGCGGTG
GTCCTGGCAGACAAGGTAGTCCAGGAGGTCCAGGGAAACAAGGAAAATCT
GGTATTCAAGGTAAACCTGGGCCTGCTGGAAAACCTGGACCTCTTGGTCC
TCAAGGAGTTACAGGTTTAGATGGATCAGGTGGTCGAAAAGGTCCTTTAG
GTTTAAGAGGTCCAGTAGGTCCAAAAGGTAAACCCGGAGCTCCTGGTACT
GATGGAAAAGTTGGTACATCTGGTCCTGACGGCTATCCTGGATATTTGGG
TCCACCAGGACCCTCGGGTCCACAGGGGATAACAGGAAAACCAGGAAAAA
AAGGTCCTCCAGGAAATCGTGGTCCAATAGGAGATAATGGAGAAATGGGC
AAACCTGGTCCGGTCGGTGTGGTTGGAGATCCTGGAATTACTGGTCCAGT
AGGTCCACGAGGTGACCCTGGACCTCCTGGTAGTAAAGGAGCTGCAGGCA
GTCCAGGTAAAGATGGAACAAGTGGTGGGAAAGGATATGATGGCCAAGGA
GGAACTAGCGGAAAATCAGGAAAGTTAGGAACTCCTGGTGATAAAGGTGC
AGATGGTAAGCCTGGACCAGGCGGTGTACTTGGAGATGTTGGTTTGAGAG
GAGCAAAAGGAAAAAAAGGAAAAGATGGAGTACCAGGACGTGAAGGAAGA
CCGGGTCCTAGAGGTGCAAAAGGTGTAAAAGGACCAAAAGGAGTCCCTGG
ATATCCAGGATTAATAGGGATAGAAGGAGATCCAGGACCACCAGGTTTTC
CTGGTCCAGATGGGCCTGTTGGACCTAAGGGTCCTGATGGAATACAAGGT
CCAGTTGGAATAACTGGTCCCCGTGGTGATCCTGGAATACCAGGAGCTAT
TGGTCCAACTGGTCTTTCGGGTAAGAAAGGTTCAAATGGTCCTTTAGGCC
AATTGGGACCATTAGGACCTCCAGGTCCTCCTGGACCACCAGGTGGTATT
ATGGCTATGCAACTACGTTCTCCTACAAAGGGTGTTGTTTATGGAGATGA
TCCTGCTGCCGCTGAACTTCTTGGAAATAATGCCATTAAAAATCCTCAAG
GCACCAAAGAAGTTCCAGCTAGAACATGTAAACAGTTGAGTTCAGAAAAT
CCAAATATTCCTGATGGCGAATATTGGATAGATCCAAATGGTGGTAGGAC
AAGTGATGCTGTCAAAGTCTTTTGTAAAATATCTGAAGAAAAAACCTGTA
TTAAACCATTAAATCAAGAAATTAGTATTCGATCATGGAATAGTCCGATT
ACCAAAGGGCACACATGGTTGAAATCATTATTAAATTCAGACGAGATCCA
ATATTCAATACCAAATGGTCAACTCGCATTTCTCAAAATTTATAGTGACA
CAGCCACTCAAAGAATCACATTTGCTTGTGAAAATCATCCGATAATTGGC
AGTGAAGAAAAATTAAATAAAGTAATGGCACCCCGTTTATTAGCTGATGA
TGATACCATAATTAAAATGACACAATCAAAATTGAAATACACAGTTATCA
AAGATGAGTGTCAATATTCCAAATCCAGTGAAGCTGAAAGTATAATAGAA
TTGAAACATTTGGCGAGCTTACTACCGATTCGTGATATTGGCATCAATAT
TATCAATAATAGAAAGAGTAAATTTGGATTGACAATAGAAGAAGTCTGCT
TTTCATAA
back to top
Gene Groups
collagen is similar in sequence to the genes of this group: GG1008
Gene NameGene ID
SMU15002271SMU15002271
SMU15040033SMU15040033
Ddo_001537Ddo_001537
collagenDdo_024746
collagenGsp_012135
Gsp_016934Gsp_016934
Pgr_004907Pgr_004907
collagenPgr_010667
Pmo_000089Pmo_000089
collagenPmo_027332

Gene Group Protein Sequences

>SMU15002271

MWNSIFFSLLFVLCVSINARAEEKLNRLKRQATSPNQAKGSQGPRGDPGP
MGKPGPPGDPGALGPIGPPGRDGARGKSGKPGASGIPGKDGTPGSHGVVG
PIGPRGEPGLAGSIGPEGGTGPQGNRGLTGDKGDIGLAGLKGSNGEPGLQ
GPQGLRGPAGRVGPAGIQGPTGERGKQGTDGVPGSLGPQGAIGPPGQSGI
PGEIGNKGIRGESGIKGAKGDSGNPGLAGKTGPSGSLGPPGYPGVDGRPG
VRGEAGIVGPQGPVGKVGQRGQRGPSGNPGLSGPKGSQGEEGPIGIEGKQ
GSAGPKGQKGDPGRPGETGDEGPRGERGVVGPAGNKGSRGESGPDGSPGN
PGTDGIPGKDGLHGNPGSQGEVGSRGSPGAMGKLGLNGAPGPRGENGVFG
TNGHPGAKGAVGPKGNMGSPGVRGLPGNTGTMGVMGPGGIRGPLGPVGSP
GDKGSKGRTGITGVAGDSGDLGEQGPPGEDGSEGPSGSPGPMGFPGIAGK
IGQPGPIGPDGPPGPAGFEGTPGTNGKDGKAGKDGKPGPPGEVGPPGVKG
SIGPVGETGLMGKIGLRGPKGLDGIMGPPGTFGMNGPPGPSGEAGPSGAP
GKLGPVGITGSRGRPGPIGSSGEPGEKGPQGLPEVEKPLAGRSVGGVSGP
RGERGSPGDDGLPGPNGDPGPPGPIGMDGGRGDTGDRGLPGPAGDPGKDG
KPGEPGTPGIDGPPGQVGPEGPRGPSGETGEQGPQGLPGKPGDPGAEGPR
GSSGKQGFQGPTGPIGPTGKQGKQGRAGKSGKNGLTGRKGPAGQRGSIGA
RGKDGQTGENGRAGSAGADGFPGFPGPNGPPGPTGPDGKMGPIGPPGEIG
EHGEMGDPGPTGKEGLVGPQGNVGPPGPIGSPGNPGIAGPPGPQGKRGNR
GNTGFVGPAGPRGKRGPVGPGGEKGIPGSPGLEGQRGQIGVSGARGNNGN
NGKPGRTGRAGPAGTMGQKGVRGVKGFTGPNGLKGPQGPSGYPGEDGSPG
PMGLIGERGPQGITGKRGDRGDPGLLGPLGPPGNDGEFGRPGPQGPMGPP
GPPGPPGSSMPMGLRSPTKGLTFSDDPSVAHSFGNNAIITPRGTKEVPAR
SCKHLSEHNPDLSDGEYWIDPNGGRVSDAVPVYCRIATQQTCIKPISKIY
KTASWFKKYQKDHVWFQTINGIGEFEYDIENYQLNYLKALSETATQQISL
NCINQAIILDRQGKMSTVWTSLLGDDDTILSLQHPKRRFKVIKDECQYEK
FSEAETILEVRGKASRLPIKDVGLIIDSDRSRKVGIELGEVCYS*
>SMU15040033
MFKNSIFSGAILLILIYVDFSYGQFRTLNEATGPIGIRGNPGKRGKIGPD
GDPGSSGPPGPPGKDGLRGAPGPNGPAGGAGPDGKSGVTGNTGPPGSRGE
QGPPGPVGKEGLTGPNGYSGPSGEKGDSGSIGEQGDPGDIGPQGPAGPLG
PPGQSGPTGPQGTVGERGPHGPDGVVGPPGPEGRMGSPGSPGRPGELGKK
GEGGDEGLKGGKGENGKTGINGKSGKPGIRGPIGPVGINGKQGRKGELGD
IGLTGPQGLIGPRGVRGTVGNPGDNGPKGSQGDQGPIGLEGKPGPFGPKG
QKGDRGRPGPQGETGPLGPGGPIGPDGAKGSRGEIGPNGSPGTPGPNGKP
GATGPPGTPGHPGNAGPGGQAGPIGRPGPNGAPGPRGEIGPNGPDGKSGR
KGSLGPTGLTGLRGPQGNPGKDGTLGPLGTPGLRGPPGSIGTPGLKGNIG
PPGSKGKVGNAGKPGPLGKNGIDGSEGPIGNAGSPGFPGPNGDPGPNGPP
GSLGLAGLVGYPGNQGLAGKNGNPGVEGKPGKAGTPGSPGKPGVPGPVGD
VGNIGPPGPNGLKGAPGIYGVPGNYGPNGSPGDLGPLGKQGPEGLVGAPG
LAGSPGPDGPRGPPGANGSPGSAGGEGVEKHKPRSVGSAGIRGPEGKPGM
DGFPGDVGPPGKQGPEGGPGDPGDPGDMGPIGNSGDPGPDGKPGKNGAPG
KQGPEGLPGSEGASGPPGDVGDPGKSGPTGPRGGPGRIGSPGGQGKQGKS
GNQGKLGPSGKPGPVGSPGVTGLDGTIGRKGPLGLRGPSGPKGKSGAPGV
DGKVGTQGVNGYPGYLGPPGPPGPQGPNGKPGKPGPPGNVGQIGDHGEMG
NPGPQGSVGPPGATGPVGPRGDAGEPGRKGPIGPQGKNGTSGGKGYDGQS
GTSGKSGKVGTPGDKGGDGKPGPSGVLGDVGLRGAKGKKGKDGVPGREGR
PGARGSKGVKGPKGVPGYPGRPGVEGDPGPPGYPGPDGPVGPKGPDGLQG
PVGIIGPRGDPGIPGPIGPTGLHGKKGGIGIMGPVGPLGPPGPPGPPGGI
MAMQMRSPTKGVTYGDDPLAAELLGNNAIKNPEGTKEVPAITCKQLSVKH
PNLPDGEYWIDPNGGRVNDAVKVYCRISEQKTCIKPINNEISLRSWKSHS
ANGHTWLKSILNKEEIQYSIPNGQIAFLKVNSDSAVQRVTFTCENHPIIG
NEEKLNKVTAPRLLADDDTIIKMTHSHLKYTVIKDECQYSKSSEAESIIE
VRNYANLLPIRDIGVSIINNRKSKFGVTIEEVCFS*
>Ddo_001537 Ddo_001537
MHKFTFLFAIGVLCFSVFAENEKLHRLKRQATSPNQAKGSQGPRGDPGPM
GKPGPEGDSGMMGPVGPPGRDGARGKSGKPGLPGIPGKDGTPGSHGVSGP
MGPPGEIGVSGPVGPEGANGKQGNRGPTGDKGDMGLTGLKGSSGEPGLQG
PQGLRGPIGRVGPTGPTGPTGDRGKQGPDGIPGSVGPQGAIGPPGQPGIA
GEIGNKGVRGEAGIKGSKGDHGNPGMGGKLGSIGQPGPPGLPGVDGRPGI
RGEVGVPGLQGPEGKAGQRGQRGAPGNPGSQGPKGSQGEEGSPGIEGKPG
PPGIKGQKGDNGRPGENGDEGPRGERGLNGPSGNKGSQGESGPDGSPGNP
GTDGIPGKDGIAGSPGNEGEAGPKGPPGPMGKPGLNGAPGPRGENGVYGV
NGHPGAKGSVGPKGSIGSPGPRGLPGNNGPMGSMGPGGIRGPLGPVGSPG
EKGGKGRNGIPGASGDAGDIGELGPPGEDGSEGSSGSPGPAGFPGVSGKM
GQIGPIGVDGPPGPPGFEGTPGINGKDGKSGKDGKPGPPGEIGPPGINGS
TGPVGEIGPTGKIGLKGLKGADGIMGPPGTFGINGPPGLSGPMGPVGING
KRGPPGTPGKEGVPGAIGPQGIPGKRGSKGGDGVDKPLAGRSGGGPPGPK
GERGPQGEDGLSGPNGNPGPVGPPGIDGGRGDTGDRGLPGPIGDVGKDGK
PGEPGTPGQDGPPGPVGPEGPRGPPGETGDQGPQGLPGRPGESGLEGPRG
SPGKQGSQGSSGPVGAVGKPGKPGKPGKPGKDGMIGRKGPVGQRGPTGPR
GKDGPAGENGKPGSPGPDGFSGFPGSSGPPGPIGPDGKGGPPGPPGETGE
VGEMGDIGPLGKEGPQGPQGNDGPPGPIGSPGVPGNAGPPGPPGKRGNRG
NTGFVGGVGPRGKRGPVGADGEKGIPGQPGANGAKGQIGIPGPRGKLGNN
GKPGKIGRVGPPGTTGQKGTRGVKGFTGPNGLRGPQGLSGYPGEDGPPGP
IGLIGERGPQGVTGKRGDRGDPGAAGPNGPQGNDGEYGRPGPQGPVGPPG
PPGPPGSSMPMGLRSPTKGLTLADDPSVALSFGNNAIHTPRGTKDVPARS
CKLLSEINPTLPDGEYWIDPNGGRIDDAVKVYCKMSLAQTCIKPISKSFK
AKQWIRKYSKDHIWFQTTHEAGEFQYDIENYQLTYLKVHSDTATQQISLS
CINQAIVLDKNGKLSNASTSLLGDDDTILSLRHPKRRFKVVKDDCQYEKS
SEAETILEIRGKASRLPIRDVGLLVDDNPERKIGIELSEVCYS*
>Ddo_024746 collagen
MLKISILSGCVFLILLFIGNSYGQFRTLNEAVGPAGIRGNPGKPGKPGPD
GDPGPTGPPGPTGKDGLRGQPGAPGPNGNAGPDGKPGSTGVTGPPGARGE
QGPAGPVGKEGLSGAPGFVGPTGEKGDTGPPGEQGDLGDIGPPGRAGPIG
PPGQSGPTGPQGNVGERGPVGPDGMIGPPGPEGKIGSPGSPGRPGEVGKK
GEGGDEGIKGGKGEAGKTGLDGKSGKPGTRGPIGPVGINGRPGKKGELGD
IGLTGPQGISGPRGKRGRVGNPGEGGPKGSQGDQGPIGLEGRAGVLGPKG
QKGDRGRPGELGEAGPLGPIGPIGPDGVKGSRGESGPNGAPGDPGLNGKP
GAIGPPGIPGTPGDAGPGGQTGPMGRPGPNGAPGPRGEVGPNGPDGKSGN
KGAIGPTGQPGVRGSIGNPGKDGALGPMGTSGMRGPPGNIGVEGSKGNKG
PPGSRGKVGSAGKVGPPGKPGVNGSEGPVGNMGNPGFPGSSGDPGAPGPK
GPVGLIGPVGYAGSQGVNGKNGDPGNVGKPGKVGPPGAPGKPGIPGPDGD
RGSLGPPGPTGLKGSPGIIGVPGNYGPNGPPGSLGVIGKQGLEGELGPPG
TPGSPGGTGPRGPPGPAGTPGNAGGEGVDKQKPRSVGSAGIRGPIGKPGA
DGIPGDVGPVGKQGSEGPPGDPGDPGDMGSVGAIGDSGKDGNPGKPGAAG
KPGPSGVDGPEGPAGLPGDIGEPGKSGPSGPRGDPGRVGSAGGPGKQGKS
GNQGKQGPSGKPGPTGPPGLAGLDAANGRKGPLGLRGPPGPKGKSGGPGI
DGKVGTPGTDGFPGYLGPPGQPGPQGIAGKPGKPGPTGGIGLQGDHGEMG
KPGPQGPLGVPGLTGPVGPRGDPGPPGSKGAAGKPGKDGTSGGKGYDGQT
GTSGKSGKIGAPGEKGADGKPGPNGLLGDTGLRGAKGKKGTDGTPGREGK
PGPRGAKGTKGPKGVPGYPGRPGIEGDPGPPGYSGPDGPTGPKGPDGLQG
PVGIFGPRGDPGIPGAIGPTGPPGKPGGTGPAGQRGPLGPPGPPGPPGGV
MAMQMRSPTKGVVYGDDPKAAELLGNNAIKNPQGTKEVPARTCKHLSSVN
PQAPDGEYWIDPNGGRIEDAVKVYCKISEQKTCIKSIDNNLSIRSWNSPI
LNGPTWLQKLINKNEIQYSVPNNQLAFLKVYSDNAVQRVTFNCENHPIIG
NEKIFNKVTAPRLLADDDSIIKMTHPKLKYSVIKDECQYSKSSEAESVIE
VKDVANLLPIRDVGIHIINNRKSKFGLTIEDVCFS*
>Gsp_012135 collagen
MLKISIISGAILLVLIYIECVHGQFRTLNEATGPMGIRGNPGKRGKPGPD
GDPGAQGPTGPPGNDGQRGAPGLQGPPGSVGPEGKSGAGGVSGPPGSRGE
QGPPGPIGKEGLTGPVGYPGPSGEKGDSGGTGEQGDTGDIGPPGPPGPIG
QPGQNGPTGPQGPVGERGTHGPDGIVGPPGPEGRIGSPGLPGRPGEIGKK
GEGGDEGAKGGKGESGKTGINGKSGKPGLRGRTGPDGNNGKQGRKGEIGD
VGLLGPQGLTGPRGLRGSSGNPGDSGPKGSQGDQGPIGLEGKSGPFGPKG
QKGDRGRPGPQGEAGLIGPTGPIGPDGVKGSRGESGPNGSPGSPGPNGKP
GADGPPGNPGHPGNSGPGGSTGALGRPGPNGAPGPRGEIGPPGPDGKSGN
KGALGPIGPSGLRGPPGNPGKDGMIGPLGLPGLRGPPGNVGAPGNKGNIG
EPGPKGKTGNSGKPGPAGKNGADGSEGLSGNIGSPGFPGPNGDPGASGPP
GPAGAIGIPGYPGNQGISGKNGAPGTEGNPGKSGPVGPVGKAGVPGPIGD
SGNPGPPGPNGLKGAPGIVGIPGNYGPNGPPGALGPLGKQGPEGPVGAPG
LLGSPGPDGPRGPPGDNGSPGSAGGEGVEKHKPRSVGAAGIRGPVGKPGM
DGLPGDLGPPGKQGPEGNPGDPGDPGDMGPIGASGDSGPDGKPGKPGAPG
KQGPEGLSGPEGASGPPGDVGDPGKSGPGGPRGGPGRQGSPGGPGKQGKS
GIQGKPGPAGKPGPLGPQGVTGLDGSGGRKGPLGLRGPVGPKGKPGAPGT
DGKVGTSGPDGYPGYLGPPGPSGPQGITGKPGKKGPPGNRGPIGDNGEMG
KPGPVGVVGDPGITGPVGPRGDPGPPGSKGAAGSPGKDGTSGGKGYDGQG
GTSGKSGKLGTPGDKGADGKPGPGGVLGDVGLRGAKGKKGKDGVPGREGR
PGPRGAKGVKGPKGVPGYPGLIGIEGDPGPPGFPGPDGPVGPKGPDGIQG
PVGITGPRGDPGIPGAIGPTGLSGKKGSNGPLGQLGPLGPPGPPGPPGGI
MAMQLRSPTKGVVYGDDPAAAELLGNNAIKNPQGTKEVPARTCKQLSSEN
PNIPDGEYWIDPNGGRTSDAVKVFCKISEEKTCIKPLNQEISIRSWNSPI
TKGHTWLKSLLNSDEIQYSIPNGQLAFLKIYSDTATQRITFACENHPIIG
SEEKLNKVMAPRLLADDDTIIKMTQSKLKYTVIKDECQYSKSSEAESIIE
LKHLASLLPIRDIGINIINNRKSKFGLTIEEVCFS*
>Gsp_016934 Gsp_016934
MYNLIRLFIAILCVLSVPVIVNGEAKLNRIKRQATSPRQAKGSQGPRGDP
GPMGKPGPQGDPGPLGPVGPPGRDGARGKSGRPGASGVPGKDGTPGSHGV
IGPIGPRGEPGIAGSIGPEGASGPQGNRGYPGDKGDIGLNGLKGSNGEPG
LQGPQGIRGTSGRVGPTGPQGPPGERGKQGPDGIPGSPGPQGTIGPPGQS
GITGEMGNKGIRGEGGIKGSKGDVGNPGLSGKVGPAGPTGPPGFPGVDGR
PGVRGESGVVGQQGPVGKVGQQGQRGKTGNIGPPGPKGAQGDEGPMGIEG
KQGPPGPKGQKGDPGRPGETGDEGPRGERGIVGPSGNKGARGETGQDGSP
GNPGTDGIPGKDGLHGSPGSEGEVGSVGPPGEVGKPGLNGAPGIRGENGA
FGVNGKPGAKGSVGPKGNIGSPGPRGLSGNNGAMGVMGPGGIRGPIGPVG
SPGEKGGKGRVGIPGVSGDAGDIGEQGPPGEDGAEGSSGNPGPIGFPGIA
GKIGQAGPLGVDGPPGPPGFEGLPGVNGKDGKAGKDGKPGPPGEVGPPGV
KGSIGQFGETGPMGKIGIRGPKGMDGIMGPPGVFGMNGPPGPSGEAGPEG
APGKLGPVGIAGRPGRPGAIGPQGIAGEKGPKGLPEVDKPLAGRSSGGPP
GPKGERGPPGDDGLPGPNGNQGPPGPVGMDGGRGDTGDRGLPGPVGDPGK
DGKPGEQGVPGTDGPQGPLGPEGPRGPSGETGEQGPRGLPGKPGEPGGEG
PRGSPGKQGLQGPTGPVGPTGKQGKQGKPGKSGKNGVTGQKGPVGSRGPL
GPRGKDGPLGESGKPGSPGADGFSGFPGPIGPPGAIGPDGKQGPVGQPGE
PGEHGEMGDAGPNGNEGPLGPQGNVGPPGPTGPPGNPGNIGTPGPQGKRG
NRGNTGFVGAAGQRGKRGPLGPPGDKGIPGSPGMAGLKGQTGVQGPRGKN
GNNGKPGRIGRSGPPGSMGQKGVRGVKGFTGPNGLKGPQGSSGYPGEDGP
PGPIGLIGERGPQGITGKRGDRGDPGLLGPPGPLGNDGEYGRPGPQGPMG
PPGPPGPPGSSMPMGLRSPTKGLTMADDPTAAISFGNNAIITPRGTKDVP
ARSCKHLSEHNPDLPDGEYWIDPNGGRIDDAIPVFCRIELQQTCIKPISK
IFKPVQWIRKYARGHIWFQTVNEIGEFEYDIESHQLNFLKALSETATQQI
TFTCINQAIILDKTGKLSNASTKLLGDDDTFLTFHHPKRRYKVVKDECQY
EKFSEAETVLQIQGKASRLPIKDVGLITDSNTSRKVGIELSEVCYS*
>Pgr_004907 Pgr_004907
MLRLVCFFLISSVCLTTVLSKEELINRIKRQATSPNQAIGPQGPRGDPGP
RGKIGQIGDTGAIGPPGPPGRDGAAGKSGKPGPPGLPGKDGRPGTPGNSG
ARGTRGEPGVAGPLGPEGGTGKQGSKGAPGDKGDIGPNGLKGSNGEAGLQ
GIRGPRGSPGPVGPNGPPGSAGDRGKRGPDGRMGGAGPQGPIGPPGIPGL
LGDMGDKGERGDGGPKGVKGEAGNAGVSGKPGPTGPKGPQGSPGVDGRPG
IRGELGNIGPLGPPGVAGPRGQTGAPGNSGKLGPKGAQGEQGPVGLEGKI
GLIGDKGQKGDQGRPGDIGDPGVRGERGVEGIGGNKGPRGEMGPDGSPGN
GGTDGIPGRDGEQGQPGSIGPVGPRGPQGPQGKTGLNGSPGGRGENGPFG
LNGMPGGKGEPGVRGPPGLPGSRGSAGNNGLQGVMGPVGMRGPAGQVGEG
GDKGSPGTAGIAGMPGEAGDVGLQGPPGEEGVEGPSGSPGPAGFSGAVGK
IGQVGPIGINGPPGPPGYEGAPGVNGKDGKSGKDGKPGTPGDTGPVGPKG
ASGPLGERGPLGPLGIKGIKGFEGVMGIPGSFGTNGPPGLSGNIGAEGPV
GKPGTAGLPGLAGAPGQRGPIGDSGEPGSSGAQGVEKPPAGRSSGPPGPK
GNRGPPGDNGTPGAVGTQGPPGPPGIDGDRGDPGDRGLPGPIGEPGKDGK
LGKVGGSGKNGPPGQDGPEGPRGAPGDVGEPGRQGPEGKPGPDGPEGPRG
IPGLQGKRGPSGPIGPIGPPGKQGKTGSPGSDGVIGRKGPPGARGPQGPR
GKDGVAGENGKSGTPGPDGFAGFPGPSGPQGIIGGVGKPGNPGSPGETGE
QGEFGDIGPPGPPGPDGLNGNPGPPGPSGIPGSIGERGPPGPGGKKGPMG
NTGFIGMPGPRGGRGKPGLIGPMGLVGGVGLQGPKGPPGEPGDRGKDGVD
GKPGGPGRPGPTGSIGIKGARGFKGFTGPTGIRGNPGPSGYPGENGMPGP
MGLIGERGPQGAPGQRGERGDTGPAGMIGPQGNDGEYGRPGPQGPMGPSG
PPGPPGSSTLMNLRAPKKNLIYGDDPSSALILGNSVIKTPRGTKEVPART
CKHLAEHNPKIPDGEYWIDPNGGRIQDAVKVFCKIDNYQTCIESTTKKFK
LQSWIRKYPKGPIWFQTTNNVGDFEYDVSDDQLSYLKALSDKASQQITIK
CFNKAIIRDKNGALLKTSPSLLGDDDTILTLEHPKRQYTIVRDDCQYEKN
SEAETVLEVNGKASRLPIKDIGLVIDSDKKVGIELSEVCYS*
>Pgr_010667 collagen
MLTNTFLIGVFLGILSSTEFVSGQFKSLQEAVGPPGIRGNPGPQGKPGPD
GDDGAAGPTGFMGKDGFRGPPGPVGPAGKPGTDGKAGNAGVTGSPGSRGE
PGLTGGMGQEGVAGPPGYPGLIGDKGDPGLPGEQGDVGPIGPVGPVGPKG
PIGQPGPTGKQGGTGNRGITGPDGKVGPPGPQGRMGSQGPPGNPGEMGNK
GEKGSEGRKGGKGSVGKPGLNAKPGKLGPTGQAGTPGKDGQQGRRGNPGN
IGVTGSQGPAGQRGPRGLIGNPGEVGPKGSEGDQGPIGVEGKPGNKGPNG
QKGDRGRPGQIGDNGPEGPIGVVGKPGTKGVRGEIGPNGRPGTPGQNGAP
GKNGPPGTPGNPGGTGPAGQPGSLGRPGPNGTPGPRGEPGPSGNDGTPGA
KGSPGPPGPPGQTGLAGNLGKDGDMGKSGPAGIRGPPGELGPDGAKGKTG
SPGLRGIPGDNGPVGKQGPPGLKGNDGNEGGQGSPGFAGPSGEVGKLGPV
GIPGAAGQRGYPGELGVHGKNGAPGSNGKPGKPGADGSVGKIGAPGPDGD
RGDSGPIGARGVTGQVGILGVPGSYGPNGASGPDGILGKPGPSGKPGEDG
APGIPGSPGPKGPPGEGGDPGNPGMPRISKPQPRGAGPAGPKGEAGKPGL
DGIPGEIGPSGKPGAPGPDGDPGDPGETGAPGQDGEEGKEGAPGAPGKPG
LLGPQGLEGNIGSPGPSGEAGDPGATGAPGPVGAPGRDGSLGLPGRQGKP
GEPGKEGAVGKVGSPGVIGAPGNNGADGRKGPLGPRGPPGDKGKPGGPGT
EGKPGSDGSPGDPGFMGSPGPSGPQGSLGKPGKPGPAGSDGLPGDYGEMG
VPGPAGPDGAPGNSGTAGARGPSGLPGGKGKPGPAGKAGKPGNGGFAGIP
GARGASGTPGPAGDAGPLGAPGSNGAIGETGLAGDKGVSGANGKPGQAGR
PGPRGPAGTKGPKGSPGFDGPTGADGDRGAIGYPGPNGPPGPSGPAGLTG
PTGVIGPRGDPGVLGPVGVAGLPGRQGESGPIGQRGPIGPPGPPGPPGGV
MAMQMRSPTKGIMYGDDSKTAKLLGGNAIKNPQGTKDIPAKTCKHLSVSY
PNKPDGEYWIDPNSGRIDDAVKVFCKISAQQTCIRPINGETPLKTWPSSS
VNGHSWIQALTMNNEVQYSIPNSQLSFLKAYSDSAIQRVTFSCDNHPIIG
INGKINKVTSPRLLADDDSVMRINHQKLKYTVISDDCQYGKASEAETIIE
VSNDASLFPIRDVGVTIISKSNSKFGLKFEEVCFS*
>Pmo_000089 Pmo_000089
MLRIVSLILICAIYIIAVMANEELVQRVKRQATSPNQAIGPQGPRGDPGP
RGKAGIPGDTGVLGPPGPPGRDGAAGKSGQSGPSGLPGKDGRPGTPGNVG
SRGPRGEAGVPGPIGPEGGTGKQGSKGPPGEKGDIGPNGLKGSNGESGLQ
GIRGPRGPPGPIGPNGPSGSAGERGKPGPDGRVGGAGPQGPIGPPGVPGL
PGDMGDKGERGDNGPKGVKGEAGNAGLSGKSGPIGPKGPQGSPGVDGRPG
IRGELGNVGPTGPPGSGGPRGQTGAPGNSGHMGPKGAQGEQGPVGLEGKL
GPAGMKGQKGDQGRPGDIGDPGLRGERGVGGIGGNKGPRGEMGPEGSPGN
SGTDGIPGRDGEPGQPGSAGPPGPRGPQGSQGNTGLNGSPGARGENGPFG
SNGIPGNKGEPGVKGPPGLPGSRGSTGNNGVQGIIGPTGIRGPAGQVGEG
GDKGVPGSTGIGGVPGEAGDAGIQGPPGEEGNEGPSGSPGPAGFSGSVGK
IGQTGPVGIDGPPGPPGYEGAPGPNGKDGKSGKDGKPGVPGDTGPSGQKG
TSGPVGERGPSGPSGVKGVKGFDGIMGIPGSFGANGPPGLSGNFGAEGPV
GKSGAPGLPGLPGTQGQRGPIGDSGDPGSSGAQGVEKPPAGRSAGPPGPK
GNRGPPGDNGQPGATGVQGPPGPPGIDGDRGDPGDRGLPGPIGEPGKDGK
AGKVGVAGKDGPPGQDGPEGPRGAPGDVGEPGRQGPEGKPGLDGPEGPRG
QSGLQGKRGPSGNVGPIGKPGKQGKTGSPGTDGSLGRKGPAGARGPQGPR
GKDGPVGESGKTGSPGSDGFAGFPGPSGPQGAIGPEGKPGISGSPGEAGE
QGEFGDIGVPGPPGNDGLPGNAGPPGPMGVPGSLGERGPPGPIGKKGPMG
NTGFVGMPGPRGGRGKPGLLGPMGLVGGVGLQGPKGPPGEPGERGKDGVD
GKAGGAGRPGPPGNLGIKGARGLKGFTGPTGMRGNSGPSGYPGEDGMQGP
IGLIGERGPQGIAGARGERGDTGPSGLIGPTGNDGEYGRPGPQGPMGPVG
PPGPPGSATLMNLRTPKKNLIYGDDPSAALILGNSVIKTPRGTKEVPART
CKHLAEHNPKLTDGEYWIDPNGGRIADAVKVFCKIDKYQTCIESSTKKFK
LQTWVRKYPKGPIWFQTTNGIGEFEYDVPDDQLSYLKALSDKASQQITVK
CMAQAIIRDKNGQLLKSTPSLLGDDDTILTLDHPKRQYTIVKDDCQYEKN
SDAETILEMSGKSSRLPIKDIGLVIDSDKKVGIELSEVCFS*
>Pmo_027332 collagen
MGKDGVRGPPGLIGPAGKPGSDGKAGVAGVAGSPGPRGEPGLTGSMGQEG
VAGPPGYPGLIGEKGDPGLPGEMGDVGNIGPTGIVGPKGPIGQPGPSGKQ
GITGNRGITGPDGKVGPPGPQGKMGPKGPPGNPGEMGNKGEKGSEGRKGG
KGSSGKPGLNAKPGKLGPIGIAGTPGKDGQQGRRGNPGNIGVMGSQGPSG
QRGSRGIIGNPGEVGPKGSEGDQGPIGLEGKPGNKGPNGLKGDRGRPGQI
GENGPEGPLGMEGKPGTKGVRGEIGPNGSPGTPGQNGVLGKNGPPGTSGN
PGGTGPTGQPGPLGRPGPNGTPGPRGEPGPGGNDGMPGAKGSPGPPGPPG
QPGLAGILGKDGDLGKSGPAGIRGPPGELGPAGGKGKIGSPGLRGIPGDT
GPVGKQGLAGQKGNDGHEGGQGTPGFVGPSGEPGKSGPVGSPGAGGQRGF
PGELGVHGKNGAPGNNGKPGKPGAPGPVGIIGAPGPDGDRGDNGPIGARG
ITGPSGVLGVPGTYGPNGAPGPEGTLGKQGPIGKSGLDGIPGLPGLPGPK
GPPGESGDPGPPGNPRISKPQPRGAGPAGPKGETGKPGLDGVPGEVGPSG
KPGATGPDGDAGDPGDAGAPGPDGEEGKSGALGAPGKPGLLGPMGLEGSV
GAPGPAGEAGDSGISGAPGPVGLPGRDGNLGLPGKQGKQGDPGKEGPVGK
PGGLGVPGVPGMNGANGRKGPLGLRGPQGDKGKTGAPGTDGKPGPDGTTG
DPGVMGSPGPSGPQGSLGKPGKPGLPGSEGLTGDYGELGVPGPVGMDGPP
GNPGTVGAPGPPGGPGGKGKPGAKGKPGKAGSGGFPGVPGSRGALGKPGP
SGDPGDAGELGPNGALGEAGLVGDKGVSGANGKPGQAGRPGSRGLPGVKG
PKGSPGFDGPTGSDGDRGPTGFSGPDGPPGPSGTEGPTGPTGVIGPRGDP
GVLGPIGVPGLPGRQGESGPIGQRGPIGPPGPPGPPGGVMAMQMRSPTKG
IMYGDDSMTAKLLGGNVIKNPQGTKDVPAKTCKHLSVSNPNKPDGEYWID
PNSGRIDDAVKVFCKISKQQTCIKPIDGDIAIKKWHTTNFNGHSWIHTLT
KKNEVKYSIPNSQLSFLKAYSEKAIQRVTFTCDNHPIIGNDGKINKVTSP
RLLADDDSILRINHPKLKYTVITDDCQYGKSSEAETVIEVSNDANIFPIR
DVGVTIISKNDSQFGLKFEEVCFS*

Created by

Powered By

Admin Log In

Education - This is a contributing Drupal Theme
Design by WeebPal.