April 9, 2019
Every time we download data through the Entrez module, we can interact with the results in different ways. First, we can just use the handle we obtain as an ordinary file handle and just store or process the raw data provided by Entrez. Second, we can process the data with an appropriate, existing Biopython module. The latter will generally be preferable if an appropriate module exists. However, the various choices that are available may make things confusing.
As an example, we will again download the genbank record with the ID "KT220438", containing an influenza HA protein. We will consider four different ways of looking at the data. First, we use retmode="text"
in Entrez.efetch()
and just download the raw data and print it. We get a regular genbank file as output:
from Bio import Entrez, SeqIO
Entrez.email = "wilke@austin.utexas.edu" # put your email here
# Download sequence record for genbank id KT220438 (HA from influenza A)
# Using text mode
handle = Entrez.efetch(db="nucleotide", id="KT220438", rettype="gb", retmode="text")
record = handle.read() # read file directly
print(record)
handle.close()
LOCUS KT220438 1701 bp cRNA linear VRL 20-JUL-2015 DEFINITION Influenza A virus (A/NewJersey/NHRC_93219/2015(H3N2)) segment 4 hemagglutinin (HA) gene, complete cds. ACCESSION KT220438 VERSION KT220438.1 KEYWORDS . SOURCE Influenza A virus (A/New Jersey/NHRC_93219/2015(H3N2)) ORGANISM Influenza A virus (A/New Jersey/NHRC_93219/2015(H3N2)) Viruses; ssRNA viruses; ssRNA negative-strand viruses; Orthomyxoviridae; Influenzavirus A. REFERENCE 1 (bases 1 to 1701) AUTHORS Sitz,C.R., Thammavong,H.L., Balansay-Ames,M.S., Hawksworth,A.W., Myers,C.A. and Brice,G.T. TITLE GEISS Influenza Surveillance Response Program JOURNAL Unpublished REFERENCE 2 (bases 1 to 1701) AUTHORS Sitz,C.R., Thammavong,H.L., Balansay-Ames,M.S., Hawksworth,A.W., Myers,C.A. and Brice,G.T. TITLE Direct Submission JOURNAL Submitted (29-JUN-2015) Operational Infectious Diseases, Naval Health Research Center, 140 Sylvester Rd., San Diego, CA 92106, USA COMMENT ##Assembly-Data-START## Sequencing Technology :: Sanger dideoxy sequencing ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..1701 /organism="Influenza A virus (A/New Jersey/NHRC_93219/2015(H3N2))" /mol_type="viral cRNA" /strain="A/NewJersey/NHRC_93219/2015" /serotype="H3N2" /isolation_source="nasopharyngeal swab" /host="Homo sapiens" /db_xref="taxon:1682360" /segment="4" /lab_host="MDCK" /country="USA: New Jersey" /collection_date="17-Jan-2015" gene 1..1701 /gene="HA" CDS 1..1701 /gene="HA" /function="receptor binding and fusion protein" /codon_start=1 /product="hemagglutinin" /protein_id="AKQ43545.1" /translation="MKTIIALSYILCLVFAQKIPGNDNSTATLCLGHHAVPNGTIVKT ITNDRIEVTNATELVQNSSIGEICDSPHQILDGENCTLIDALLGDPQCDGFQNKKWDL FVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNNESFNWTGVTQNGTSSACIRRSS SSFFSRLNWLTHLNYTYPALNVTMPNNEQFDKLYIWGVHHPGTDKDQIFLYAQSSGRI TVSTKRSQQAVIPNIGSRPRIRDIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKI RSGKSSIMRSDAPIGKCKSECITPNGSIPNDKPFQNVNRITYGACPRYVKHSTLKLAT GMRNVPEKQTRGIFGAIAGFIENGWEGMVDGWYGFRHQNSEGRGQAADLKSTQAAIDQ INGKLNRLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQ HTXDLTDSEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHNVYR DEALNNRFQIKGVELKSGYKDWILWISXAISCFLLCVALLGFIMWACQKGNIRCNICI " mat_peptide 49..1035 /gene="HA" /product="HA1" mat_peptide 1036..1698 /gene="HA" /product="HA2" ORIGIN 1 atgaagacta tcattgcttt gagctacatt ctatgtctgg ttttcgctca aaaaattcct 61 ggaaatgaca atagcacggc aacgctgtgc cttgggcacc atgcagtacc aaacggaacg 121 atagtgaaaa caatcacaaa tgaccgaatt gaagttacta atgctactga gctggttcag 181 aattcctcaa taggtgaaat atgcgacagt cctcatcaga tccttgatgg agaaaactgc 241 acactaatag atgctctatt gggagaccct cagtgtgatg gctttcaaaa taagaaatgg 301 gacctttttg ttgaacgaag caaagcctac agcaactgct acccttatga tgtgccggat 361 tatgcctccc ttaggtcact agttgcctca tccggcacac tggagtttaa caatgaaagc 421 ttcaattgga ctggagtcac tcaaaacgga acaagttctg cttgcataag gagatctagt 481 agtagtttct ttagtagatt aaattggttg acccacttaa actacacata cccagcattg 541 aacgtgacta tgccaaacaa tgaacaattt gacaaattgt acatttgggg ggttcaccac 601 ccgggtacgg acaaggacca aatcttcctg tatgctcaat catcaggaag aatcacagta 661 tctaccaaaa gaagccaaca agctgtaatc ccaaatatcg gatctagacc cagaataagg 721 gatatcccta gcagaataag catctattgg acaatagtaa aaccgggaga catacttttg 781 attaacagca cagggaatct aattgctcct aggggttact tcaaaatacg aagtgggaaa 841 agctcaataa tgagatcaga tgcacccatt ggcaaatgca agtctgaatg catcactcca 901 aatggaagca ttcccaatga caaaccattc caaaatgtaa acaggatcac atacggggcc 961 tgtcccagat atgttaagca tagcactcta aaattggcaa caggaatgcg aaatgtacca 1021 gagaaacaaa ctagaggcat atttggcgca atagcgggtt tcatagaaaa tggttgggag 1081 ggaatggtgg atggttggta cggtttcagg catcaaaatt ctgagggaag aggacaagca 1141 gcagatctca aaagcactca agcagcaatc gatcaaatca atgggaagct gaatcgattg 1201 atcgggaaaa ccaacgagaa attccatcag attgaaaaag aattctcaga agtagaagga 1261 agaattcagg accttgagaa atatgttgag gacactaaaa tagatctctg gtcatacaac 1321 gcggagcttc ttgttgccct ggagaaccaa catacarttg atctaactga ctcagaaatg 1381 aacaaactgt ttgaaaaaac aaagaagcaa ctgagggaaa atgctgagga tatgggaaat 1441 ggttgtttca aaatatacca caaatgtgac aatgcctgca taggatcaat aagaaatgga 1501 acttatgacc acaatgtgta cagggatgaa gcattaaaca accggttcca gatcaaggga 1561 gttgagctga agtcagggta caaagattgg atcctatgga tttcctytgc catatcatgt 1621 tttttgcttt gtgttgcttt gttggggttc atcatgtggg cctgccaaaa gggcaacatt 1681 aggtgcaaca tttgcatttg a //
We can also, as we have done before, process this file using the SeqIO
module:
# Download sequence record for genbank id KT220438 (HA from influenza A)
# Using text mode
handle = Entrez.efetch(db="nucleotide", id="KT220438", rettype="gb", retmode="text")
record = SeqIO.read(handle, "genbank") # parse with SeqIO
print(record)
handle.close()
ID: KT220438.1 Name: KT220438 Description: Influenza A virus (A/NewJersey/NHRC_93219/2015(H3N2)) segment 4 hemagglutinin (HA) gene, complete cds Number of features: 5 /structured_comment=OrderedDict([('Assembly-Data', OrderedDict([('Sequencing Technology', 'Sanger dideoxy sequencing')]))]) /date=20-JUL-2015 /molecule_type=cRNA /data_file_division=VRL /keywords=[''] /accessions=['KT220438'] /topology=linear /taxonomy=['Viruses', 'ssRNA viruses', 'ssRNA negative-strand viruses', 'Orthomyxoviridae', 'Influenzavirus A'] /source=Influenza A virus (A/New Jersey/NHRC_93219/2015(H3N2)) /organism=Influenza A virus (A/New Jersey/NHRC_93219/2015(H3N2)) /sequence_version=1 /references=[Reference(title='GEISS Influenza Surveillance Response Program', ...), Reference(title='Direct Submission', ...)] Seq('ATGAAGACTATCATTGCTTTGAGCTACATTCTATGTCTGGTTTTCGCTCAAAAA...TGA', IUPACAmbiguousDNA())
In addition to text mode, we can also download the data in XML mode. XML is a structured data format that allows for easy machine-processing of complex data files. If we just print the raw data, though, it doesn't look appealing:
# Download sequence record for genbank id KT220438 (HA from influenza A)
# Using XML mode
handle = Entrez.efetch(db="nucleotide", id="KT220438", rettype="gb", retmode="xml")
record = handle.read() # read file directly
print(record)
handle.close()
<?xml version="1.0" encoding="UTF-8" ?> <!DOCTYPE GBSet PUBLIC "-//NCBI//NCBI GBSeq/EN" "https://www.ncbi.nlm.nih.gov/dtd/NCBI_GBSeq.dtd"> <GBSet> <GBSeq> <GBSeq_locus>KT220438</GBSeq_locus> <GBSeq_length>1701</GBSeq_length> <GBSeq_strandedness>single</GBSeq_strandedness> <GBSeq_moltype>cRNA</GBSeq_moltype> <GBSeq_topology>linear</GBSeq_topology> <GBSeq_division>VRL</GBSeq_division> <GBSeq_update-date>20-JUL-2015</GBSeq_update-date> <GBSeq_create-date>20-JUL-2015</GBSeq_create-date> <GBSeq_definition>Influenza A virus (A/NewJersey/NHRC_93219/2015(H3N2)) segment 4 hemagglutinin (HA) gene, complete cds</GBSeq_definition> <GBSeq_primary-accession>KT220438</GBSeq_primary-accession> <GBSeq_accession-version>KT220438.1</GBSeq_accession-version> <GBSeq_other-seqids> <GBSeqid>gb|KT220438.1|</GBSeqid> <GBSeqid>gi|887493048</GBSeqid> </GBSeq_other-seqids> <GBSeq_source>Influenza A virus (A/New Jersey/NHRC_93219/2015(H3N2))</GBSeq_source> <GBSeq_organism>Influenza A virus (A/New Jersey/NHRC_93219/2015(H3N2))</GBSeq_organism> <GBSeq_taxonomy>Viruses; ssRNA viruses; ssRNA negative-strand viruses; Orthomyxoviridae; Influenzavirus A</GBSeq_taxonomy> <GBSeq_references> <GBReference> <GBReference_reference>1</GBReference_reference> <GBReference_position>1..1701</GBReference_position> <GBReference_authors> <GBAuthor>Sitz,C.R.</GBAuthor> <GBAuthor>Thammavong,H.L.</GBAuthor> <GBAuthor>Balansay-Ames,M.S.</GBAuthor> <GBAuthor>Hawksworth,A.W.</GBAuthor> <GBAuthor>Myers,C.A.</GBAuthor> <GBAuthor>Brice,G.T.</GBAuthor> </GBReference_authors> <GBReference_title>GEISS Influenza Surveillance Response Program</GBReference_title> <GBReference_journal>Unpublished</GBReference_journal> </GBReference> <GBReference> <GBReference_reference>2</GBReference_reference> <GBReference_position>1..1701</GBReference_position> <GBReference_authors> <GBAuthor>Sitz,C.R.</GBAuthor> <GBAuthor>Thammavong,H.L.</GBAuthor> <GBAuthor>Balansay-Ames,M.S.</GBAuthor> <GBAuthor>Hawksworth,A.W.</GBAuthor> <GBAuthor>Myers,C.A.</GBAuthor> <GBAuthor>Brice,G.T.</GBAuthor> </GBReference_authors> <GBReference_title>Direct Submission</GBReference_title> <GBReference_journal>Submitted (29-JUN-2015) Operational Infectious Diseases, Naval Health Research Center, 140 Sylvester Rd., San Diego, CA 92106, USA</GBReference_journal> </GBReference> </GBSeq_references> <GBSeq_comment>##Assembly-Data-START## ; Sequencing Technology :: Sanger dideoxy sequencing ; ##Assembly-Data-END##</GBSeq_comment> <GBSeq_feature-table> <GBFeature> <GBFeature_key>source</GBFeature_key> <GBFeature_location>1..1701</GBFeature_location> <GBFeature_intervals> <GBInterval> <GBInterval_from>1</GBInterval_from> <GBInterval_to>1701</GBInterval_to> <GBInterval_accession>KT220438.1</GBInterval_accession> </GBInterval> </GBFeature_intervals> <GBFeature_quals> <GBQualifier> <GBQualifier_name>organism</GBQualifier_name> <GBQualifier_value>Influenza A virus (A/New Jersey/NHRC_93219/2015(H3N2))</GBQualifier_value> </GBQualifier> <GBQualifier> <GBQualifier_name>mol_type</GBQualifier_name> <GBQualifier_value>viral cRNA</GBQualifier_value> </GBQualifier> <GBQualifier> <GBQualifier_name>strain</GBQualifier_name> <GBQualifier_value>A/NewJersey/NHRC_93219/2015</GBQualifier_value> </GBQualifier> <GBQualifier> <GBQualifier_name>serotype</GBQualifier_name> <GBQualifier_value>H3N2</GBQualifier_value> </GBQualifier> <GBQualifier> <GBQualifier_name>isolation_source</GBQualifier_name> <GBQualifier_value>nasopharyngeal swab</GBQualifier_value> </GBQualifier> <GBQualifier> <GBQualifier_name>host</GBQualifier_name> <GBQualifier_value>Homo sapiens</GBQualifier_value> </GBQualifier> <GBQualifier> <GBQualifier_name>db_xref</GBQualifier_name> <GBQualifier_value>taxon:1682360</GBQualifier_value> </GBQualifier> <GBQualifier> <GBQualifier_name>segment</GBQualifier_name> <GBQualifier_value>4</GBQualifier_value> </GBQualifier> <GBQualifier> <GBQualifier_name>lab_host</GBQualifier_name> <GBQualifier_value>MDCK</GBQualifier_value> </GBQualifier> <GBQualifier> <GBQualifier_name>country</GBQualifier_name> <GBQualifier_value>USA: New Jersey</GBQualifier_value> </GBQualifier> <GBQualifier> <GBQualifier_name>collection_date</GBQualifier_name> <GBQualifier_value>17-Jan-2015</GBQualifier_value> </GBQualifier> </GBFeature_quals> </GBFeature> <GBFeature> <GBFeature_key>gene</GBFeature_key> <GBFeature_location>1..1701</GBFeature_location> <GBFeature_intervals> <GBInterval> <GBInterval_from>1</GBInterval_from> <GBInterval_to>1701</GBInterval_to> <GBInterval_accession>KT220438.1</GBInterval_accession> </GBInterval> </GBFeature_intervals> <GBFeature_quals> <GBQualifier> <GBQualifier_name>gene</GBQualifier_name> <GBQualifier_value>HA</GBQualifier_value> </GBQualifier> </GBFeature_quals> </GBFeature> <GBFeature> <GBFeature_key>CDS</GBFeature_key> <GBFeature_location>1..1701</GBFeature_location> <GBFeature_intervals> <GBInterval> <GBInterval_from>1</GBInterval_from> <GBInterval_to>1701</GBInterval_to> <GBInterval_accession>KT220438.1</GBInterval_accession> </GBInterval> </GBFeature_intervals> <GBFeature_quals> <GBQualifier> <GBQualifier_name>gene</GBQualifier_name> <GBQualifier_value>HA</GBQualifier_value> </GBQualifier> <GBQualifier> <GBQualifier_name>function</GBQualifier_name> <GBQualifier_value>receptor binding and fusion protein</GBQualifier_value> </GBQualifier> <GBQualifier> <GBQualifier_name>codon_start</GBQualifier_name> <GBQualifier_value>1</GBQualifier_value> </GBQualifier> <GBQualifier> <GBQualifier_name>transl_table</GBQualifier_name> <GBQualifier_value>1</GBQualifier_value> </GBQualifier> <GBQualifier> <GBQualifier_name>product</GBQualifier_name> <GBQualifier_value>hemagglutinin</GBQualifier_value> </GBQualifier> <GBQualifier> <GBQualifier_name>protein_id</GBQualifier_name> <GBQualifier_value>AKQ43545.1</GBQualifier_value> </GBQualifier> <GBQualifier> <GBQualifier_name>translation</GBQualifier_name> <GBQualifier_value>MKTIIALSYILCLVFAQKIPGNDNSTATLCLGHHAVPNGTIVKTITNDRIEVTNATELVQNSSIGEICDSPHQILDGENCTLIDALLGDPQCDGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNNESFNWTGVTQNGTSSACIRRSSSSFFSRLNWLTHLNYTYPALNVTMPNNEQFDKLYIWGVHHPGTDKDQIFLYAQSSGRITVSTKRSQQAVIPNIGSRPRIRDIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKCKSECITPNGSIPNDKPFQNVNRITYGACPRYVKHSTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDGWYGFRHQNSEGRGQAADLKSTQAAIDQINGKLNRLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTXDLTDSEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHNVYRDEALNNRFQIKGVELKSGYKDWILWISXAISCFLLCVALLGFIMWACQKGNIRCNICI</GBQualifier_value> </GBQualifier> </GBFeature_quals> </GBFeature> <GBFeature> <GBFeature_key>mat_peptide</GBFeature_key> <GBFeature_location>49..1035</GBFeature_location> <GBFeature_intervals> <GBInterval> <GBInterval_from>49</GBInterval_from> <GBInterval_to>1035</GBInterval_to> <GBInterval_accession>KT220438.1</GBInterval_accession> </GBInterval> </GBFeature_intervals> <GBFeature_quals> <GBQualifier> <GBQualifier_name>gene</GBQualifier_name> <GBQualifier_value>HA</GBQualifier_value> </GBQualifier> <GBQualifier> <GBQualifier_name>product</GBQualifier_name> <GBQualifier_value>HA1</GBQualifier_value> </GBQualifier> <GBQualifier> <GBQualifier_name>peptide</GBQualifier_name> <GBQualifier_value>QKIPGNDNSTATLCLGHHAVPNGTIVKTITNDRIEVTNATELVQNSSIGEICDSPHQILDGENCTLIDALLGDPQCDGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNNESFNWTGVTQNGTSSACIRRSSSSFFSRLNWLTHLNYTYPALNVTMPNNEQFDKLYIWGVHHPGTDKDQIFLYAQSSGRITVSTKRSQQAVIPNIGSRPRIRDIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKCKSECITPNGSIPNDKPFQNVNRITYGACPRYVKHSTLKLATGMRNVPEKQTR</GBQualifier_value> </GBQualifier> </GBFeature_quals> </GBFeature> <GBFeature> <GBFeature_key>mat_peptide</GBFeature_key> <GBFeature_location>1036..1698</GBFeature_location> <GBFeature_intervals> <GBInterval> <GBInterval_from>1036</GBInterval_from> <GBInterval_to>1698</GBInterval_to> <GBInterval_accession>KT220438.1</GBInterval_accession> </GBInterval> </GBFeature_intervals> <GBFeature_quals> <GBQualifier> <GBQualifier_name>gene</GBQualifier_name> <GBQualifier_value>HA</GBQualifier_value> </GBQualifier> <GBQualifier> <GBQualifier_name>product</GBQualifier_name> <GBQualifier_value>HA2</GBQualifier_value> </GBQualifier> <GBQualifier> <GBQualifier_name>peptide</GBQualifier_name> <GBQualifier_value>GIFGAIAGFIENGWEGMVDGWYGFRHQNSEGRGQAADLKSTQAAIDQINGKLNRLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTXDLTDSEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHNVYRDEALNNRFQIKGVELKSGYKDWILWISXAISCFLLCVALLGFIMWACQKGNIRCNICI</GBQualifier_value> </GBQualifier> </GBFeature_quals> </GBFeature> </GBSeq_feature-table> <GBSeq_sequence>atgaagactatcattgctttgagctacattctatgtctggttttcgctcaaaaaattcctggaaatgacaatagcacggcaacgctgtgccttgggcaccatgcagtaccaaacggaacgatagtgaaaacaatcacaaatgaccgaattgaagttactaatgctactgagctggttcagaattcctcaataggtgaaatatgcgacagtcctcatcagatccttgatggagaaaactgcacactaatagatgctctattgggagaccctcagtgtgatggctttcaaaataagaaatgggacctttttgttgaacgaagcaaagcctacagcaactgctacccttatgatgtgccggattatgcctcccttaggtcactagttgcctcatccggcacactggagtttaacaatgaaagcttcaattggactggagtcactcaaaacggaacaagttctgcttgcataaggagatctagtagtagtttctttagtagattaaattggttgacccacttaaactacacatacccagcattgaacgtgactatgccaaacaatgaacaatttgacaaattgtacatttggggggttcaccacccgggtacggacaaggaccaaatcttcctgtatgctcaatcatcaggaagaatcacagtatctaccaaaagaagccaacaagctgtaatcccaaatatcggatctagacccagaataagggatatccctagcagaataagcatctattggacaatagtaaaaccgggagacatacttttgattaacagcacagggaatctaattgctcctaggggttacttcaaaatacgaagtgggaaaagctcaataatgagatcagatgcacccattggcaaatgcaagtctgaatgcatcactccaaatggaagcattcccaatgacaaaccattccaaaatgtaaacaggatcacatacggggcctgtcccagatatgttaagcatagcactctaaaattggcaacaggaatgcgaaatgtaccagagaaacaaactagaggcatatttggcgcaatagcgggtttcatagaaaatggttgggagggaatggtggatggttggtacggtttcaggcatcaaaattctgagggaagaggacaagcagcagatctcaaaagcactcaagcagcaatcgatcaaatcaatgggaagctgaatcgattgatcgggaaaaccaacgagaaattccatcagattgaaaaagaattctcagaagtagaaggaagaattcaggaccttgagaaatatgttgaggacactaaaatagatctctggtcatacaacgcggagcttcttgttgccctggagaaccaacatacarttgatctaactgactcagaaatgaacaaactgtttgaaaaaacaaagaagcaactgagggaaaatgctgaggatatgggaaatggttgtttcaaaatataccacaaatgtgacaatgcctgcataggatcaataagaaatggaacttatgaccacaatgtgtacagggatgaagcattaaacaaccggttccagatcaagggagttgagctgaagtcagggtacaaagattggatcctatggatttcctytgccatatcatgttttttgctttgtgttgctttgttggggttcatcatgtgggcctgccaaaagggcaacattaggtgcaacatttgcatttga</GBSeq_sequence> </GBSeq> </GBSet>
The advantage of XML mode is that there is the generic Entrez.parse()
function that can parse XML files returned from Entrez.efetch()
. Also, some modules in Biopython cannot work with files obtained in text mode, they can only work on files obtained in XML mode. The documentation will generally tell you for each module what kind of input it expects.
Reading the above example with Entrez.parse()
gives us the following:
# Download sequence record for genbank id KT220438 (HA from influenza A)
handle = Entrez.efetch(db="nucleotide", id="KT220438", rettype="gb", retmode="xml")
parsed = Entrez.parse(handle)
record = list(parsed)[0] # We need to convert the parsed contents into a list. Here we want just the 0th element.
handle.close()
print(record)
DictElement({'GBSeq_division': 'VRL', 'GBSeq_comment': '##Assembly-Data-START## ; Sequencing Technology :: Sanger dideoxy sequencing ; ##Assembly-Data-END##', 'GBSeq_references': [DictElement({'GBReference_reference': '1', 'GBReference_position': '1..1701', 'GBReference_title': 'GEISS Influenza Surveillance Response Program', 'GBReference_authors': ['Sitz,C.R.', 'Thammavong,H.L.', 'Balansay-Ames,M.S.', 'Hawksworth,A.W.', 'Myers,C.A.', 'Brice,G.T.'], 'GBReference_journal': 'Unpublished'}, attributes={}), DictElement({'GBReference_reference': '2', 'GBReference_position': '1..1701', 'GBReference_title': 'Direct Submission', 'GBReference_authors': ['Sitz,C.R.', 'Thammavong,H.L.', 'Balansay-Ames,M.S.', 'Hawksworth,A.W.', 'Myers,C.A.', 'Brice,G.T.'], 'GBReference_journal': 'Submitted (29-JUN-2015) Operational Infectious Diseases, Naval Health Research Center, 140 Sylvester Rd., San Diego, CA 92106, USA'}, attributes={})], 'GBSeq_locus': 'KT220438', 'GBSeq_strandedness': 'single', 'GBSeq_update-date': '20-JUL-2015', 'GBSeq_organism': 'Influenza A virus (A/New Jersey/NHRC_93219/2015(H3N2))', 'GBSeq_accession-version': 'KT220438.1', 'GBSeq_source': 'Influenza A virus (A/New Jersey/NHRC_93219/2015(H3N2))', 'GBSeq_topology': 'linear', 'GBSeq_taxonomy': 'Viruses; ssRNA viruses; ssRNA negative-strand viruses; Orthomyxoviridae; Influenzavirus A', 'GBSeq_other-seqids': ['gb|KT220438.1|', 'gi|887493048'], 'GBSeq_definition': 'Influenza A virus (A/NewJersey/NHRC_93219/2015(H3N2)) segment 4 hemagglutinin (HA) gene, complete cds', 'GBSeq_create-date': '20-JUL-2015', 'GBSeq_length': '1701', 'GBSeq_primary-accession': 'KT220438', 'GBSeq_sequence': 'atgaagactatcattgctttgagctacattctatgtctggttttcgctcaaaaaattcctggaaatgacaatagcacggcaacgctgtgccttgggcaccatgcagtaccaaacggaacgatagtgaaaacaatcacaaatgaccgaattgaagttactaatgctactgagctggttcagaattcctcaataggtgaaatatgcgacagtcctcatcagatccttgatggagaaaactgcacactaatagatgctctattgggagaccctcagtgtgatggctttcaaaataagaaatgggacctttttgttgaacgaagcaaagcctacagcaactgctacccttatgatgtgccggattatgcctcccttaggtcactagttgcctcatccggcacactggagtttaacaatgaaagcttcaattggactggagtcactcaaaacggaacaagttctgcttgcataaggagatctagtagtagtttctttagtagattaaattggttgacccacttaaactacacatacccagcattgaacgtgactatgccaaacaatgaacaatttgacaaattgtacatttggggggttcaccacccgggtacggacaaggaccaaatcttcctgtatgctcaatcatcaggaagaatcacagtatctaccaaaagaagccaacaagctgtaatcccaaatatcggatctagacccagaataagggatatccctagcagaataagcatctattggacaatagtaaaaccgggagacatacttttgattaacagcacagggaatctaattgctcctaggggttacttcaaaatacgaagtgggaaaagctcaataatgagatcagatgcacccattggcaaatgcaagtctgaatgcatcactccaaatggaagcattcccaatgacaaaccattccaaaatgtaaacaggatcacatacggggcctgtcccagatatgttaagcatagcactctaaaattggcaacaggaatgcgaaatgtaccagagaaacaaactagaggcatatttggcgcaatagcgggtttcatagaaaatggttgggagggaatggtggatggttggtacggtttcaggcatcaaaattctgagggaagaggacaagcagcagatctcaaaagcactcaagcagcaatcgatcaaatcaatgggaagctgaatcgattgatcgggaaaaccaacgagaaattccatcagattgaaaaagaattctcagaagtagaaggaagaattcaggaccttgagaaatatgttgaggacactaaaatagatctctggtcatacaacgcggagcttcttgttgccctggagaaccaacatacarttgatctaactgactcagaaatgaacaaactgtttgaaaaaacaaagaagcaactgagggaaaatgctgaggatatgggaaatggttgtttcaaaatataccacaaatgtgacaatgcctgcataggatcaataagaaatggaacttatgaccacaatgtgtacagggatgaagcattaaacaaccggttccagatcaagggagttgagctgaagtcagggtacaaagattggatcctatggatttcctytgccatatcatgttttttgctttgtgttgctttgttggggttcatcatgtgggcctgccaaaagggcaacattaggtgcaacatttgcatttga', 'GBSeq_feature-table': [DictElement({'GBFeature_quals': [DictElement({'GBQualifier_name': 'organism', 'GBQualifier_value': 'Influenza A virus (A/New Jersey/NHRC_93219/2015(H3N2))'}, attributes={}), DictElement({'GBQualifier_name': 'mol_type', 'GBQualifier_value': 'viral cRNA'}, attributes={}), DictElement({'GBQualifier_name': 'strain', 'GBQualifier_value': 'A/NewJersey/NHRC_93219/2015'}, attributes={}), DictElement({'GBQualifier_name': 'serotype', 'GBQualifier_value': 'H3N2'}, attributes={}), DictElement({'GBQualifier_name': 'isolation_source', 'GBQualifier_value': 'nasopharyngeal swab'}, attributes={}), DictElement({'GBQualifier_name': 'host', 'GBQualifier_value': 'Homo sapiens'}, attributes={}), DictElement({'GBQualifier_name': 'db_xref', 'GBQualifier_value': 'taxon:1682360'}, attributes={}), DictElement({'GBQualifier_name': 'segment', 'GBQualifier_value': '4'}, attributes={}), DictElement({'GBQualifier_name': 'lab_host', 'GBQualifier_value': 'MDCK'}, attributes={}), DictElement({'GBQualifier_name': 'country', 'GBQualifier_value': 'USA: New Jersey'}, attributes={}), DictElement({'GBQualifier_name': 'collection_date', 'GBQualifier_value': '17-Jan-2015'}, attributes={})], 'GBFeature_intervals': [DictElement({'GBInterval_to': '1701', 'GBInterval_from': '1', 'GBInterval_accession': 'KT220438.1'}, attributes={})], 'GBFeature_key': 'source', 'GBFeature_location': '1..1701'}, attributes={}), DictElement({'GBFeature_quals': [DictElement({'GBQualifier_name': 'gene', 'GBQualifier_value': 'HA'}, attributes={})], 'GBFeature_intervals': [DictElement({'GBInterval_to': '1701', 'GBInterval_from': '1', 'GBInterval_accession': 'KT220438.1'}, attributes={})], 'GBFeature_key': 'gene', 'GBFeature_location': '1..1701'}, attributes={}), DictElement({'GBFeature_quals': [DictElement({'GBQualifier_name': 'gene', 'GBQualifier_value': 'HA'}, attributes={}), DictElement({'GBQualifier_name': 'function', 'GBQualifier_value': 'receptor binding and fusion protein'}, attributes={}), DictElement({'GBQualifier_name': 'codon_start', 'GBQualifier_value': '1'}, attributes={}), DictElement({'GBQualifier_name': 'transl_table', 'GBQualifier_value': '1'}, attributes={}), DictElement({'GBQualifier_name': 'product', 'GBQualifier_value': 'hemagglutinin'}, attributes={}), DictElement({'GBQualifier_name': 'protein_id', 'GBQualifier_value': 'AKQ43545.1'}, attributes={}), DictElement({'GBQualifier_name': 'translation', 'GBQualifier_value': 'MKTIIALSYILCLVFAQKIPGNDNSTATLCLGHHAVPNGTIVKTITNDRIEVTNATELVQNSSIGEICDSPHQILDGENCTLIDALLGDPQCDGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNNESFNWTGVTQNGTSSACIRRSSSSFFSRLNWLTHLNYTYPALNVTMPNNEQFDKLYIWGVHHPGTDKDQIFLYAQSSGRITVSTKRSQQAVIPNIGSRPRIRDIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKCKSECITPNGSIPNDKPFQNVNRITYGACPRYVKHSTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDGWYGFRHQNSEGRGQAADLKSTQAAIDQINGKLNRLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTXDLTDSEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHNVYRDEALNNRFQIKGVELKSGYKDWILWISXAISCFLLCVALLGFIMWACQKGNIRCNICI'}, attributes={})], 'GBFeature_intervals': [DictElement({'GBInterval_to': '1701', 'GBInterval_from': '1', 'GBInterval_accession': 'KT220438.1'}, attributes={})], 'GBFeature_key': 'CDS', 'GBFeature_location': '1..1701'}, attributes={}), DictElement({'GBFeature_quals': [DictElement({'GBQualifier_name': 'gene', 'GBQualifier_value': 'HA'}, attributes={}), DictElement({'GBQualifier_name': 'product', 'GBQualifier_value': 'HA1'}, attributes={}), DictElement({'GBQualifier_name': 'peptide', 'GBQualifier_value': 'QKIPGNDNSTATLCLGHHAVPNGTIVKTITNDRIEVTNATELVQNSSIGEICDSPHQILDGENCTLIDALLGDPQCDGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNNESFNWTGVTQNGTSSACIRRSSSSFFSRLNWLTHLNYTYPALNVTMPNNEQFDKLYIWGVHHPGTDKDQIFLYAQSSGRITVSTKRSQQAVIPNIGSRPRIRDIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKCKSECITPNGSIPNDKPFQNVNRITYGACPRYVKHSTLKLATGMRNVPEKQTR'}, attributes={})], 'GBFeature_intervals': [DictElement({'GBInterval_to': '1035', 'GBInterval_from': '49', 'GBInterval_accession': 'KT220438.1'}, attributes={})], 'GBFeature_key': 'mat_peptide', 'GBFeature_location': '49..1035'}, attributes={}), DictElement({'GBFeature_quals': [DictElement({'GBQualifier_name': 'gene', 'GBQualifier_value': 'HA'}, attributes={}), DictElement({'GBQualifier_name': 'product', 'GBQualifier_value': 'HA2'}, attributes={}), DictElement({'GBQualifier_name': 'peptide', 'GBQualifier_value': 'GIFGAIAGFIENGWEGMVDGWYGFRHQNSEGRGQAADLKSTQAAIDQINGKLNRLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTXDLTDSEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHNVYRDEALNNRFQIKGVELKSGYKDWILWISXAISCFLLCVALLGFIMWACQKGNIRCNICI'}, attributes={})], 'GBFeature_intervals': [DictElement({'GBInterval_to': '1698', 'GBInterval_from': '1036', 'GBInterval_accession': 'KT220438.1'}, attributes={})], 'GBFeature_key': 'mat_peptide', 'GBFeature_location': '1036..1698'}, attributes={})], 'GBSeq_moltype': 'cRNA'}, attributes={})
While this output may not seem useful, we now just have a set of nested dictionaries that we can interrogate using standard Python techniques:
print(list(record.keys())) # print out all the keys in the dictionary
['GBSeq_division', 'GBSeq_comment', 'GBSeq_references', 'GBSeq_locus', 'GBSeq_strandedness', 'GBSeq_update-date', 'GBSeq_organism', 'GBSeq_accession-version', 'GBSeq_source', 'GBSeq_topology', 'GBSeq_taxonomy', 'GBSeq_other-seqids', 'GBSeq_definition', 'GBSeq_create-date', 'GBSeq_length', 'GBSeq_primary-accession', 'GBSeq_sequence', 'GBSeq_feature-table', 'GBSeq_moltype']
print(record['GBSeq_sequence']) # print out the sequence
atgaagactatcattgctttgagctacattctatgtctggttttcgctcaaaaaattcctggaaatgacaatagcacggcaacgctgtgccttgggcaccatgcagtaccaaacggaacgatagtgaaaacaatcacaaatgaccgaattgaagttactaatgctactgagctggttcagaattcctcaataggtgaaatatgcgacagtcctcatcagatccttgatggagaaaactgcacactaatagatgctctattgggagaccctcagtgtgatggctttcaaaataagaaatgggacctttttgttgaacgaagcaaagcctacagcaactgctacccttatgatgtgccggattatgcctcccttaggtcactagttgcctcatccggcacactggagtttaacaatgaaagcttcaattggactggagtcactcaaaacggaacaagttctgcttgcataaggagatctagtagtagtttctttagtagattaaattggttgacccacttaaactacacatacccagcattgaacgtgactatgccaaacaatgaacaatttgacaaattgtacatttggggggttcaccacccgggtacggacaaggaccaaatcttcctgtatgctcaatcatcaggaagaatcacagtatctaccaaaagaagccaacaagctgtaatcccaaatatcggatctagacccagaataagggatatccctagcagaataagcatctattggacaatagtaaaaccgggagacatacttttgattaacagcacagggaatctaattgctcctaggggttacttcaaaatacgaagtgggaaaagctcaataatgagatcagatgcacccattggcaaatgcaagtctgaatgcatcactccaaatggaagcattcccaatgacaaaccattccaaaatgtaaacaggatcacatacggggcctgtcccagatatgttaagcatagcactctaaaattggcaacaggaatgcgaaatgtaccagagaaacaaactagaggcatatttggcgcaatagcgggtttcatagaaaatggttgggagggaatggtggatggttggtacggtttcaggcatcaaaattctgagggaagaggacaagcagcagatctcaaaagcactcaagcagcaatcgatcaaatcaatgggaagctgaatcgattgatcgggaaaaccaacgagaaattccatcagattgaaaaagaattctcagaagtagaaggaagaattcaggaccttgagaaatatgttgaggacactaaaatagatctctggtcatacaacgcggagcttcttgttgccctggagaaccaacatacarttgatctaactgactcagaaatgaacaaactgtttgaaaaaacaaagaagcaactgagggaaaatgctgaggatatgggaaatggttgtttcaaaatataccacaaatgtgacaatgcctgcataggatcaataagaaatggaacttatgaccacaatgtgtacagggatgaagcattaaacaaccggttccagatcaagggagttgagctgaagtcagggtacaaagattggatcctatggatttcctytgccatatcatgttttttgctttgtgttgctttgttggggttcatcatgtgggcctgccaaaagggcaacattaggtgcaacatttgcatttga
features = record['GBSeq_feature-table'] # extract all the features
for feature in features: # loop over features and print feature key and feature location
print(feature['GBFeature_key'] + ": " + feature['GBFeature_location'])
source: 1..1701 gene: 1..1701 CDS: 1..1701 mat_peptide: 49..1035 mat_peptide: 1036..1698
So far we have only downloaded specific records from Entrez. In addition to just downloading records, however, we can also run searches directly from python. Any query that you can do on the Entrez website (https://www.ncbi.nlm.nih.gov/) can also be executed directly from python. This allows you to find a large number of records all at once and process them in an automated fashion.
For example, below we will see how to automatically run and retrieve the results for the following search term: "influenza a virus texas h1n1 hemagglutinin complete cds". A direct link to the search results on the Entrez website is here:
https://www.ncbi.nlm.nih.gov/nuccore/?term=influenza+a+virus+texas+h1n1+hemagglutinin+complete+cds
(Note that in the following Python code, we limit the number of search hits returned to the first 10.)
# let's do a search for influenza H1N1 viruses from Texas
handle = Entrez.esearch(db="nucleotide", # database to search
term="influenza a virus texas h1n1 hemagglutinin complete cds", # search term
retmax=10 # number of results that are returned
)
record = Entrez.read(handle)
handle.close()
gi_list = record["IdList"] # list of genbank identifiers found
print(gi_list)
['1597259311', '1597259287', '1597258925', '1597258900', '1591440116', '1591440105', '1591440078', '1590855198', '1590855179', '1590836043']
Note that even though NCBI is phasing out sequence GI numbers, for now the esearch()
function still returns GI numbers (numerical sequence identifiers without version information).
We can download all the genbank records in the list of identifiers using the Entrez.efetch()
function. This function provides us with a handle that needs to be processed with SeqIO.parse()
. (We used SeqIO.read()
previously, which reads a single record. SeqIO.parse()
reads multiple records. See here for details.)
handle = Entrez.efetch(db="nucleotide", id=gi_list, rettype="gb", retmode="text")
records = SeqIO.parse(handle, "genbank")
for record in records:
print(record.description)
handle.close() # important, close the handle only after you have iterated over the records. Otherwise you will get an error!
Influenza A virus (A/Texas/12/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/07/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/06/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/05/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/157/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/143/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/02/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/148/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/147/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/156/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
As another example, let's search the "pubmed" database (database of scientific publications) for papers from 2015 written by "Wilke CO". The exact search term we need to use is the following: "Wilke CO[Author] AND 2015[Date - Publication]"
You can click here to see the result from that search online.
handle = Entrez.esearch(db="pubmed", # database to search
term="Wilke CO[Author] AND 2015[Date - Publication]", # search term
retmax=10 # number of results that are returned
)
record = Entrez.read(handle)
handle.close()
# search returns PubMed IDs (pmids)
pmid_list = record["IdList"]
print(pmid_list)
['26770819', '26468068', '26430238', '26397960', '26355089', '26275208', '26020774', '25999509', '25787027', '25737813']
Just like with genes and genomes, we can download the records corresponding to these identifiers. They are references. We'll print the author(s), title, and reference (source).
from Bio import Medline
handle = Entrez.efetch(db="pubmed", id=pmid_list, rettype="medline", retmode="text")
records = Medline.parse(handle)
for record in records:
print(record['AU']) # author list
print(record['TI']) # title
print(record['SO']) # source (reference)
print()
handle.close()
['Meyer AG', 'Spielman SJ', 'Bedford T', 'Wilke CO'] Time dependence of evolutionary metrics during the 2009 pandemic influenza virus outbreak. Virus Evol. 2015 Jan;1(1). doi: 10.1093/ve/vev006. Epub 2015 Jan 1. ['Meyer AG', 'Wilke CO'] The utility of protein structure as a predictor of site-wise dN/dS varies widely among HIV-1 proteins. J R Soc Interface. 2015 Oct 6;12(111):20150579. doi: 10.1098/rsif.2015.0579. ['Wilke CO'] Evolutionary paths of least resistance. Proc Natl Acad Sci U S A. 2015 Oct 13;112(41):12553-4. doi: 10.1073/pnas.1517390112. Epub 2015 Oct 1. ['Spielman SJ', 'Wilke CO'] Pyvolve: A Flexible Python Module for Simulating Sequences along Phylogenies. PLoS One. 2015 Sep 23;10(9):e0139047. doi: 10.1371/journal.pone.0139047. eCollection 2015. ['Kerr SA', 'Jackson EL', 'Lungu OI', 'Meyer AG', 'Demogines A', 'Ellington AD', 'Georgiou G', 'Wilke CO', 'Sawyer SL'] Computational and Functional Analysis of the Virus-Receptor Interface Reveals Host Range Trade-Offs in New World Arenaviruses. J Virol. 2015 Nov;89(22):11643-53. doi: 10.1128/JVI.01408-15. Epub 2015 Sep 9. ['Houser JR', 'Barnhart C', 'Boutz DR', 'Carroll SM', 'Dasgupta A', 'Michener JK', 'Needham BD', 'Papoulas O', 'Sridhara V', 'Sydykova DK', 'Marx CJ', 'Trent MS', 'Barrick JE', 'Marcotte EM', 'Wilke CO'] Controlled Measurement and Comparative Analysis of Cellular Components in E. coli Reveals Broad Regulatory Changes in Response to Glucose Starvation. PLoS Comput Biol. 2015 Aug 14;11(8):e1004400. doi: 10.1371/journal.pcbi.1004400. eCollection 2015 Aug. ['Meyer AG', 'Wilke CO'] Geometric Constraints Dominate the Antigenic Evolution of Influenza H3N2 Hemagglutinin. PLoS Pathog. 2015 May 28;11(5):e1004940. doi: 10.1371/journal.ppat.1004940. eCollection 2015 May. ['Kachroo AH', 'Laurent JM', 'Yellman CM', 'Meyer AG', 'Wilke CO', 'Marcotte EM'] Evolution. Systematic humanization of yeast genes reveals conserved functions and genetic modularity. Science. 2015 May 22;348(6237):921-5. doi: 10.1126/science.aaa0769. ['Echave J', 'Jackson EL', 'Wilke CO'] Relationship between protein thermodynamic constraints and variation of evolutionary rates among sites. Phys Biol. 2015 Mar 19;12(2):025002. doi: 10.1088/1478-3975/12/2/025002. ['Spielman SJ', 'Kumar K', 'Wilke CO'] Comprehensive, structurally-informed alignment and phylogeny of vertebrate biogenic amine receptors. PeerJ. 2015 Feb 17;3:e773. doi: 10.7717/peerj.773. eCollection 2015.
Problem 1
Use the following code to download the genbank record KT220438
in XML and parse it with the Entrez.parse()
function:
# Download sequence record for genbank id KT220438 (HA from influenza A)
handle = Entrez.efetch(db="nucleotide", id="KT220438", rettype="gb", retmode="xml")
parsed = Entrez.parse(handle)
record = list(parsed)[0] # Convert the parsed contents into a list and take element 0.
handle.close()
Then:
(a) Print out the value for the key GBSeq_definition
.
(b) Find the CDS
feature and print out all its qualifiers. Note that qualifiers are provided under the keyword GBFeature_quals
.
# Problem 1a
print(record['GBSeq_definition'])
Influenza A virus (A/NewJersey/NHRC_93219/2015(H3N2)) segment 4 hemagglutinin (HA) gene, complete cds
# Problem 1b
features = record['GBSeq_feature-table'] # extract all the features
for feature in features: # loop over features and find CDS feature
if feature['GBFeature_key']=='CDS':
CDS_feature = feature
break
qualifiers = CDS_feature['GBFeature_quals']
for q in qualifiers:
print(q['GBQualifier_name'] + ": " + q['GBQualifier_value'])
gene: HA function: receptor binding and fusion protein codon_start: 1 transl_table: 1 product: hemagglutinin protein_id: AKQ43545.1 translation: MKTIIALSYILCLVFAQKIPGNDNSTATLCLGHHAVPNGTIVKTITNDRIEVTNATELVQNSSIGEICDSPHQILDGENCTLIDALLGDPQCDGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNNESFNWTGVTQNGTSSACIRRSSSSFFSRLNWLTHLNYTYPALNVTMPNNEQFDKLYIWGVHHPGTDKDQIFLYAQSSGRITVSTKRSQQAVIPNIGSRPRIRDIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKCKSECITPNGSIPNDKPFQNVNRITYGACPRYVKHSTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDGWYGFRHQNSEGRGQAADLKSTQAAIDQINGKLNRLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTXDLTDSEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHNVYRDEALNNRFQIKGVELKSGYKDWILWISXAISCFLLCVALLGFIMWACQKGNIRCNICI
Problem 2:
(a) Use an Entrez esearch query of the pubmed database to find out how many publications "Spielman SJ" wrote in 2015.
(b) From the results of part (a), compile a combined list of all the co-authors of "Spielman SJ" in 2015.
# Problem 2a
handle = Entrez.esearch(db="pubmed", # database to search
term="Spielman SJ[Author] AND 2015[Date - Publication]", # search term
retmax=10 # number of results that are returned
)
record = Entrez.read(handle)
handle.close()
# search returns PubMed IDs (pmids)
pmid_list = record["IdList"]
print("Publications found:", pmid_list)
print("Number of publications:", len(pmid_list))
Publications found: ['26770819', '26397960', '25737813', '25576365'] Number of publications: 4
# Problem 2b
from Bio import Medline
handle = Entrez.efetch(db="pubmed", id=pmid_list, rettype="medline", retmode="text")
records = Medline.parse(handle)
coauthors = [] # start with empty list of coauthors
for record in records:
au_list = record['AU']
for author in au_list:
if author != "Spielman SJ" and author not in coauthors:
coauthors.append(author)
handle.close()
print('Co-authors of "Spielman SJ" in 2015:')
for author in coauthors:
print(" ", author)
Co-authors of "Spielman SJ" in 2015: Meyer AG Bedford T Wilke CO Kumar K
Problem 3:
For larger searches, NCBI wants you to use the WebEnv method to download all your search results. This is explained in the Biopython tutorial here. Rewrite the influenza search from the section "Running search queries on through Entrez" in such a way that it uses the WebEnv method. For this downloading method, it makes sense to write all the results into a file and then read the results back in.
handle = Entrez.esearch(db="nucleotide", # database to search
term="influenza a virus texas h1n1 hemagglutinin complete cds", # search term
usehistory="y" # this switches on the WebEnv method
)
results = Entrez.read(handle)
handle.close()
# Because we ran the search with usehistory="y", the results now contain two additional
# pieces of information, the WebEnv session cookie and the QueryKey:
webenv = results["WebEnv"]
query_key = results["QueryKey"]
# We also get the number of search results:
count = int(results["Count"])
# We now download the results and store them in a local file
# Downloading happens in batches, let's download 20 results at a time:
batch_size = 20
out_handle = open("influenza_HA.gb", "w")
for start in range(0, count, batch_size):
end = min(count, start+batch_size)
print("Downloading records %i through %i" % (start+1, end))
fetch_handle = Entrez.efetch(db="nucleotide", rettype="gb", retmode="text",
retstart=start, retmax=batch_size,
webenv=webenv, query_key=query_key)
data = fetch_handle.read()
fetch_handle.close()
out_handle.write(data)
out_handle.close()
Downloading records 1 through 20 Downloading records 21 through 40 Downloading records 41 through 60 Downloading records 61 through 80 Downloading records 81 through 100 Downloading records 101 through 120 Downloading records 121 through 140 Downloading records 141 through 160 Downloading records 161 through 180 Downloading records 181 through 200 Downloading records 201 through 220 Downloading records 221 through 240 Downloading records 241 through 260 Downloading records 261 through 280 Downloading records 281 through 300 Downloading records 301 through 320 Downloading records 321 through 340 Downloading records 341 through 360 Downloading records 361 through 380 Downloading records 381 through 400 Downloading records 401 through 420 Downloading records 421 through 440 Downloading records 441 through 460 Downloading records 461 through 480 Downloading records 481 through 500 Downloading records 501 through 520 Downloading records 521 through 540 Downloading records 541 through 560 Downloading records 561 through 580 Downloading records 581 through 600 Downloading records 601 through 620 Downloading records 621 through 640 Downloading records 641 through 660 Downloading records 661 through 680 Downloading records 681 through 700 Downloading records 701 through 720 Downloading records 721 through 740 Downloading records 741 through 744
# We can now open the file we created and parse all the records we have previously stored
in_handle = open("influenza_HA.gb", "r")
records = SeqIO.parse(in_handle, "genbank")
for record in records:
print(record.description)
in_handle.close()
Influenza A virus (A/Texas/12/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/07/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/06/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/05/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/157/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/143/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/02/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/148/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/147/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/156/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/154/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/A01785919/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7939/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7923/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7921/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7920/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7918/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7917/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7916/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7914/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7913/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7911/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7910/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7908/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7906/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7904/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7902/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7901/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7898/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7895/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7949/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7947/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7938/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7937/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7936/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7931/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7930/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7929/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7928/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7926/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7925/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7924/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7912/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7907/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7905/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7903/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7900/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7899/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7897/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7896/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7894/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7893/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/152/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/151/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/140/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/A01785906/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7430/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7429/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7389/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/144/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/145/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7670/2018()) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7668/2018()) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7666/2018()) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7664/2018()) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7662/2018()) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7658/2018()) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7656/2018()) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7192/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/142/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/143/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7050/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7045/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7003/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/6992/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/6988/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/6984/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/6980/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/6979/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/6974/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/140/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/134/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/138/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/135/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/141/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/134/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/136/2018(H3N2)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/7504/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/139/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/138/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/A01785737/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/6843/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/A01785599/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/A01785528/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/06/2018(H1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/403/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/402/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/401/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/400/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/93/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/blue-winged teal/Texas/UGAI15-6706/2015(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/blue-winged teal/Texas/UGAI15-5632/2015(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/6417/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/88/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/85/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/84/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/81/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/75/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/74/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/72/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/70/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/01/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/69/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/316/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/314/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/311/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/310/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/08/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/03/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/65/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/323/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/322/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/320/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/318/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/A01785406/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/A01785377/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/316/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/314/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/310/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/311/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/300/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/295/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/294/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/295/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/294/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/A02221802/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/6180/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/6178/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/6176/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/6175/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/6170/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/281/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/28/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/160/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/13/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/157/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/150/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/146/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/144/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/143/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/140/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/115/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/114/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/113/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/112/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/111/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/110/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/109/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/108/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/107/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/106/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/105/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/104/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/103/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/102/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/101/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/100/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/100/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/99/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/98/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/94/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/93/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/79/2015(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/135/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/127/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/122/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/47/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/137/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/132/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/128/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/125/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/117/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/116/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/61/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/55/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/55/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/53/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/51/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/51/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/50/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/38/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/87/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/84/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/81/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/37/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/75/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/73/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/72/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/70/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/69/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/68/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/18/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/16/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/13/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/04/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/03/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/02/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/46/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/40/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/39/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/32/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/31/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/28/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/27/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/27/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/25/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/24/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/20/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/10/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/78/2015(H1N1)) clone 3025624445_284379_v1_4 segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/02/2016(H1N1)) clone 3000415970_284773_v1_4 segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/03/2016(H1N1)) clone 3000415977_284780_v1_4 segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/21/2016(H1N1)) clone 3025624773_ZZYPO19U_v1_4 segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/05/2016(H1N1)) clone 3000416250_285448_v1_4 segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/07/2016(H1N1)) clone 3000416252_285445_v1_4 segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/09/2016(H1N1)) clone 3025624576_285571_v1_4 segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/27/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/09/2014(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/11/2014(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/10/2014(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/16/2014(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/02/2014(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/96/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/82/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/42/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/117/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/08/2014(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/12/2014(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/07/2014(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/36/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/26/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/89/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/86/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/88/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/85/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/87/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/07/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/15/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/13/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/106/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/99/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/10/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/90/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/07/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/24/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/49/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/48/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/24/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/26/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/23/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/27/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/31/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/35/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/33/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/34/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/23/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/23/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/29/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/18/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/05/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/04/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/47/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/36/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/30/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/47/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/22/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/21/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/37/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/02/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/37/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/36/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/79/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/08/2011(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/07/2011(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/14/2010(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/01/2011(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/01/2011(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/04/2011(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/09/2011(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/10/2011(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/48/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/57/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/76/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/77/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/64/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/58/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/62/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/45/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/73/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/59/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/60/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/72/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/56/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/61/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/65/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/63/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/66/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/66/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/01/2010(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/67/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/67/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/75/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/71/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/70/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/69/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/03/2010(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/42/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/01/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/38/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/01/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/36/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/32/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/35/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/23/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/04/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/05/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/29/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/27/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/23/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/16/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/15/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/18/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/17/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/11/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/09/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/10/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/08/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/74/2007(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/78/2007(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/70/2007(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/41/2007(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/32/2007(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/31/2007(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/06/2007(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/05/2007(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/194/2016(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/89/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/63/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/41/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/101/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/83/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/103/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/86/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/81/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/79/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/A02214607/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/69/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/63/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/58/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/57/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/56/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/44/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/42/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/41/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/51/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/49/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/43/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/31/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/37/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/39/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/31/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/34/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/22/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/21/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/08/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/18/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/06/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/08/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/04/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/200/2016(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/199/2016(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/03/2017(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/200/2016(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/199/2016(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/194/2016(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/5485/2016(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/5484/2016(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/5483/2016(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/5028/2016(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/blue-winged teal/Texas/AI12-4549/2012(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/13TOSU0035/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/13TOSU0050/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/A01432097/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/UR06-0422/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/reassortant/IVR-148(Brisbane/59/2007 x Texas/1/1977)(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/13TOSU0041/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/A01410177/2014(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3689/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3688/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3687/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3686/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3685/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3684/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3683/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3682/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3681/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3680/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3679/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3678/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3677/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3676/2014(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3675/2014(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3674/2014(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3673/2014(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3672/2014(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3671/2014(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3670/2014(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3669/2014(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3668/2014(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3667/2014(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3666/2014(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3665/2014(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3664/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3663/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3662/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3661/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3660/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3571/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3570/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3520/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3515/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3513/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3512/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3510/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3509/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3508/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3475/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3474/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/3425/2013(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/00525/2005(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/SG1187/2004(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/SG1187/2004(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/SG1162/2003(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/00195/2003(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/SG1162/2003(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/00867/2005(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/00244/2004(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/SG1380/2011(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/01657/2007(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/SG1271/2007(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/01522/2007(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/mallard/Alberta/211/1998(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/ruddy turnstone/Delaware Bay/123/1994(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/JMM_53/2012(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/JMM_52/2012(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/36-JY2/1991(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/northern pintail/Texas/421716/2001(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/reassortant/IVR-148(Brisbane/59/2007 x Texas/1/1977)(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/reassortant/IVR-148(Brisbane/59/2007 x Texas/1/1977)(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/reassortant/IVR-148(Brisbane/59/2007 x Texas/1/1977)(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/A01104003/2011(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/swine/Texas/A01104004/2011(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/JMS368/2009(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/JMS360/2009(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/45034240/2009(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/JMS403/2010(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/JMS401/2010(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/JMS400/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS367/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS366/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS415/2010(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/JMS414/2010(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/JMS413/2010(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/JMS412/2010(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/JMS411/2010(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/JMS410/2010(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/JMS409/2010(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/JMS408/2010(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/JMS407/2010(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/JMS406/2010(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/JMS405/2010(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/JMS404/2010(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/JMS402/2010(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/JMS399/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS398/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS397/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS395/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS394/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS393/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS392/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS391/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS390/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS389/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS388/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS387/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS386/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS385/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS384/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS383/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS382/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS381/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS380/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS379/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS373/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS372/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS371/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS370/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS369/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS365/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS364/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS363/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS362/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS361/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS359/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS358/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS356/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/46241654/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/46240925/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/46233104/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/46224042/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/46223582/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/46223444/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/46222134/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/46221665/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/46214103/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/46201823/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/46193632/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS378/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS377/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS376/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/JMS375/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/46193311/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/46192760/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/461917783/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/46182018/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/46181292/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/46181235/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/46172734/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/46172731/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/44312415/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42211898/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42202026/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42132413/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42254309/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42103399/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42091791/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/43292238/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42151049/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42102708/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42221280/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/43143450/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42201798/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42191647/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/44302551/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/44301765/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42163291/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42303371/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42121926/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42281289/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Brownsville/37OS/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45061755/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45122886/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45032753/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45121004/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45052569/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45113911/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45032708/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45102952/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45062346/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45024243/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/44151841/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45042604/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45023717/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/44152535/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45132202/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45131305/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45072128/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45072656/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45071344/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45131576/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45072273/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45043852/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45021632/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/44282651/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45101422/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45103759/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45101424/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45091417/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45091405/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45091402/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45140902/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45093214/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45083819/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45071524/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45010998/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/44313703/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45093670/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45093846/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45103259/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45062584/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45061670/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/44302167/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/44302533/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45034157/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45033567/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45033774/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45132647/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45131774/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45132788/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45122722/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45122774/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45132214/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45113882/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45103998/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45104048/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45103737/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45104026/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45113371/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45122036/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45122282/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45122033/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45121606/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45122369/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45130742/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45122538/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45120922/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/43122467/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42192947/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/43132503/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42291877/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42142537/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42163295/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/43200999/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/43272683/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/45062633/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42114261/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42113095/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42173957/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42201824/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42122969/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42191653/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42123701/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/43011033/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/43242018/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/42152486/2009(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/47/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/29/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/30/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/05/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/04/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/20/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/39/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/reassortant/IDCDC-RG22(New York/18/2009 x Puerto Rico/8/1934)(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/reassortant/IDCDC-RG20(Texas/05/2009 x Puerto Rico/8/1934)(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/reassortant/IDCDC-RG18(Texas/05/2009 x New York/18/2009 x Puerto Rico/8/1934)(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Italy/85/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/37/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/36/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/34/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/31/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/32/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/11/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/17/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/28/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/10/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/04/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/reassortant/IDCDC-RG15(Texas/05/2009 x Puerto Rico/8/1934)(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/12/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/05/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/09/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/08/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/19/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/22/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/23/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/15/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/07/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/08/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/09/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/06/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/04/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/04/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/WRAIR1118P/2009(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/WRAIR1115P/2009(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/WRAIR1053P/2009(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/X-113(X-31B-Texas/36/1991)(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/36/91(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Wilson-Smith/1933(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/USSR/90/1977(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Texas/36/1991(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/Taiwan/01/1986(H1N1)) hemagglutinin (HA) gene, complete cds Influenza A virus (A/New Caledonia/20/1999(H1N1)) hemagglutinin (HA) gene, complete cds Synthetic construct derived from Influenza A virus (A/Texas/15/2009(H1N1)) hemagglutinin gene, complete cds Influenza A virus (A/Texas/36/91(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0323/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0156/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/36/1991(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0250/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0026/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0217/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0038/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0420/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0503/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0039/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0133/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0216/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0445/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0204/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0357/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0342/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0176/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0305/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0157/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0582/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0308/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0309/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0397/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0468/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0270/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0444/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0175/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0271/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0542/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0540/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0306/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0195/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0303/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0174/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0359/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0193/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0526/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0461/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0563/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0196/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0467/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0203/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0398/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0025/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0380/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0502/2007(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/UR06-0012/2006(H1N1)) segment 4, complete sequence Influenza A virus (A/Baylor/4052/1981(H1N1)) segment 4, complete sequence Influenza A virus (A/Texas/2922-3/1986(H1N1)) segment 4, complete sequence Influenza A virus (A/Lackland/7/1978(H1N1)) segment 4, complete sequence Influenza A virus (A/Lackland/3/1978(H1N1)) segment 4, complete sequence