Class 21: Searching the NCBI databases

April 9, 2019

Different ways of processing Entrez results

Every time we download data through the Entrez module, we can interact with the results in different ways. First, we can just use the handle we obtain as an ordinary file handle and just store or process the raw data provided by Entrez. Second, we can process the data with an appropriate, existing Biopython module. The latter will generally be preferable if an appropriate module exists. However, the various choices that are available may make things confusing.

As an example, we will again download the genbank record with the ID "KT220438", containing an influenza HA protein. We will consider four different ways of looking at the data. First, we use retmode="text" in Entrez.efetch() and just download the raw data and print it. We get a regular genbank file as output:

In [1]:
from Bio import Entrez, SeqIO
Entrez.email = "wilke@austin.utexas.edu" # put your email here

# Download sequence record for genbank id KT220438 (HA from influenza A)
# Using text mode
handle = Entrez.efetch(db="nucleotide", id="KT220438", rettype="gb", retmode="text")
record = handle.read() # read file directly
print(record)
handle.close()
LOCUS       KT220438                1701 bp    cRNA    linear   VRL 20-JUL-2015
DEFINITION  Influenza A virus (A/NewJersey/NHRC_93219/2015(H3N2)) segment 4
            hemagglutinin (HA) gene, complete cds.
ACCESSION   KT220438
VERSION     KT220438.1
KEYWORDS    .
SOURCE      Influenza A virus (A/New Jersey/NHRC_93219/2015(H3N2))
  ORGANISM  Influenza A virus (A/New Jersey/NHRC_93219/2015(H3N2))
            Viruses; ssRNA viruses; ssRNA negative-strand viruses;
            Orthomyxoviridae; Influenzavirus A.
REFERENCE   1  (bases 1 to 1701)
  AUTHORS   Sitz,C.R., Thammavong,H.L., Balansay-Ames,M.S., Hawksworth,A.W.,
            Myers,C.A. and Brice,G.T.
  TITLE     GEISS Influenza Surveillance Response Program
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 1701)
  AUTHORS   Sitz,C.R., Thammavong,H.L., Balansay-Ames,M.S., Hawksworth,A.W.,
            Myers,C.A. and Brice,G.T.
  TITLE     Direct Submission
  JOURNAL   Submitted (29-JUN-2015) Operational Infectious Diseases, Naval
            Health Research Center, 140 Sylvester Rd., San Diego, CA 92106, USA
COMMENT     ##Assembly-Data-START##
            Sequencing Technology :: Sanger dideoxy sequencing
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..1701
                     /organism="Influenza A virus (A/New
                     Jersey/NHRC_93219/2015(H3N2))"
                     /mol_type="viral cRNA"
                     /strain="A/NewJersey/NHRC_93219/2015"
                     /serotype="H3N2"
                     /isolation_source="nasopharyngeal swab"
                     /host="Homo sapiens"
                     /db_xref="taxon:1682360"
                     /segment="4"
                     /lab_host="MDCK"
                     /country="USA: New Jersey"
                     /collection_date="17-Jan-2015"
     gene            1..1701
                     /gene="HA"
     CDS             1..1701
                     /gene="HA"
                     /function="receptor binding and fusion protein"
                     /codon_start=1
                     /product="hemagglutinin"
                     /protein_id="AKQ43545.1"
                     /translation="MKTIIALSYILCLVFAQKIPGNDNSTATLCLGHHAVPNGTIVKT
                     ITNDRIEVTNATELVQNSSIGEICDSPHQILDGENCTLIDALLGDPQCDGFQNKKWDL
                     FVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNNESFNWTGVTQNGTSSACIRRSS
                     SSFFSRLNWLTHLNYTYPALNVTMPNNEQFDKLYIWGVHHPGTDKDQIFLYAQSSGRI
                     TVSTKRSQQAVIPNIGSRPRIRDIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKI
                     RSGKSSIMRSDAPIGKCKSECITPNGSIPNDKPFQNVNRITYGACPRYVKHSTLKLAT
                     GMRNVPEKQTRGIFGAIAGFIENGWEGMVDGWYGFRHQNSEGRGQAADLKSTQAAIDQ
                     INGKLNRLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQ
                     HTXDLTDSEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHNVYR
                     DEALNNRFQIKGVELKSGYKDWILWISXAISCFLLCVALLGFIMWACQKGNIRCNICI
                     "
     mat_peptide     49..1035
                     /gene="HA"
                     /product="HA1"
     mat_peptide     1036..1698
                     /gene="HA"
                     /product="HA2"
ORIGIN      
        1 atgaagacta tcattgcttt gagctacatt ctatgtctgg ttttcgctca aaaaattcct
       61 ggaaatgaca atagcacggc aacgctgtgc cttgggcacc atgcagtacc aaacggaacg
      121 atagtgaaaa caatcacaaa tgaccgaatt gaagttacta atgctactga gctggttcag
      181 aattcctcaa taggtgaaat atgcgacagt cctcatcaga tccttgatgg agaaaactgc
      241 acactaatag atgctctatt gggagaccct cagtgtgatg gctttcaaaa taagaaatgg
      301 gacctttttg ttgaacgaag caaagcctac agcaactgct acccttatga tgtgccggat
      361 tatgcctccc ttaggtcact agttgcctca tccggcacac tggagtttaa caatgaaagc
      421 ttcaattgga ctggagtcac tcaaaacgga acaagttctg cttgcataag gagatctagt
      481 agtagtttct ttagtagatt aaattggttg acccacttaa actacacata cccagcattg
      541 aacgtgacta tgccaaacaa tgaacaattt gacaaattgt acatttgggg ggttcaccac
      601 ccgggtacgg acaaggacca aatcttcctg tatgctcaat catcaggaag aatcacagta
      661 tctaccaaaa gaagccaaca agctgtaatc ccaaatatcg gatctagacc cagaataagg
      721 gatatcccta gcagaataag catctattgg acaatagtaa aaccgggaga catacttttg
      781 attaacagca cagggaatct aattgctcct aggggttact tcaaaatacg aagtgggaaa
      841 agctcaataa tgagatcaga tgcacccatt ggcaaatgca agtctgaatg catcactcca
      901 aatggaagca ttcccaatga caaaccattc caaaatgtaa acaggatcac atacggggcc
      961 tgtcccagat atgttaagca tagcactcta aaattggcaa caggaatgcg aaatgtacca
     1021 gagaaacaaa ctagaggcat atttggcgca atagcgggtt tcatagaaaa tggttgggag
     1081 ggaatggtgg atggttggta cggtttcagg catcaaaatt ctgagggaag aggacaagca
     1141 gcagatctca aaagcactca agcagcaatc gatcaaatca atgggaagct gaatcgattg
     1201 atcgggaaaa ccaacgagaa attccatcag attgaaaaag aattctcaga agtagaagga
     1261 agaattcagg accttgagaa atatgttgag gacactaaaa tagatctctg gtcatacaac
     1321 gcggagcttc ttgttgccct ggagaaccaa catacarttg atctaactga ctcagaaatg
     1381 aacaaactgt ttgaaaaaac aaagaagcaa ctgagggaaa atgctgagga tatgggaaat
     1441 ggttgtttca aaatatacca caaatgtgac aatgcctgca taggatcaat aagaaatgga
     1501 acttatgacc acaatgtgta cagggatgaa gcattaaaca accggttcca gatcaaggga
     1561 gttgagctga agtcagggta caaagattgg atcctatgga tttcctytgc catatcatgt
     1621 tttttgcttt gtgttgcttt gttggggttc atcatgtggg cctgccaaaa gggcaacatt
     1681 aggtgcaaca tttgcatttg a
//


We can also, as we have done before, process this file using the SeqIO module:

In [2]:
# Download sequence record for genbank id KT220438 (HA from influenza A)
# Using text mode
handle = Entrez.efetch(db="nucleotide", id="KT220438", rettype="gb", retmode="text")
record = SeqIO.read(handle, "genbank") # parse with SeqIO
print(record)
handle.close()
ID: KT220438.1
Name: KT220438
Description: Influenza A virus (A/NewJersey/NHRC_93219/2015(H3N2)) segment 4 hemagglutinin (HA) gene, complete cds
Number of features: 5
/structured_comment=OrderedDict([('Assembly-Data', OrderedDict([('Sequencing Technology', 'Sanger dideoxy sequencing')]))])
/date=20-JUL-2015
/molecule_type=cRNA
/data_file_division=VRL
/keywords=['']
/accessions=['KT220438']
/topology=linear
/taxonomy=['Viruses', 'ssRNA viruses', 'ssRNA negative-strand viruses', 'Orthomyxoviridae', 'Influenzavirus A']
/source=Influenza A virus (A/New Jersey/NHRC_93219/2015(H3N2))
/organism=Influenza A virus (A/New Jersey/NHRC_93219/2015(H3N2))
/sequence_version=1
/references=[Reference(title='GEISS Influenza Surveillance Response Program', ...), Reference(title='Direct Submission', ...)]
Seq('ATGAAGACTATCATTGCTTTGAGCTACATTCTATGTCTGGTTTTCGCTCAAAAA...TGA', IUPACAmbiguousDNA())

In addition to text mode, we can also download the data in XML mode. XML is a structured data format that allows for easy machine-processing of complex data files. If we just print the raw data, though, it doesn't look appealing:

In [3]:
# Download sequence record for genbank id KT220438 (HA from influenza A)
# Using XML mode
handle = Entrez.efetch(db="nucleotide", id="KT220438", rettype="gb", retmode="xml")
record = handle.read() # read file directly
print(record)
handle.close()
<?xml version="1.0" encoding="UTF-8"  ?>
<!DOCTYPE GBSet PUBLIC "-//NCBI//NCBI GBSeq/EN" "https://www.ncbi.nlm.nih.gov/dtd/NCBI_GBSeq.dtd">
<GBSet>
  <GBSeq>

    <GBSeq_locus>KT220438</GBSeq_locus>
    <GBSeq_length>1701</GBSeq_length>
    <GBSeq_strandedness>single</GBSeq_strandedness>
    <GBSeq_moltype>cRNA</GBSeq_moltype>
    <GBSeq_topology>linear</GBSeq_topology>
    <GBSeq_division>VRL</GBSeq_division>
    <GBSeq_update-date>20-JUL-2015</GBSeq_update-date>
    <GBSeq_create-date>20-JUL-2015</GBSeq_create-date>
    <GBSeq_definition>Influenza A virus (A/NewJersey/NHRC_93219/2015(H3N2)) segment 4 hemagglutinin (HA) gene, complete cds</GBSeq_definition>
    <GBSeq_primary-accession>KT220438</GBSeq_primary-accession>
    <GBSeq_accession-version>KT220438.1</GBSeq_accession-version>
    <GBSeq_other-seqids>
      <GBSeqid>gb|KT220438.1|</GBSeqid>
      <GBSeqid>gi|887493048</GBSeqid>
    </GBSeq_other-seqids>
    <GBSeq_source>Influenza A virus (A/New Jersey/NHRC_93219/2015(H3N2))</GBSeq_source>
    <GBSeq_organism>Influenza A virus (A/New Jersey/NHRC_93219/2015(H3N2))</GBSeq_organism>
    <GBSeq_taxonomy>Viruses; ssRNA viruses; ssRNA negative-strand viruses; Orthomyxoviridae; Influenzavirus A</GBSeq_taxonomy>
    <GBSeq_references>
      <GBReference>
        <GBReference_reference>1</GBReference_reference>
        <GBReference_position>1..1701</GBReference_position>
        <GBReference_authors>
          <GBAuthor>Sitz,C.R.</GBAuthor>
          <GBAuthor>Thammavong,H.L.</GBAuthor>
          <GBAuthor>Balansay-Ames,M.S.</GBAuthor>
          <GBAuthor>Hawksworth,A.W.</GBAuthor>
          <GBAuthor>Myers,C.A.</GBAuthor>
          <GBAuthor>Brice,G.T.</GBAuthor>
        </GBReference_authors>
        <GBReference_title>GEISS Influenza Surveillance Response Program</GBReference_title>
        <GBReference_journal>Unpublished</GBReference_journal>
      </GBReference>
      <GBReference>
        <GBReference_reference>2</GBReference_reference>
        <GBReference_position>1..1701</GBReference_position>
        <GBReference_authors>
          <GBAuthor>Sitz,C.R.</GBAuthor>
          <GBAuthor>Thammavong,H.L.</GBAuthor>
          <GBAuthor>Balansay-Ames,M.S.</GBAuthor>
          <GBAuthor>Hawksworth,A.W.</GBAuthor>
          <GBAuthor>Myers,C.A.</GBAuthor>
          <GBAuthor>Brice,G.T.</GBAuthor>
        </GBReference_authors>
        <GBReference_title>Direct Submission</GBReference_title>
        <GBReference_journal>Submitted (29-JUN-2015) Operational Infectious Diseases, Naval Health Research Center, 140 Sylvester Rd., San Diego, CA 92106, USA</GBReference_journal>
      </GBReference>
    </GBSeq_references>
    <GBSeq_comment>##Assembly-Data-START## ; Sequencing Technology :: Sanger dideoxy sequencing ; ##Assembly-Data-END##</GBSeq_comment>
    <GBSeq_feature-table>
      <GBFeature>
        <GBFeature_key>source</GBFeature_key>
        <GBFeature_location>1..1701</GBFeature_location>
        <GBFeature_intervals>
          <GBInterval>
            <GBInterval_from>1</GBInterval_from>
            <GBInterval_to>1701</GBInterval_to>
            <GBInterval_accession>KT220438.1</GBInterval_accession>
          </GBInterval>
        </GBFeature_intervals>
        <GBFeature_quals>
          <GBQualifier>
            <GBQualifier_name>organism</GBQualifier_name>
            <GBQualifier_value>Influenza A virus (A/New Jersey/NHRC_93219/2015(H3N2))</GBQualifier_value>
          </GBQualifier>
          <GBQualifier>
            <GBQualifier_name>mol_type</GBQualifier_name>
            <GBQualifier_value>viral cRNA</GBQualifier_value>
          </GBQualifier>
          <GBQualifier>
            <GBQualifier_name>strain</GBQualifier_name>
            <GBQualifier_value>A/NewJersey/NHRC_93219/2015</GBQualifier_value>
          </GBQualifier>
          <GBQualifier>
            <GBQualifier_name>serotype</GBQualifier_name>
            <GBQualifier_value>H3N2</GBQualifier_value>
          </GBQualifier>
          <GBQualifier>
            <GBQualifier_name>isolation_source</GBQualifier_name>
            <GBQualifier_value>nasopharyngeal swab</GBQualifier_value>
          </GBQualifier>
          <GBQualifier>
            <GBQualifier_name>host</GBQualifier_name>
            <GBQualifier_value>Homo sapiens</GBQualifier_value>
          </GBQualifier>
          <GBQualifier>
            <GBQualifier_name>db_xref</GBQualifier_name>
            <GBQualifier_value>taxon:1682360</GBQualifier_value>
          </GBQualifier>
          <GBQualifier>
            <GBQualifier_name>segment</GBQualifier_name>
            <GBQualifier_value>4</GBQualifier_value>
          </GBQualifier>
          <GBQualifier>
            <GBQualifier_name>lab_host</GBQualifier_name>
            <GBQualifier_value>MDCK</GBQualifier_value>
          </GBQualifier>
          <GBQualifier>
            <GBQualifier_name>country</GBQualifier_name>
            <GBQualifier_value>USA: New Jersey</GBQualifier_value>
          </GBQualifier>
          <GBQualifier>
            <GBQualifier_name>collection_date</GBQualifier_name>
            <GBQualifier_value>17-Jan-2015</GBQualifier_value>
          </GBQualifier>
        </GBFeature_quals>
      </GBFeature>
      <GBFeature>
        <GBFeature_key>gene</GBFeature_key>
        <GBFeature_location>1..1701</GBFeature_location>
        <GBFeature_intervals>
          <GBInterval>
            <GBInterval_from>1</GBInterval_from>
            <GBInterval_to>1701</GBInterval_to>
            <GBInterval_accession>KT220438.1</GBInterval_accession>
          </GBInterval>
        </GBFeature_intervals>
        <GBFeature_quals>
          <GBQualifier>
            <GBQualifier_name>gene</GBQualifier_name>
            <GBQualifier_value>HA</GBQualifier_value>
          </GBQualifier>
        </GBFeature_quals>
      </GBFeature>
      <GBFeature>
        <GBFeature_key>CDS</GBFeature_key>
        <GBFeature_location>1..1701</GBFeature_location>
        <GBFeature_intervals>
          <GBInterval>
            <GBInterval_from>1</GBInterval_from>
            <GBInterval_to>1701</GBInterval_to>
            <GBInterval_accession>KT220438.1</GBInterval_accession>
          </GBInterval>
        </GBFeature_intervals>
        <GBFeature_quals>
          <GBQualifier>
            <GBQualifier_name>gene</GBQualifier_name>
            <GBQualifier_value>HA</GBQualifier_value>
          </GBQualifier>
          <GBQualifier>
            <GBQualifier_name>function</GBQualifier_name>
            <GBQualifier_value>receptor binding and fusion protein</GBQualifier_value>
          </GBQualifier>
          <GBQualifier>
            <GBQualifier_name>codon_start</GBQualifier_name>
            <GBQualifier_value>1</GBQualifier_value>
          </GBQualifier>
          <GBQualifier>
            <GBQualifier_name>transl_table</GBQualifier_name>
            <GBQualifier_value>1</GBQualifier_value>
          </GBQualifier>
          <GBQualifier>
            <GBQualifier_name>product</GBQualifier_name>
            <GBQualifier_value>hemagglutinin</GBQualifier_value>
          </GBQualifier>
          <GBQualifier>
            <GBQualifier_name>protein_id</GBQualifier_name>
            <GBQualifier_value>AKQ43545.1</GBQualifier_value>
          </GBQualifier>
          <GBQualifier>
            <GBQualifier_name>translation</GBQualifier_name>
            <GBQualifier_value>MKTIIALSYILCLVFAQKIPGNDNSTATLCLGHHAVPNGTIVKTITNDRIEVTNATELVQNSSIGEICDSPHQILDGENCTLIDALLGDPQCDGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNNESFNWTGVTQNGTSSACIRRSSSSFFSRLNWLTHLNYTYPALNVTMPNNEQFDKLYIWGVHHPGTDKDQIFLYAQSSGRITVSTKRSQQAVIPNIGSRPRIRDIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKCKSECITPNGSIPNDKPFQNVNRITYGACPRYVKHSTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDGWYGFRHQNSEGRGQAADLKSTQAAIDQINGKLNRLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTXDLTDSEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHNVYRDEALNNRFQIKGVELKSGYKDWILWISXAISCFLLCVALLGFIMWACQKGNIRCNICI</GBQualifier_value>
          </GBQualifier>
        </GBFeature_quals>
      </GBFeature>
      <GBFeature>
        <GBFeature_key>mat_peptide</GBFeature_key>
        <GBFeature_location>49..1035</GBFeature_location>
        <GBFeature_intervals>
          <GBInterval>
            <GBInterval_from>49</GBInterval_from>
            <GBInterval_to>1035</GBInterval_to>
            <GBInterval_accession>KT220438.1</GBInterval_accession>
          </GBInterval>
        </GBFeature_intervals>
        <GBFeature_quals>
          <GBQualifier>
            <GBQualifier_name>gene</GBQualifier_name>
            <GBQualifier_value>HA</GBQualifier_value>
          </GBQualifier>
          <GBQualifier>
            <GBQualifier_name>product</GBQualifier_name>
            <GBQualifier_value>HA1</GBQualifier_value>
          </GBQualifier>
          <GBQualifier>
            <GBQualifier_name>peptide</GBQualifier_name>
            <GBQualifier_value>QKIPGNDNSTATLCLGHHAVPNGTIVKTITNDRIEVTNATELVQNSSIGEICDSPHQILDGENCTLIDALLGDPQCDGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNNESFNWTGVTQNGTSSACIRRSSSSFFSRLNWLTHLNYTYPALNVTMPNNEQFDKLYIWGVHHPGTDKDQIFLYAQSSGRITVSTKRSQQAVIPNIGSRPRIRDIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKCKSECITPNGSIPNDKPFQNVNRITYGACPRYVKHSTLKLATGMRNVPEKQTR</GBQualifier_value>
          </GBQualifier>
        </GBFeature_quals>
      </GBFeature>
      <GBFeature>
        <GBFeature_key>mat_peptide</GBFeature_key>
        <GBFeature_location>1036..1698</GBFeature_location>
        <GBFeature_intervals>
          <GBInterval>
            <GBInterval_from>1036</GBInterval_from>
            <GBInterval_to>1698</GBInterval_to>
            <GBInterval_accession>KT220438.1</GBInterval_accession>
          </GBInterval>
        </GBFeature_intervals>
        <GBFeature_quals>
          <GBQualifier>
            <GBQualifier_name>gene</GBQualifier_name>
            <GBQualifier_value>HA</GBQualifier_value>
          </GBQualifier>
          <GBQualifier>
            <GBQualifier_name>product</GBQualifier_name>
            <GBQualifier_value>HA2</GBQualifier_value>
          </GBQualifier>
          <GBQualifier>
            <GBQualifier_name>peptide</GBQualifier_name>
            <GBQualifier_value>GIFGAIAGFIENGWEGMVDGWYGFRHQNSEGRGQAADLKSTQAAIDQINGKLNRLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTXDLTDSEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHNVYRDEALNNRFQIKGVELKSGYKDWILWISXAISCFLLCVALLGFIMWACQKGNIRCNICI</GBQualifier_value>
          </GBQualifier>
        </GBFeature_quals>
      </GBFeature>
    </GBSeq_feature-table>
    <GBSeq_sequence>atgaagactatcattgctttgagctacattctatgtctggttttcgctcaaaaaattcctggaaatgacaatagcacggcaacgctgtgccttgggcaccatgcagtaccaaacggaacgatagtgaaaacaatcacaaatgaccgaattgaagttactaatgctactgagctggttcagaattcctcaataggtgaaatatgcgacagtcctcatcagatccttgatggagaaaactgcacactaatagatgctctattgggagaccctcagtgtgatggctttcaaaataagaaatgggacctttttgttgaacgaagcaaagcctacagcaactgctacccttatgatgtgccggattatgcctcccttaggtcactagttgcctcatccggcacactggagtttaacaatgaaagcttcaattggactggagtcactcaaaacggaacaagttctgcttgcataaggagatctagtagtagtttctttagtagattaaattggttgacccacttaaactacacatacccagcattgaacgtgactatgccaaacaatgaacaatttgacaaattgtacatttggggggttcaccacccgggtacggacaaggaccaaatcttcctgtatgctcaatcatcaggaagaatcacagtatctaccaaaagaagccaacaagctgtaatcccaaatatcggatctagacccagaataagggatatccctagcagaataagcatctattggacaatagtaaaaccgggagacatacttttgattaacagcacagggaatctaattgctcctaggggttacttcaaaatacgaagtgggaaaagctcaataatgagatcagatgcacccattggcaaatgcaagtctgaatgcatcactccaaatggaagcattcccaatgacaaaccattccaaaatgtaaacaggatcacatacggggcctgtcccagatatgttaagcatagcactctaaaattggcaacaggaatgcgaaatgtaccagagaaacaaactagaggcatatttggcgcaatagcgggtttcatagaaaatggttgggagggaatggtggatggttggtacggtttcaggcatcaaaattctgagggaagaggacaagcagcagatctcaaaagcactcaagcagcaatcgatcaaatcaatgggaagctgaatcgattgatcgggaaaaccaacgagaaattccatcagattgaaaaagaattctcagaagtagaaggaagaattcaggaccttgagaaatatgttgaggacactaaaatagatctctggtcatacaacgcggagcttcttgttgccctggagaaccaacatacarttgatctaactgactcagaaatgaacaaactgtttgaaaaaacaaagaagcaactgagggaaaatgctgaggatatgggaaatggttgtttcaaaatataccacaaatgtgacaatgcctgcataggatcaataagaaatggaacttatgaccacaatgtgtacagggatgaagcattaaacaaccggttccagatcaagggagttgagctgaagtcagggtacaaagattggatcctatggatttcctytgccatatcatgttttttgctttgtgttgctttgttggggttcatcatgtgggcctgccaaaagggcaacattaggtgcaacatttgcatttga</GBSeq_sequence>
  </GBSeq>

</GBSet>

The advantage of XML mode is that there is the generic Entrez.parse() function that can parse XML files returned from Entrez.efetch(). Also, some modules in Biopython cannot work with files obtained in text mode, they can only work on files obtained in XML mode. The documentation will generally tell you for each module what kind of input it expects.

Reading the above example with Entrez.parse() gives us the following:

In [4]:
# Download sequence record for genbank id KT220438 (HA from influenza A)
handle = Entrez.efetch(db="nucleotide", id="KT220438", rettype="gb", retmode="xml")
parsed = Entrez.parse(handle)
record = list(parsed)[0] # We need to convert the parsed contents into a list. Here we want just the 0th element.
handle.close()
print(record)
DictElement({'GBSeq_division': 'VRL', 'GBSeq_comment': '##Assembly-Data-START## ; Sequencing Technology :: Sanger dideoxy sequencing ; ##Assembly-Data-END##', 'GBSeq_references': [DictElement({'GBReference_reference': '1', 'GBReference_position': '1..1701', 'GBReference_title': 'GEISS Influenza Surveillance Response Program', 'GBReference_authors': ['Sitz,C.R.', 'Thammavong,H.L.', 'Balansay-Ames,M.S.', 'Hawksworth,A.W.', 'Myers,C.A.', 'Brice,G.T.'], 'GBReference_journal': 'Unpublished'}, attributes={}), DictElement({'GBReference_reference': '2', 'GBReference_position': '1..1701', 'GBReference_title': 'Direct Submission', 'GBReference_authors': ['Sitz,C.R.', 'Thammavong,H.L.', 'Balansay-Ames,M.S.', 'Hawksworth,A.W.', 'Myers,C.A.', 'Brice,G.T.'], 'GBReference_journal': 'Submitted (29-JUN-2015) Operational Infectious Diseases, Naval Health Research Center, 140 Sylvester Rd., San Diego, CA 92106, USA'}, attributes={})], 'GBSeq_locus': 'KT220438', 'GBSeq_strandedness': 'single', 'GBSeq_update-date': '20-JUL-2015', 'GBSeq_organism': 'Influenza A virus (A/New Jersey/NHRC_93219/2015(H3N2))', 'GBSeq_accession-version': 'KT220438.1', 'GBSeq_source': 'Influenza A virus (A/New Jersey/NHRC_93219/2015(H3N2))', 'GBSeq_topology': 'linear', 'GBSeq_taxonomy': 'Viruses; ssRNA viruses; ssRNA negative-strand viruses; Orthomyxoviridae; Influenzavirus A', 'GBSeq_other-seqids': ['gb|KT220438.1|', 'gi|887493048'], 'GBSeq_definition': 'Influenza A virus (A/NewJersey/NHRC_93219/2015(H3N2)) segment 4 hemagglutinin (HA) gene, complete cds', 'GBSeq_create-date': '20-JUL-2015', 'GBSeq_length': '1701', 'GBSeq_primary-accession': 'KT220438', 'GBSeq_sequence': 'atgaagactatcattgctttgagctacattctatgtctggttttcgctcaaaaaattcctggaaatgacaatagcacggcaacgctgtgccttgggcaccatgcagtaccaaacggaacgatagtgaaaacaatcacaaatgaccgaattgaagttactaatgctactgagctggttcagaattcctcaataggtgaaatatgcgacagtcctcatcagatccttgatggagaaaactgcacactaatagatgctctattgggagaccctcagtgtgatggctttcaaaataagaaatgggacctttttgttgaacgaagcaaagcctacagcaactgctacccttatgatgtgccggattatgcctcccttaggtcactagttgcctcatccggcacactggagtttaacaatgaaagcttcaattggactggagtcactcaaaacggaacaagttctgcttgcataaggagatctagtagtagtttctttagtagattaaattggttgacccacttaaactacacatacccagcattgaacgtgactatgccaaacaatgaacaatttgacaaattgtacatttggggggttcaccacccgggtacggacaaggaccaaatcttcctgtatgctcaatcatcaggaagaatcacagtatctaccaaaagaagccaacaagctgtaatcccaaatatcggatctagacccagaataagggatatccctagcagaataagcatctattggacaatagtaaaaccgggagacatacttttgattaacagcacagggaatctaattgctcctaggggttacttcaaaatacgaagtgggaaaagctcaataatgagatcagatgcacccattggcaaatgcaagtctgaatgcatcactccaaatggaagcattcccaatgacaaaccattccaaaatgtaaacaggatcacatacggggcctgtcccagatatgttaagcatagcactctaaaattggcaacaggaatgcgaaatgtaccagagaaacaaactagaggcatatttggcgcaatagcgggtttcatagaaaatggttgggagggaatggtggatggttggtacggtttcaggcatcaaaattctgagggaagaggacaagcagcagatctcaaaagcactcaagcagcaatcgatcaaatcaatgggaagctgaatcgattgatcgggaaaaccaacgagaaattccatcagattgaaaaagaattctcagaagtagaaggaagaattcaggaccttgagaaatatgttgaggacactaaaatagatctctggtcatacaacgcggagcttcttgttgccctggagaaccaacatacarttgatctaactgactcagaaatgaacaaactgtttgaaaaaacaaagaagcaactgagggaaaatgctgaggatatgggaaatggttgtttcaaaatataccacaaatgtgacaatgcctgcataggatcaataagaaatggaacttatgaccacaatgtgtacagggatgaagcattaaacaaccggttccagatcaagggagttgagctgaagtcagggtacaaagattggatcctatggatttcctytgccatatcatgttttttgctttgtgttgctttgttggggttcatcatgtgggcctgccaaaagggcaacattaggtgcaacatttgcatttga', 'GBSeq_feature-table': [DictElement({'GBFeature_quals': [DictElement({'GBQualifier_name': 'organism', 'GBQualifier_value': 'Influenza A virus (A/New Jersey/NHRC_93219/2015(H3N2))'}, attributes={}), DictElement({'GBQualifier_name': 'mol_type', 'GBQualifier_value': 'viral cRNA'}, attributes={}), DictElement({'GBQualifier_name': 'strain', 'GBQualifier_value': 'A/NewJersey/NHRC_93219/2015'}, attributes={}), DictElement({'GBQualifier_name': 'serotype', 'GBQualifier_value': 'H3N2'}, attributes={}), DictElement({'GBQualifier_name': 'isolation_source', 'GBQualifier_value': 'nasopharyngeal swab'}, attributes={}), DictElement({'GBQualifier_name': 'host', 'GBQualifier_value': 'Homo sapiens'}, attributes={}), DictElement({'GBQualifier_name': 'db_xref', 'GBQualifier_value': 'taxon:1682360'}, attributes={}), DictElement({'GBQualifier_name': 'segment', 'GBQualifier_value': '4'}, attributes={}), DictElement({'GBQualifier_name': 'lab_host', 'GBQualifier_value': 'MDCK'}, attributes={}), DictElement({'GBQualifier_name': 'country', 'GBQualifier_value': 'USA: New Jersey'}, attributes={}), DictElement({'GBQualifier_name': 'collection_date', 'GBQualifier_value': '17-Jan-2015'}, attributes={})], 'GBFeature_intervals': [DictElement({'GBInterval_to': '1701', 'GBInterval_from': '1', 'GBInterval_accession': 'KT220438.1'}, attributes={})], 'GBFeature_key': 'source', 'GBFeature_location': '1..1701'}, attributes={}), DictElement({'GBFeature_quals': [DictElement({'GBQualifier_name': 'gene', 'GBQualifier_value': 'HA'}, attributes={})], 'GBFeature_intervals': [DictElement({'GBInterval_to': '1701', 'GBInterval_from': '1', 'GBInterval_accession': 'KT220438.1'}, attributes={})], 'GBFeature_key': 'gene', 'GBFeature_location': '1..1701'}, attributes={}), DictElement({'GBFeature_quals': [DictElement({'GBQualifier_name': 'gene', 'GBQualifier_value': 'HA'}, attributes={}), DictElement({'GBQualifier_name': 'function', 'GBQualifier_value': 'receptor binding and fusion protein'}, attributes={}), DictElement({'GBQualifier_name': 'codon_start', 'GBQualifier_value': '1'}, attributes={}), DictElement({'GBQualifier_name': 'transl_table', 'GBQualifier_value': '1'}, attributes={}), DictElement({'GBQualifier_name': 'product', 'GBQualifier_value': 'hemagglutinin'}, attributes={}), DictElement({'GBQualifier_name': 'protein_id', 'GBQualifier_value': 'AKQ43545.1'}, attributes={}), DictElement({'GBQualifier_name': 'translation', 'GBQualifier_value': 'MKTIIALSYILCLVFAQKIPGNDNSTATLCLGHHAVPNGTIVKTITNDRIEVTNATELVQNSSIGEICDSPHQILDGENCTLIDALLGDPQCDGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNNESFNWTGVTQNGTSSACIRRSSSSFFSRLNWLTHLNYTYPALNVTMPNNEQFDKLYIWGVHHPGTDKDQIFLYAQSSGRITVSTKRSQQAVIPNIGSRPRIRDIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKCKSECITPNGSIPNDKPFQNVNRITYGACPRYVKHSTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDGWYGFRHQNSEGRGQAADLKSTQAAIDQINGKLNRLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTXDLTDSEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHNVYRDEALNNRFQIKGVELKSGYKDWILWISXAISCFLLCVALLGFIMWACQKGNIRCNICI'}, attributes={})], 'GBFeature_intervals': [DictElement({'GBInterval_to': '1701', 'GBInterval_from': '1', 'GBInterval_accession': 'KT220438.1'}, attributes={})], 'GBFeature_key': 'CDS', 'GBFeature_location': '1..1701'}, attributes={}), DictElement({'GBFeature_quals': [DictElement({'GBQualifier_name': 'gene', 'GBQualifier_value': 'HA'}, attributes={}), DictElement({'GBQualifier_name': 'product', 'GBQualifier_value': 'HA1'}, attributes={}), DictElement({'GBQualifier_name': 'peptide', 'GBQualifier_value': 'QKIPGNDNSTATLCLGHHAVPNGTIVKTITNDRIEVTNATELVQNSSIGEICDSPHQILDGENCTLIDALLGDPQCDGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNNESFNWTGVTQNGTSSACIRRSSSSFFSRLNWLTHLNYTYPALNVTMPNNEQFDKLYIWGVHHPGTDKDQIFLYAQSSGRITVSTKRSQQAVIPNIGSRPRIRDIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKCKSECITPNGSIPNDKPFQNVNRITYGACPRYVKHSTLKLATGMRNVPEKQTR'}, attributes={})], 'GBFeature_intervals': [DictElement({'GBInterval_to': '1035', 'GBInterval_from': '49', 'GBInterval_accession': 'KT220438.1'}, attributes={})], 'GBFeature_key': 'mat_peptide', 'GBFeature_location': '49..1035'}, attributes={}), DictElement({'GBFeature_quals': [DictElement({'GBQualifier_name': 'gene', 'GBQualifier_value': 'HA'}, attributes={}), DictElement({'GBQualifier_name': 'product', 'GBQualifier_value': 'HA2'}, attributes={}), DictElement({'GBQualifier_name': 'peptide', 'GBQualifier_value': 'GIFGAIAGFIENGWEGMVDGWYGFRHQNSEGRGQAADLKSTQAAIDQINGKLNRLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTXDLTDSEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHNVYRDEALNNRFQIKGVELKSGYKDWILWISXAISCFLLCVALLGFIMWACQKGNIRCNICI'}, attributes={})], 'GBFeature_intervals': [DictElement({'GBInterval_to': '1698', 'GBInterval_from': '1036', 'GBInterval_accession': 'KT220438.1'}, attributes={})], 'GBFeature_key': 'mat_peptide', 'GBFeature_location': '1036..1698'}, attributes={})], 'GBSeq_moltype': 'cRNA'}, attributes={})

While this output may not seem useful, we now just have a set of nested dictionaries that we can interrogate using standard Python techniques:

In [5]:
print(list(record.keys())) # print out all the keys in the dictionary
['GBSeq_division', 'GBSeq_comment', 'GBSeq_references', 'GBSeq_locus', 'GBSeq_strandedness', 'GBSeq_update-date', 'GBSeq_organism', 'GBSeq_accession-version', 'GBSeq_source', 'GBSeq_topology', 'GBSeq_taxonomy', 'GBSeq_other-seqids', 'GBSeq_definition', 'GBSeq_create-date', 'GBSeq_length', 'GBSeq_primary-accession', 'GBSeq_sequence', 'GBSeq_feature-table', 'GBSeq_moltype']
In [6]:
print(record['GBSeq_sequence']) # print out the sequence
atgaagactatcattgctttgagctacattctatgtctggttttcgctcaaaaaattcctggaaatgacaatagcacggcaacgctgtgccttgggcaccatgcagtaccaaacggaacgatagtgaaaacaatcacaaatgaccgaattgaagttactaatgctactgagctggttcagaattcctcaataggtgaaatatgcgacagtcctcatcagatccttgatggagaaaactgcacactaatagatgctctattgggagaccctcagtgtgatggctttcaaaataagaaatgggacctttttgttgaacgaagcaaagcctacagcaactgctacccttatgatgtgccggattatgcctcccttaggtcactagttgcctcatccggcacactggagtttaacaatgaaagcttcaattggactggagtcactcaaaacggaacaagttctgcttgcataaggagatctagtagtagtttctttagtagattaaattggttgacccacttaaactacacatacccagcattgaacgtgactatgccaaacaatgaacaatttgacaaattgtacatttggggggttcaccacccgggtacggacaaggaccaaatcttcctgtatgctcaatcatcaggaagaatcacagtatctaccaaaagaagccaacaagctgtaatcccaaatatcggatctagacccagaataagggatatccctagcagaataagcatctattggacaatagtaaaaccgggagacatacttttgattaacagcacagggaatctaattgctcctaggggttacttcaaaatacgaagtgggaaaagctcaataatgagatcagatgcacccattggcaaatgcaagtctgaatgcatcactccaaatggaagcattcccaatgacaaaccattccaaaatgtaaacaggatcacatacggggcctgtcccagatatgttaagcatagcactctaaaattggcaacaggaatgcgaaatgtaccagagaaacaaactagaggcatatttggcgcaatagcgggtttcatagaaaatggttgggagggaatggtggatggttggtacggtttcaggcatcaaaattctgagggaagaggacaagcagcagatctcaaaagcactcaagcagcaatcgatcaaatcaatgggaagctgaatcgattgatcgggaaaaccaacgagaaattccatcagattgaaaaagaattctcagaagtagaaggaagaattcaggaccttgagaaatatgttgaggacactaaaatagatctctggtcatacaacgcggagcttcttgttgccctggagaaccaacatacarttgatctaactgactcagaaatgaacaaactgtttgaaaaaacaaagaagcaactgagggaaaatgctgaggatatgggaaatggttgtttcaaaatataccacaaatgtgacaatgcctgcataggatcaataagaaatggaacttatgaccacaatgtgtacagggatgaagcattaaacaaccggttccagatcaagggagttgagctgaagtcagggtacaaagattggatcctatggatttcctytgccatatcatgttttttgctttgtgttgctttgttggggttcatcatgtgggcctgccaaaagggcaacattaggtgcaacatttgcatttga
In [7]:
features = record['GBSeq_feature-table'] # extract all the features
for feature in features: # loop over features and print feature key and feature location
    print(feature['GBFeature_key'] + ": " + feature['GBFeature_location'])
source: 1..1701
gene: 1..1701
CDS: 1..1701
mat_peptide: 49..1035
mat_peptide: 1036..1698

Running search queries through Entrez

So far we have only downloaded specific records from Entrez. In addition to just downloading records, however, we can also run searches directly from python. Any query that you can do on the Entrez website (https://www.ncbi.nlm.nih.gov/) can also be executed directly from python. This allows you to find a large number of records all at once and process them in an automated fashion.

For example, below we will see how to automatically run and retrieve the results for the following search term: "influenza a virus texas h1n1 hemagglutinin complete cds". A direct link to the search results on the Entrez website is here:
https://www.ncbi.nlm.nih.gov/nuccore/?term=influenza+a+virus+texas+h1n1+hemagglutinin+complete+cds

(Note that in the following Python code, we limit the number of search hits returned to the first 10.)

In [8]:
# let's do a search for influenza H1N1 viruses from Texas
handle = Entrez.esearch(db="nucleotide",  # database to search
                        term="influenza a virus texas h1n1 hemagglutinin complete cds",  # search term
                        retmax=10  # number of results that are returned
                        )
record = Entrez.read(handle)
handle.close()

gi_list = record["IdList"] # list of genbank identifiers found
print(gi_list)
['1597259311', '1597259287', '1597258925', '1597258900', '1591440116', '1591440105', '1591440078', '1590855198', '1590855179', '1590836043']

Note that even though NCBI is phasing out sequence GI numbers, for now the esearch() function still returns GI numbers (numerical sequence identifiers without version information).

We can download all the genbank records in the list of identifiers using the Entrez.efetch() function. This function provides us with a handle that needs to be processed with SeqIO.parse(). (We used SeqIO.read() previously, which reads a single record. SeqIO.parse() reads multiple records. See here for details.)

In [9]:
handle = Entrez.efetch(db="nucleotide", id=gi_list, rettype="gb", retmode="text")
records = SeqIO.parse(handle, "genbank")

for record in records:
    print(record.description)
    
handle.close() # important, close the handle only after you have iterated over the records. Otherwise you will get an error!
Influenza A virus (A/Texas/12/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/07/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/06/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/05/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/157/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/143/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/02/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/148/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/147/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/156/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds

As another example, let's search the "pubmed" database (database of scientific publications) for papers from 2015 written by "Wilke CO". The exact search term we need to use is the following: "Wilke CO[Author] AND 2015[Date - Publication]"

You can click here to see the result from that search online.

In [10]:
handle = Entrez.esearch(db="pubmed",  # database to search
                        term="Wilke CO[Author] AND 2015[Date - Publication]",  # search term
                        retmax=10  # number of results that are returned
                        )
record = Entrez.read(handle)
handle.close()

# search returns PubMed IDs (pmids)
pmid_list = record["IdList"]
print(pmid_list)
['26770819', '26468068', '26430238', '26397960', '26355089', '26275208', '26020774', '25999509', '25787027', '25737813']

Just like with genes and genomes, we can download the records corresponding to these identifiers. They are references. We'll print the author(s), title, and reference (source).

In [11]:
from Bio import Medline
handle = Entrez.efetch(db="pubmed", id=pmid_list, rettype="medline", retmode="text")
records = Medline.parse(handle)
for record in records:
    print(record['AU']) # author list
    print(record['TI']) # title
    print(record['SO']) # source (reference)
    print()
handle.close()
['Meyer AG', 'Spielman SJ', 'Bedford T', 'Wilke CO']
Time dependence of evolutionary metrics during the 2009 pandemic influenza virus outbreak.
Virus Evol. 2015 Jan;1(1). doi: 10.1093/ve/vev006. Epub 2015 Jan 1.

['Meyer AG', 'Wilke CO']
The utility of protein structure as a predictor of site-wise dN/dS varies widely among HIV-1 proteins.
J R Soc Interface. 2015 Oct 6;12(111):20150579. doi: 10.1098/rsif.2015.0579.

['Wilke CO']
Evolutionary paths of least resistance.
Proc Natl Acad Sci U S A. 2015 Oct 13;112(41):12553-4. doi: 10.1073/pnas.1517390112. Epub 2015 Oct 1.

['Spielman SJ', 'Wilke CO']
Pyvolve: A Flexible Python Module for Simulating Sequences along Phylogenies.
PLoS One. 2015 Sep 23;10(9):e0139047. doi: 10.1371/journal.pone.0139047. eCollection 2015.

['Kerr SA', 'Jackson EL', 'Lungu OI', 'Meyer AG', 'Demogines A', 'Ellington AD', 'Georgiou G', 'Wilke CO', 'Sawyer SL']
Computational and Functional Analysis of the Virus-Receptor Interface Reveals Host Range Trade-Offs in New World Arenaviruses.
J Virol. 2015 Nov;89(22):11643-53. doi: 10.1128/JVI.01408-15. Epub 2015 Sep 9.

['Houser JR', 'Barnhart C', 'Boutz DR', 'Carroll SM', 'Dasgupta A', 'Michener JK', 'Needham BD', 'Papoulas O', 'Sridhara V', 'Sydykova DK', 'Marx CJ', 'Trent MS', 'Barrick JE', 'Marcotte EM', 'Wilke CO']
Controlled Measurement and Comparative Analysis of Cellular Components in E. coli Reveals Broad Regulatory Changes in Response to Glucose Starvation.
PLoS Comput Biol. 2015 Aug 14;11(8):e1004400. doi: 10.1371/journal.pcbi.1004400. eCollection 2015 Aug.

['Meyer AG', 'Wilke CO']
Geometric Constraints Dominate the Antigenic Evolution of Influenza H3N2 Hemagglutinin.
PLoS Pathog. 2015 May 28;11(5):e1004940. doi: 10.1371/journal.ppat.1004940. eCollection 2015 May.

['Kachroo AH', 'Laurent JM', 'Yellman CM', 'Meyer AG', 'Wilke CO', 'Marcotte EM']
Evolution. Systematic humanization of yeast genes reveals conserved functions and genetic modularity.
Science. 2015 May 22;348(6237):921-5. doi: 10.1126/science.aaa0769.

['Echave J', 'Jackson EL', 'Wilke CO']
Relationship between protein thermodynamic constraints and variation of evolutionary rates among sites.
Phys Biol. 2015 Mar 19;12(2):025002. doi: 10.1088/1478-3975/12/2/025002.

['Spielman SJ', 'Kumar K', 'Wilke CO']
Comprehensive, structurally-informed alignment and phylogeny of vertebrate biogenic amine receptors.
PeerJ. 2015 Feb 17;3:e773. doi: 10.7717/peerj.773. eCollection 2015.

Problems

Problem 1

Use the following code to download the genbank record KT220438 in XML and parse it with the Entrez.parse() function:

In [12]:
# Download sequence record for genbank id KT220438 (HA from influenza A)
handle = Entrez.efetch(db="nucleotide", id="KT220438", rettype="gb", retmode="xml")
parsed = Entrez.parse(handle)
record = list(parsed)[0] # Convert the parsed contents into a list and take element 0.
handle.close()

Then:

(a) Print out the value for the key GBSeq_definition.

(b) Find the CDS feature and print out all its qualifiers. Note that qualifiers are provided under the keyword GBFeature_quals.

In [13]:
# Problem 1a

print(record['GBSeq_definition'])
Influenza A virus (A/NewJersey/NHRC_93219/2015(H3N2)) segment 4 hemagglutinin (HA) gene, complete cds
In [14]:
# Problem 1b

features = record['GBSeq_feature-table'] # extract all the features
for feature in features: # loop over features and find CDS feature
    if feature['GBFeature_key']=='CDS':
        CDS_feature = feature
        break

qualifiers = CDS_feature['GBFeature_quals']
for q in qualifiers:
    print(q['GBQualifier_name'] + ": " + q['GBQualifier_value'])
gene: HA
function: receptor binding and fusion protein
codon_start: 1
transl_table: 1
product: hemagglutinin
protein_id: AKQ43545.1
translation: MKTIIALSYILCLVFAQKIPGNDNSTATLCLGHHAVPNGTIVKTITNDRIEVTNATELVQNSSIGEICDSPHQILDGENCTLIDALLGDPQCDGFQNKKWDLFVERSKAYSNCYPYDVPDYASLRSLVASSGTLEFNNESFNWTGVTQNGTSSACIRRSSSSFFSRLNWLTHLNYTYPALNVTMPNNEQFDKLYIWGVHHPGTDKDQIFLYAQSSGRITVSTKRSQQAVIPNIGSRPRIRDIPSRISIYWTIVKPGDILLINSTGNLIAPRGYFKIRSGKSSIMRSDAPIGKCKSECITPNGSIPNDKPFQNVNRITYGACPRYVKHSTLKLATGMRNVPEKQTRGIFGAIAGFIENGWEGMVDGWYGFRHQNSEGRGQAADLKSTQAAIDQINGKLNRLIGKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTXDLTDSEMNKLFEKTKKQLRENAEDMGNGCFKIYHKCDNACIGSIRNGTYDHNVYRDEALNNRFQIKGVELKSGYKDWILWISXAISCFLLCVALLGFIMWACQKGNIRCNICI

Problem 2:

(a) Use an Entrez esearch query of the pubmed database to find out how many publications "Spielman SJ" wrote in 2015.

(b) From the results of part (a), compile a combined list of all the co-authors of "Spielman SJ" in 2015.

In [15]:
# Problem 2a

handle = Entrez.esearch(db="pubmed",  # database to search
                        term="Spielman SJ[Author] AND 2015[Date - Publication]",  # search term
                        retmax=10  # number of results that are returned
                        )
record = Entrez.read(handle)
handle.close()

# search returns PubMed IDs (pmids)
pmid_list = record["IdList"]
print("Publications found:", pmid_list)
print("Number of publications:", len(pmid_list))
Publications found: ['26770819', '26397960', '25737813', '25576365']
Number of publications: 4
In [16]:
# Problem 2b

from Bio import Medline
handle = Entrez.efetch(db="pubmed", id=pmid_list, rettype="medline", retmode="text")
records = Medline.parse(handle)
coauthors = [] # start with empty list of coauthors
for record in records:
    au_list = record['AU']
    for author in au_list:
        if author != "Spielman SJ" and author not in coauthors:
            coauthors.append(author)
handle.close()
print('Co-authors of "Spielman SJ" in 2015:')
for author in coauthors:
    print(" ", author)
Co-authors of "Spielman SJ" in 2015:
  Meyer AG
  Bedford T
  Wilke CO
  Kumar K

If this was easy

Problem 3:

For larger searches, NCBI wants you to use the WebEnv method to download all your search results. This is explained in the Biopython tutorial here. Rewrite the influenza search from the section "Running search queries on through Entrez" in such a way that it uses the WebEnv method. For this downloading method, it makes sense to write all the results into a file and then read the results back in.

In [17]:
handle = Entrez.esearch(db="nucleotide",  # database to search
                        term="influenza a virus texas h1n1 hemagglutinin complete cds",  # search term
                        usehistory="y" # this switches on the WebEnv method
                        )
results = Entrez.read(handle)
handle.close()

# Because we ran the search with usehistory="y", the results now contain two additional
# pieces of information, the WebEnv session cookie and the QueryKey:
webenv = results["WebEnv"]
query_key = results["QueryKey"]

# We also get the number of search results:
count = int(results["Count"])

# We now download the results and store them in a local file
# Downloading happens in batches, let's download 20 results at a time:
batch_size = 20
out_handle = open("influenza_HA.gb", "w")
for start in range(0, count, batch_size):
    end = min(count, start+batch_size)
    print("Downloading records %i through %i" % (start+1, end))
    fetch_handle = Entrez.efetch(db="nucleotide", rettype="gb", retmode="text",
                                 retstart=start, retmax=batch_size,
                                 webenv=webenv, query_key=query_key)
    data = fetch_handle.read()
    fetch_handle.close()
    out_handle.write(data)
out_handle.close()
Downloading records 1 through 20
Downloading records 21 through 40
Downloading records 41 through 60
Downloading records 61 through 80
Downloading records 81 through 100
Downloading records 101 through 120
Downloading records 121 through 140
Downloading records 141 through 160
Downloading records 161 through 180
Downloading records 181 through 200
Downloading records 201 through 220
Downloading records 221 through 240
Downloading records 241 through 260
Downloading records 261 through 280
Downloading records 281 through 300
Downloading records 301 through 320
Downloading records 321 through 340
Downloading records 341 through 360
Downloading records 361 through 380
Downloading records 381 through 400
Downloading records 401 through 420
Downloading records 421 through 440
Downloading records 441 through 460
Downloading records 461 through 480
Downloading records 481 through 500
Downloading records 501 through 520
Downloading records 521 through 540
Downloading records 541 through 560
Downloading records 561 through 580
Downloading records 581 through 600
Downloading records 601 through 620
Downloading records 621 through 640
Downloading records 641 through 660
Downloading records 661 through 680
Downloading records 681 through 700
Downloading records 701 through 720
Downloading records 721 through 740
Downloading records 741 through 744
In [18]:
# We can now open the file we created and parse all the records we have previously stored
in_handle = open("influenza_HA.gb", "r")
records = SeqIO.parse(in_handle, "genbank")

for record in records:
    print(record.description)
    
in_handle.close()
Influenza A virus (A/Texas/12/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/07/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/06/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/05/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/157/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/143/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/02/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/148/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/147/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/156/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/154/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/A01785919/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7939/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7923/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7921/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7920/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7918/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7917/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7916/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7914/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7913/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7911/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7910/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7908/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7906/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7904/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7902/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7901/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7898/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7895/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7949/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7947/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7938/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7937/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7936/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7931/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7930/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7929/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7928/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7926/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7925/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7924/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7912/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7907/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7905/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7903/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7900/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7899/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7897/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7896/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7894/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7893/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/152/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/151/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/140/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/A01785906/2019(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7430/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7429/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7389/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/144/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/145/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7670/2018()) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7668/2018()) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7666/2018()) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7664/2018()) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7662/2018()) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7658/2018()) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7656/2018()) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7192/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/142/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/143/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7050/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7045/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7003/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/6992/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/6988/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/6984/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/6980/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/6979/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/6974/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/140/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/134/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/138/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/135/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/141/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/134/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/136/2018(H3N2)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/7504/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/139/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/138/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/A01785737/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/6843/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/A01785599/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/A01785528/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/06/2018(H1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/403/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/402/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/401/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/400/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/93/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/blue-winged teal/Texas/UGAI15-6706/2015(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/blue-winged teal/Texas/UGAI15-5632/2015(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/6417/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/88/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/85/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/84/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/81/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/75/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/74/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/72/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/70/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/01/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/69/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/316/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/314/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/311/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/310/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/08/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/03/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/65/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/323/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/322/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/320/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/318/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/A01785406/2018(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/A01785377/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/316/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/314/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/310/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/311/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/300/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/295/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/294/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/295/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/294/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/A02221802/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/6180/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/6178/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/6176/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/6175/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/6170/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/281/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/28/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/160/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/13/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/157/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/150/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/146/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/144/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/143/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/140/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/115/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/114/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/113/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/112/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/111/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/110/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/109/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/108/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/107/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/106/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/105/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/104/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/103/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/102/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/101/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/100/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/100/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/99/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/98/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/94/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/93/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/79/2015(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/135/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/127/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/122/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/47/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/137/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/132/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/128/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/125/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/117/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/116/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/61/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/55/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/55/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/53/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/51/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/51/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/50/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/38/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/87/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/84/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/81/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/37/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/75/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/73/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/72/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/70/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/69/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/68/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/18/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/16/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/13/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/04/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/03/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/02/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/46/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/40/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/39/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/32/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/31/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/28/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/27/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/27/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/25/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/24/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/20/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/10/2016(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/78/2015(H1N1)) clone 3025624445_284379_v1_4 segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/02/2016(H1N1)) clone 3000415970_284773_v1_4 segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/03/2016(H1N1)) clone 3000415977_284780_v1_4 segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/21/2016(H1N1)) clone 3025624773_ZZYPO19U_v1_4 segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/05/2016(H1N1)) clone 3000416250_285448_v1_4 segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/07/2016(H1N1)) clone 3000416252_285445_v1_4 segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/09/2016(H1N1)) clone 3025624576_285571_v1_4 segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/27/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/09/2014(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/11/2014(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/10/2014(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/16/2014(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/02/2014(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/96/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/82/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/42/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/117/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/08/2014(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/12/2014(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/07/2014(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/36/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/26/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/89/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/86/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/88/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/85/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/87/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/07/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/15/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/13/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/106/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/99/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/10/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/90/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/07/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/24/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/49/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/48/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/24/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/26/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/23/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/27/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/31/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/35/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/33/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/34/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/23/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/23/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/29/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/18/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/05/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/04/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/47/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/36/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/30/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/47/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/22/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/21/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/37/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/02/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/37/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/36/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/79/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/08/2011(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/07/2011(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/14/2010(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/01/2011(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/01/2011(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/04/2011(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/09/2011(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/10/2011(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/48/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/57/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/76/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/77/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/64/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/58/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/62/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/45/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/73/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/59/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/60/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/72/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/56/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/61/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/65/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/63/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/66/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/66/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/01/2010(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/67/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/67/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/75/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/71/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/70/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/69/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/03/2010(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/42/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/01/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/38/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/01/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/36/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/32/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/35/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/23/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/04/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/05/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/29/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/27/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/23/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/16/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/15/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/18/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/17/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/11/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/09/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/10/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/08/2008(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/74/2007(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/78/2007(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/70/2007(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/41/2007(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/32/2007(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/31/2007(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/06/2007(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/05/2007(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/194/2016(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/89/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/63/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/41/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/101/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/83/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/103/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/86/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/81/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/79/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/A02214607/2017(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/69/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/63/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/58/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/57/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/56/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/44/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/42/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/41/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/51/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/49/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/43/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/31/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/37/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/39/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/31/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/34/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/22/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/21/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/08/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/18/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/06/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/08/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/04/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/200/2016(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/199/2016(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/03/2017(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/200/2016(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/199/2016(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/194/2016(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/5485/2016(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/5484/2016(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/5483/2016(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/5028/2016(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/blue-winged teal/Texas/AI12-4549/2012(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/13TOSU0035/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/13TOSU0050/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/A01432097/2012(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/UR06-0422/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/reassortant/IVR-148(Brisbane/59/2007 x Texas/1/1977)(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/13TOSU0041/2013(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/A01410177/2014(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3689/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3688/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3687/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3686/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3685/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3684/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3683/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3682/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3681/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3680/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3679/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3678/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3677/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3676/2014(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3675/2014(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3674/2014(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3673/2014(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3672/2014(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3671/2014(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3670/2014(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3669/2014(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3668/2014(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3667/2014(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3666/2014(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3665/2014(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3664/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3663/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3662/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3661/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3660/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3571/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3570/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3520/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3515/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3513/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3512/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3510/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3509/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3508/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3475/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3474/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/3425/2013(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/00525/2005(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/SG1187/2004(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/SG1187/2004(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/SG1162/2003(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/00195/2003(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/SG1162/2003(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/00867/2005(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/00244/2004(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/SG1380/2011(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/01657/2007(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/SG1271/2007(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/01522/2007(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/mallard/Alberta/211/1998(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/ruddy turnstone/Delaware Bay/123/1994(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/JMM_53/2012(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/JMM_52/2012(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/36-JY2/1991(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/northern pintail/Texas/421716/2001(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/reassortant/IVR-148(Brisbane/59/2007 x Texas/1/1977)(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/reassortant/IVR-148(Brisbane/59/2007 x Texas/1/1977)(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/reassortant/IVR-148(Brisbane/59/2007 x Texas/1/1977)(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/A01104003/2011(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/swine/Texas/A01104004/2011(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/JMS368/2009(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/JMS360/2009(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/45034240/2009(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/JMS403/2010(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/JMS401/2010(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/JMS400/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS367/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS366/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS415/2010(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/JMS414/2010(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/JMS413/2010(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/JMS412/2010(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/JMS411/2010(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/JMS410/2010(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/JMS409/2010(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/JMS408/2010(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/JMS407/2010(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/JMS406/2010(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/JMS405/2010(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/JMS404/2010(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/JMS402/2010(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/JMS399/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS398/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS397/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS395/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS394/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS393/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS392/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS391/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS390/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS389/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS388/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS387/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS386/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS385/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS384/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS383/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS382/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS381/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS380/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS379/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS373/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS372/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS371/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS370/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS369/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS365/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS364/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS363/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS362/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS361/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS359/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS358/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS356/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/46241654/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/46240925/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/46233104/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/46224042/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/46223582/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/46223444/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/46222134/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/46221665/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/46214103/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/46201823/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/46193632/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS378/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS377/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS376/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/JMS375/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/46193311/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/46192760/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/461917783/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/46182018/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/46181292/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/46181235/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/46172734/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/46172731/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/44312415/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42211898/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42202026/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42132413/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42254309/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42103399/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42091791/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/43292238/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42151049/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42102708/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42221280/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/43143450/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42201798/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42191647/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/44302551/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/44301765/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42163291/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42303371/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42121926/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42281289/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Brownsville/37OS/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45061755/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45122886/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45032753/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45121004/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45052569/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45113911/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45032708/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45102952/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45062346/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45024243/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/44151841/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45042604/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45023717/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/44152535/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45132202/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45131305/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45072128/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45072656/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45071344/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45131576/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45072273/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45043852/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45021632/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/44282651/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45101422/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45103759/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45101424/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45091417/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45091405/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45091402/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45140902/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45093214/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45083819/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45071524/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45010998/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/44313703/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45093670/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45093846/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45103259/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45062584/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45061670/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/44302167/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/44302533/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45034157/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45033567/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45033774/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45132647/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45131774/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45132788/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45122722/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45122774/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45132214/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45113882/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45103998/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45104048/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45103737/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45104026/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45113371/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45122036/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45122282/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45122033/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45121606/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45122369/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45130742/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45122538/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45120922/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/43122467/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42192947/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/43132503/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42291877/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42142537/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42163295/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/43200999/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/43272683/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/45062633/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42114261/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42113095/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42173957/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42201824/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42122969/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42191653/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42123701/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/43011033/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/43242018/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/42152486/2009(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/47/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/29/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/30/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/05/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/04/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/20/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/39/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/reassortant/IDCDC-RG22(New York/18/2009 x Puerto Rico/8/1934)(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/reassortant/IDCDC-RG20(Texas/05/2009 x Puerto Rico/8/1934)(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/reassortant/IDCDC-RG18(Texas/05/2009 x New York/18/2009 x Puerto Rico/8/1934)(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Italy/85/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/37/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/36/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/34/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/31/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/32/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/11/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/17/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/28/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/10/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/04/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/reassortant/IDCDC-RG15(Texas/05/2009 x Puerto Rico/8/1934)(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/12/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/05/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/09/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/08/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/19/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/22/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/23/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/15/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/07/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/08/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/09/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/06/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/04/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/04/2009(H1N1)) segment 4 hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/WRAIR1118P/2009(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/WRAIR1115P/2009(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/WRAIR1053P/2009(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/X-113(X-31B-Texas/36/1991)(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/36/91(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Wilson-Smith/1933(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/USSR/90/1977(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Texas/36/1991(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/Taiwan/01/1986(H1N1)) hemagglutinin (HA) gene, complete cds
Influenza A virus (A/New Caledonia/20/1999(H1N1)) hemagglutinin (HA) gene, complete cds
Synthetic construct derived from Influenza A virus (A/Texas/15/2009(H1N1)) hemagglutinin gene, complete cds
Influenza A virus (A/Texas/36/91(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0323/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0156/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/36/1991(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0250/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0026/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0217/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0038/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0420/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0503/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0039/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0133/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0216/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0445/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0204/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0357/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0342/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0176/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0305/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0157/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0582/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0308/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0309/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0397/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0468/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0270/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0444/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0175/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0271/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0542/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0540/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0306/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0195/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0303/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0174/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0359/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0193/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0526/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0461/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0563/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0196/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0467/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0203/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0398/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0025/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0380/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0502/2007(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/UR06-0012/2006(H1N1)) segment 4, complete sequence
Influenza A virus (A/Baylor/4052/1981(H1N1)) segment 4, complete sequence
Influenza A virus (A/Texas/2922-3/1986(H1N1)) segment 4, complete sequence
Influenza A virus (A/Lackland/7/1978(H1N1)) segment 4, complete sequence
Influenza A virus (A/Lackland/3/1978(H1N1)) segment 4, complete sequence