GBVRL1.SEQ Genetic Sequence Data Bank 15 February 2001 NCBI-GenBank Flat File Release 122 Viral Sequences (Part 1) 75325 loci, 64018824 bases, from 75325 reported sequences LOCUS A15H9FIB 1228 bp DNA VRL 29-JAN-1996 DEFINITION Adenovirus type 15H9 (Morrison) fibre gene, nonenveloped DNA. ACCESSION X76706 VERSION X76706.1 GI:436054 KEYWORDS fiber gene; fiber protein. SOURCE Human adenovirus type 15. ORGANISM Human adenovirus type 15 Viruses; dsDNA viruses, no RNA stage; Adenoviridae; Mastadenovirus. REFERENCE 1 (bases 1 to 1228) AUTHORS Pring-Akerblom,P. TITLE Direct Submission JOURNAL Submitted (08-DEC-1993) P. Pring-Akerblom, Medizinische Hochschule Hannover, Nationales Referenzzentrum f.Adenoviren, Institut f. Virologie & Seuchenhygiene, Konstanty-Gutschow-Str. 8, 30625 Hannover, FRG REFERENCE 2 (bases 1 to 1228) AUTHORS Pring-Akerblom,P. and Adrian,T. TITLE Characterization of adenovirus subgenus D fiber genes JOURNAL Virology 206 (1), 564-571 (1995) MEDLINE 95133193 FEATURES Location/Qualifiers source 1..1228 /organism="Human adenovirus type 15" /strain="intermediate" /isolate="morrison" /db_xref="taxon:28276" /map="0.88-0.92 units" gene 50..1138 /gene="fiber gene" CDS 50..1138 /gene="fiber gene" /codon_start=1 /product="fiber protein" /protein_id="CAA54127.1" /db_xref="GI:436055" /db_xref="SWISS-PROT:P36846" /translation="MSKRLRVEDDFNPVYPYGYARNQNIPFLTPPFVSSDGFQNFPPG VLSLKLADPIAIVNGNVSLKVGGGLTLQDGTGKLTVNADPPLQLTNNKLGIALDAPFD VIDNKLTLLAGHGLSIITKETSTLPGLRNTLVVLTGKGIGTESTDNGGTVCVRVGEGG GLSFNNDGDLVAFNKKEDKRTLWTTPDTSPNCKIDQDKDSKLTLVLTKCGSQILANVS LIVVDGKYKIINNNTQPALKGFTIKLLFDENGVLMESSNLGKSYWNFRNENSIMSTAY EKAIGFMPNLVAYPKPTAGSKKYARDIVYGNIYLGGKPDQPVTIKTTFNQETGCEYSI TFDFSWAKTYVNVEFETTSFTFSYIAQE" misc_feature 596..1135 /gene="fiber gene" /note="hyper variable region" BASE COUNT 392 a 251 c 231 g 354 t ORIGIN 1 aagggatgtc aaattcctgg tccacaattt tcattgtctt ccctctcaga tgtcaaagag 61 gctccgggtg gaagatgact tcaaccccgt ctacccctat ggctacgcgc ggaatcagaa 121 tatccccttc ctcactcccc 
cctttgtctc ctccgatgga ttccaaaact tcccccctgg 181 ggtcctgtca ctcaaactag ctgacccaat agccatcgtc aatgggaatg tctcactcaa 241 agtgggaggg ggtctcactt tgcaagatgg aactggaaaa ctaacagtca atgctgatcc 301 acctttgcaa cttacaaaca acaaattagg gattgctttg gacgctccat ttgatgttat 361 agataataaa ctcacattgt tagcgggcca tggcttgtct attataacaa aagaaacatc 421 aacactgcct ggcttgagga atactcttgt agtattaact ggaaagggta ttggaacaga 481 atcaacagat aatggcggaa cggtatgtgt tagagttgga gaaggtggcg gcttatcatt 541 taataatgat ggagacttgg tagcatttaa taaaaaagaa gataagcgca ccctatggac 601 aactccagac acatctccaa attgcaagat tgatcaggat aaggactcta agttaactct 661 ggtccttaca aagtgtggaa gtcaaatatt ggctaatgtg tcattaattg tcgtagatgg 721 taagtacaaa attatcaata acaatactca accagctctc aaaggattta ccattaaatt 781 attgtttgat gaaaatggag tacttatgga atcttcaaat cttggtaaat catattggaa 841 ctttagaaat gaaaattcaa ttatgtcaac agcttatgaa aaagctattg gattcatgcc 901 taatttggta gcctatccaa aacctaccgc tggctctaaa aaatatgcaa gagatatagt 961 ttatggaaac atctaccttg gtggaaagcc agatcaacca gtaaccatta aaactacctt 1021 taatcaggaa actggatgtg aatattctat cacatttgat tttagttggg ccaagactta 1081 tgtaaatgtt gaatttgaaa caacctcttt taccttttcc tatatcgccc aagaatgaaa 1141 gaccaataaa cgtgtttttc atttcaaaat tttcatgtat ctttattgat ttttacacca 1201 gcacgggtag tcagtctccc accaccag // LOCUS A15H9HEX 1528 bp DNA VRL 08-AUG-1995 DEFINITION Adenovirus type 15H9 (Morrison) hexon gene, nonenveloped DNA. ACCESSION X76707 VERSION X76707.1 GI:436056 KEYWORDS hexon gene; hexon protein. SOURCE Human adenovirus type 15. ORGANISM Human adenovirus type 15 Viruses; dsDNA viruses, no RNA stage; Adenoviridae; Mastadenovirus. REFERENCE 1 (bases 1 to 1528) AUTHORS Pring-Akerblom,P. JOURNAL Unpublished REFERENCE 2 (bases 1 to 1528) AUTHORS Pring-Akerblom,P. TITLE Direct Submission JOURNAL Submitted (08-DEC-1993) P. Pring-Akerblom, Medizinische Hochschule Hannover, Nationales Referenzzentrum f.Adenoviren, Institut f. Virologie & Seuchenhygiene, Konstanty-Gutschow-Str. 
8, 30625 Hannover, FRG FEATURES Location/Qualifiers source 1..1528 /organism="Human adenovirus type 15" /strain="intermediate" /isolate="morrison" /db_xref="taxon:28276" /map="0.51-0.60 genome%" misc_feature 1..1528 /gene="H9 hexon gene" /note="hypervariable region" gene 1..1528 /gene="H9 hexon gene" CDS <1..>1528 /gene="H9 hexon gene" /codon_start=1 /product="hexon protein" /protein_id="CAA54128.1" /db_xref="GI:939856" /db_xref="SPTREMBL:Q64743" /translation="FDIRGVLDRGPSFKPYSGTAYNSLAPKGAPNSSQWEQKKANAGD QKETHTYGVAPMGGENITISGLQIGTDTTNGKQDPIYANKLYQPEPQVGEENWQETEA FYGGRALKKETKMKPCYGSFARPTNEKGGQAKLRDPEKSQEDFDIDLAFFDTPGGTLT GGGTEYKADIVMCTENVNLETPDTHVVYKPGKDDDSSEINLVQQSMPNRPNYIGFRDN FVGLMYYNSTGNMGVLAGQASQLNAVVDLQDRNTELSYQLLLDSLGDRTRYFSMWNSA VDSYDPDVRIIENHGVEDELPNYCFPLDGSGTNAAYEGVKVKNGQDGVQESEWEKDTN VADRNQICKGNIYAMEINLQANLWKSFLYSNVALYLPDSYKYTPANVTLPTNTNTYEY MNGRVVEKSLVDAYINIGARWSLDPMDNVNPFNHHRNAGLRYRSMLLGNGRYVPFHIQ VPQKFFAIKNLLLLPGSYTYEWNFRKDVNMYLQSSLGNDLRVDGASVRFDSVNLYATF F" BASE COUNT 445 a 407 c 366 g 310 t ORIGIN 1 tttgacatcc gcggcgtcct ggaccgcggt cccagcttca aaccctactc gggcacggcc 61 tacaacagtc tggcccccaa gggcgccccc aactccagtc agtgggaaca gaaaaaggcc 121 aatgctggag atcaaaagga aacacatact tatggtgtag ctcctatggg tggagaaaac 181 attacaatta gcggtttgca aattggaaca gatactacaa atggcaaaca agacccgata 241 tatgctaata agctgtatca accagagcct caagtaggag aagaaaactg gcaggaaaca 301 gaagccttct atggaggaag ggctcttaaa aaggaaacca agatgaaacc atgctatggc 361 tcatttgcca gacccacaaa tgaaaaagga ggacaggcaa aactaagaga ccctgaaaaa 421 agtcaagaag attttgacat agacctagca ttctttgata ctccgggagg aactttaaca 481 ggtggtggaa cggaatacaa agcagacatt gttatgtgca ctgaaaatgt taatcttgaa 541 accccggaca cccacgtggt gtataaacca ggcaaagatg atgacagttc agaaatcaac 601 ttggttcagc agtccatgcc caacagacct aactacatcg gcttcaggga caactttgtg 661 ggtctcatgt actacaacag cactggcaac atgggtgtgc tggccggtca ggcttctcag 721 ttgaatgctg tggtcgactt gcaagacaga aacacagagc tgtcttacca actcttgcta 781 gattctctgg gcgacagaac caggtacttt agcatgtgga actctgcggt 
ggacagctat 841 gatcccgatg tcaggatcat tgagaatcac ggtgtggaag atgaacttcc caactattgc 901 ttcccattgg atgggtctgg caccaatgct gcttatgaag gtgtaaaagt taaaaatgga 961 caagatgggg tacaagagag cgaatgggaa aaagacacca atgtggcaga tcgaaaccaa 1021 atatgcaagg gcaacatcta cgccatggag atcaacctcc aggccaacct gtggaagagt 1081 tttctgtact cgaatgtggc cctgtacctg cctgactcct acaagtacac gccggccaac 1141 gtcactctgc ccaccaacac caacacctac gagtacatga acggccgtgt ggtggaaaaa 1201 tcgctggtgg acgcctacat caacatcggc gcccgctggt cgttggaccc catggacaac 1261 gtcaacccct tcaaccacca ccgcaatgcg ggcctgcgct accgctccat gttgctaggc 1321 aacggccgct acgtgccctt ccacatccaa gtgccccaaa agttctttgc catcaagaac 1381 ctgctcctgc tcccgggctc ctacacctac gagtggaact tccgcaagga cgtcaacatg 1441 tacctgcaga gttccctcgg aaacgatctg cgcgtcgacg gcgcctccgt ccgcttcgac 1501 agcgtcaacc tgtacgccac cttcttcc // LOCUS A1MVRNA2 2593 bp RNA VRL 17-FEB-1997 DEFINITION Alfalfa mosaic virus (A1M4) RNA 2. ACCESSION X01572 J02002 K02702 VERSION X01572.1 GI:58419 KEYWORDS unidentified reading frame. SOURCE Alfalfa mosaic virus. ORGANISM Alfalfa mosaic virus Viruses; ssRNA positive-strand viruses, no DNA stage; Bromoviridae; Alfamovirus. REFERENCE 1 (bases 1 to 2593) AUTHORS Cornelissen,B.J., Brederode,F.T., Veeneman,G.H., van Boom,J.H. and Bol,J.F. TITLE Complete nucleotide sequence of alfalfa mosaic virus RNA 2 JOURNAL Nucleic Acids Res. 11 (10), 3019-3025 (1983) MEDLINE 83220723 COMMENT Data kindly reviewed (30-JAN-1986) by J.F. Bol. 
FEATURES Location/Qualifiers source 1..2593 /organism="Alfalfa mosaic virus" /virion /db_xref="taxon:12321" /note="RNA2" CDS 55..2427 /codon_start=1 /product="89.7 kd protein" /protein_id="CAA25728.1" /db_xref="GI:58420" /db_xref="SWISS-PROT:P03593" /translation="MFTLLRCLGFGVNEPTNTSSSEYVPEYSVEEISNEVAELDSVDP LFQCYKHVFVSLMLVRKMTQAAEDFLESFGGEFDSPCCRVYRLYRHFVNEDDAPAWAI PNVVNEDSYDDYAYLREELDAIDSSFELLNEERELSEITDRLNALRFFPVSKTEALPV ANVQEVKLISETYQLLMTFINYSDENIPSEMPAPLLDELGMLPEELGPLNEIEDIKPV AAPITLLSEFRASDNAKPLDIVEIIPDVSPTKPYEAVISGNDWMTLGRIIPTTPVPTI RDVFFSGLSRHGSPEVIQNALDEFLPLHHSIDDKYFQEWVETSDKSLDVDPCRIDLSV FNNWQSSENCYEPRFKTGALSTRKGTQTEALLAIKKRNMNVPNLGQIYDVNSVANSVV NKLLTTVIDPDKLCMFPDFISEGEVSYFQDYIVGKNPDPELYSDPLGVRSIDSYKHMI KSVLKPVEDNSLHLERPMPATITYHDKDIVMSSSPIFLAAAARLMLILRDKITIPSGK FHQLFSIDAEAFDASFHFKEIDFSKFDKSQNELHHLIQERFLKYLGIPNEFLTLWFNA HRKSRISDSKNGVFFNVDFQRRTGDALTYLGNTIVTLACLCHVYDLMDPNVKFVVASG DDSLIGTVEELPRDQEFLFTTLFNLEAKFPHNQPFICSKFLITMPTTSGGKVVLPIPN PLKLLIRLGSKKVNADIFDEWYQSWIDIIGGFNDHHVIRCVAAMTAHRYLRRPSLYLE AALESLGKIFAGKTLCKECLFNEKHESNVKIKPRRVKKSHSDARSRARRA" variation 1431 /note="c in pAL2-1; t in pAL2-[21,41]" BASE COUNT 736 a 533 c 547 g 777 t ORIGIN 1 gtttttatct tttcgcgatt gaaaagataa gtttttcagt ttaatctttt caatatgttc 61 actcttttga gatgtctcgg attcggtgtt aatgaaccta ctaacacttc ctcatcagag 121 tatgttcccg agtattccgt tgaagagatt tccaacgaag tcgctgaact cgattcagtg 181 gatccattat tccaatgtta caaacatgtt tttgtatcat tgatgctcgt aagaaagatg 241 actcaagctg ccgaagactt cctcgagagt tttgggggag aattcgatag cccttgttgt 301 agggtttacc gtctttatag acattttgtt aatgaagacg atgcacccgc ttgggccata 361 ccgaatgtcg tgaatgaaga ttcttacgac gattatgcct acctccgaga ggagttagat 421 gccatagaca gctcttttga gttgctaaac gaagagcgtg agttatcgga aattacggac 481 agactcaacg ctttaagatt tttccctgtt tctaaaacag aagcgctacc agtggcgaat 541 gtccaagagg tcaaactcat ttctgagaca taccagttat tgatgacctt tattaactac 601 tctgacgaga atattccgtc tgaaatgccc gcaccattac tggatgagtt ggggatgtta 661 ccggaggaac ttggacctct gaatgaaatt gaagacatta agccggtggc 
ggctccaatc 721 acattactat ctgagtttag agcctcagat aatgctaagc cactcgacat agtcgaaatc 781 attccagacg taagtccgac gaaaccttat gaagccgtca tatcaggtaa tgattggatg 841 acgttgggga ggatcatacc taccactccc gttcctacca taagggatgt cttcttctct 901 ggtctttctc ggcacggatc gccggaagtg atccagaatg ctcttgatga atttcttccg 961 ctccatcatt caattgatga taagtatttt caagaatggg ttgaaacctc agataaatct 1021 ctcgatgtcg atccatgtcg aatcgatctg agtgttttca acaactggca gtcttcggaa 1081 aactgctatg aacctcggtt taaaaccggt gcattatcca cacgtaaggg cactcaaact 1141 gaagccctat tagcgataaa gaaacgtaat atgaatgtgc ctaacctggg gcagatttat 1201 gacgtgaatt ctgttgctaa ttccgtggtt aataagctct taacaactgt tatagatcct 1261 gataagctgt gcatgtttcc agatttcata tctgagggtg aagtttcgta tttccaggac 1321 tatatagttg ggaagaatcc cgaccctgaa ttatattcag atcctctagg tgttcgttcc 1381 atcgatagct ataaacacat gattaaatcc gtgttaaagc ccgttgaaga taattctctg 1441 cacctagaac ggccgatgcc agcaaccata acataccatg ataaagatat cgtgatgtca 1501 tcttcaccaa tttttttggc tgctgctgcc cgcttgatgt taatcttaag agataagata 1561 accataccaa gcggaaaatt ccatcaattg ttttccatcg atgctgaagc ctttgatgca 1621 agtttccatt ttaaagagat agacttttcg aagtttgaca aaagtcaaaa tgagttgcat 1681 cacttgatcc aggaaaggtt tctgaaatac ttaggtatac ccaacgaatt tctaacctta 1741 tggtttaatg cgcatagaaa atcccgaatc tcagattcga agaatggcgt tttttttaac 1801 gtcgatttcc aacgtcgtac tggagatgcg ctcacgtact tgggaaacac aatagtgaca 1861 ttagcttgtc tgtgtcacgt gtatgacttg atggacccaa atgtgaaatt cgttgttgct 1921 tccggtgatg attcattgat aggcactgtg gaggaattac caagagatca agagtttctt 1981 ttcacgactc tttttaatct tgaagcaaag tttcctcata accagccttt catatgcagt 2041 aagtttttga ttactatgcc cactacaagt ggaggcaaag ttgtcctgcc gataccgaat 2101 ccattgaaac tcctcatacg cttgggttcg aagaaagtca atgccgatat attcgatgaa 2161 tggtatcaat cttggattga tataattggt ggttttaacg accaccatgt catccgatgc 2221 gttgccgcga tgacagcaca taggtatctc agaagaccga gtttatacct agaagctgct 2281 ttggaatccc taggtaagat cttcgctggt aagaccttgt gtaaggaatg cctctttaat 2341 gagaagcacg agtctaatgt aaaaattaag cctcgtagag tgaaaaaatc ccactcggat 2401 
gccaggtcaa gggcacgccg agcttgatgt tttcttgaca taagtcaaat tgccaacctc 2461 cactgggtgg gtcaaggttg aggtatagaa tcctattcgc tcctgatagg agaaattcta 2521 tattgcttat atatgtgctt acgcacatat ataaatgctc atgcaaaact gcatgaatgc 2581 ccctaaggga tgc // LOCUS AA2CG 4675 bp ss-DNA VRL 27-APR-1993 DEFINITION Adeno-associated virus 2, complete genome. ACCESSION J01901 M12405 M12468 M12469 VERSION J01901.1 GI:209616 KEYWORDS alternative splicing; complete genome; major coat protein. SOURCE Adeno-associated virus 2 DNA from human HeLa cells. ORGANISM adeno-associated virus 2 Viruses; ssDNA viruses; Parvoviridae; Parvovirinae; Dependovirus. REFERENCE 1 (bases 4532 to 4675) AUTHORS Samulski,R.J., Srivastava,A., Berns,K.I. and Muzyczka,N. TITLE Rescue of adeno-associated virus from recombinant plasmids: Gene correction within the terminal repeats of AAV JOURNAL Cell 33, 135-143 (1983) MEDLINE 84282662 REFERENCE 2 (bases 1 to 4675) AUTHORS Srivastava,A., Lusby,E.W. and Berns,K.I. TITLE Nucleotide sequence and organization of the adeno-associated virus 2 genome JOURNAL J. Virol. 
45, 555-564 (1983) MEDLINE 83164299 FEATURES Location/Qualifiers source 1..4675 /organism="adeno-associated virus 2" /db_xref="taxon:10804" repeat_region 1..145 /note="5' inverted terminal repeat" misc_feature 42..83 /note="flip oriented DNA" mRNA 287..4447 /note="major coat protein A mRNA (alt.)" CDS join(321..1906,2228..2252) /note="major coat protein A' (alt.)" /codon_start=1 /protein_id="AAA42372.1" /db_xref="GI:209617" /translation="MPGFYEIVIKVPSDLDGHLPGISDSFVNWVAEKEWELPPDSDMD LNLIEQAPLTVAEKLQRDFLTEWRRVSKAPEALFFVQFEKGESYFHMHVLVETTGVKS MVLGRFLSQIREKLIQRIYRGIEPTLPNWFAVTKTRNGAGGGNKVVDECYIPNYLLPK TQPELQWAWTNMEQYLSACLNLTERKRLVAQHLTHVSQTQEQNKENQNPNSDAPVIRS KTSARYMELVGWLVDKGITSEKQWIQEDQASYISFNAASNSRSQIKAALDNAGKIMSL TKTAPDYLVGQQPVEDISSNRIYKILELNGYDPQYAASVFLGWATKKFGKRNTIWLFG PATTGKTNIAEAIAHTVPFYGCVNWTNENFPFNDCVDKMVIWWEEGKMTAKVVESAKA ILGGSKVRVDQKCKSSAQIDPTPVIVTSNTNMCAVIDGNSTTFEHQQPLQDRMFKFEL TRRLDHDFGKVTKQEVKDFFRWAKDHVVEVEHEFYVKKGGAKKRPAPSDADISEPKRV RESVAQPSTSDAEASINYADRLARGHSL" CDS 321..2186 /note="major coat protein A" /codon_start=1 /protein_id="AAA42374.1" /db_xref="GI:209619" /translation="MPGFYEIVIKVPSDLDGHLPGISDSFVNWVAEKEWELPPDSDMD LNLIEQAPLTVAEKLQRDFLTEWRRVSKAPEALFFVQFEKGESYFHMHVLVETTGVKS MVLGRFLSQIREKLIQRIYRGIEPTLPNWFAVTKTRNGAGGGNKVVDECYIPNYLLPK TQPELQWAWTNMEQYLSACLNLTERKRLVAQHLTHVSQTQEQNKENQNPNSDAPVIRS KTSARYMELVGWLVDKGITSEKQWIQEDQASYISFNAASNSRSQIKAALDNAGKIMSL TKTAPDYLVGQQPVEDISSNRIYKILELNGYDPQYAASVFLGWATKKFGKRNTIWLFG PATTGKTNIAEAIAHTVPFYGCVNWTNENFPFNDCVDKMVIWWEEGKMTAKVVESAKA ILGGSKVRVDQKCKSSAQIDPTPVIVTSNTNMCAVIDGNSTTFEHQQPLQDRMFKFEL TRRLDHDFGKVTKQEVKDFFRWAKDHVVEVEHEFYVKKGGAKKRPAPSDADISEPKRV RESVAQPSTSDAEASINYADRYQNKCSRHVGMNLMLFPCRQCERMNQNSNICFTHGQK DCLECFPVSESQPVSVVKKAYQKLCYIHHIMGKVPDACTACDLVNVDLDDCIFEQ" mRNA 873..4447 /note="major coat protein A mRNA (alt.)" CDS join(993..1906,2228..2252) /note="major coat protein Aa (alt.)" /codon_start=1 /protein_id="AAA42373.1" /db_xref="GI:209618" /translation="MELVGWLVDKGITSEKQWIQEDQASYISFNAASNSRSQIKAALD 
NAGKIMSLTKTAPDYLVGQQPVEDISSNRIYKILELNGYDPQYAASVFLGWATKKFGK RNTIWLFGPATTGKTNIAEAIAHTVPFYGCVNWTNENFPFNDCVDKMVIWWEEGKMTA KVVESAKAILGGSKVRVDQKCKSSAQIDPTPVIVTSNTNMCAVIDGNSTTFEHQQPLQ DRMFKFELTRRLDHDFGKVTKQEVKDFFRWAKDHVVEVEHEFYVKKGGAKKRPAPSDA DISEPKRVRESVAQPSTSDAEASINYADRLARGHSL" CDS 993..2186 /note="major coat protein A'' (alt.)" /codon_start=1 /protein_id="AAA42375.1" /db_xref="GI:209620" /translation="MELVGWLVDKGITSEKQWIQEDQASYISFNAASNSRSQIKAALD NAGKIMSLTKTAPDYLVGQQPVEDISSNRIYKILELNGYDPQYAASVFLGWATKKFGK RNTIWLFGPATTGKTNIAEAIAHTVPFYGCVNWTNENFPFNDCVDKMVIWWEEGKMTA KVVESAKAILGGSKVRVDQKCKSSAQIDPTPVIVTSNTNMCAVIDGNSTTFEHQQPLQ DRMFKFELTRRLDHDFGKVTKQEVKDFFRWAKDHVVEVEHEFYVKKGGAKKRPAPSDA DISEPKRVRESVAQPSTSDAEASINYADRYQNKCSRHVGMNLMLFPCRQCERMNQNSN ICFTHGQKDCLECFPVSESQPVSVVKKAYQKLCYIHHIMGKVPDACTACDLVNVDLDD CIFEQ" mRNA 1853..4447 /note="major coat protein B mRNA (alt.)" intron 1907..2227 /note="major coat protein A intron" CDS 2810..4324 /note="major coat protein B" /codon_start=1 /protein_id="AAA42376.1" /db_xref="GI:209621" /translation="MATGSGAPMADNNEGADGVGNSSGNWHCDSTWMGDRVITTSTRT WALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQRLINNNW GFRPKRLNFKLFNIQVKEVTQNDGTTTIANNLTSTVQVFTDSEYQLPYVLGSAHQGCL PPFPADVFMVPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFTFSYTFEDVPF HSSYAHSQSLDRLMNPLIDQYLYYLSRTNTPSGTTTQSRLQFSQAGASDIRDQSRNWL PGPCYRQQRVSKTSADNNNSEYSWTGATKYHLNGRDSLVNPAMASHKDDEEKFFPQSG VLIFGKQGSEKTNVNIEKVMITDEEEIGTTNPVATEQYGSVSTNLQRGNRQAATADVN TQGVLPGMVWQDRDVYLQGPIWAKIPHTDGHFHPSPLMGGFGLKHPPPQILIKNTPVP ANPSTTFSAAKFASFITQYSTGHGQRGDRVGAAEGKQQTLESRNSVHFQLQQVC" repeat_region 4531..4675 /note="3' inverted terminal repeat" misc_feature 4592..4634 /note="flop oriented DNA" BASE COUNT 1198 a 1262 c 1251 g 964 t ORIGIN 5' end of genomic DNA. 
1 ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 61 cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 121 gccaactcca tcactagggg ttcctggagg ggtggagtcg tgacgtgaat tacgtcatag 181 ggttagggag gtcctgtatt agaggtcacg tgagtgtttt gcgacatttt gcgacaccat 241 gtggtcacgc tgggtattta agcccgagtg agcacgcagg gtctccattt tgaagcggga 301 ggtttgaacg cgcagccgcc atgccggggt tttacgagat tgtgattaag gtccccagcg 361 accttgacgg gcatctgccc ggcatttctg acagctttgt gaactgggtg gccgagaagg 421 aatgggagtt gccgccagat tctgacatgg atctgaatct gattgagcag gcacccctga 481 ccgtggccga gaagctgcag cgcgactttc tgacggaatg gcgccgtgtg agtaaggccc 541 cggaggccct tttctttgtg caatttgaga agggagagag ctacttccac atgcacgtgc 601 tcgtggaaac caccggggtg aaatccatgg ttttgggacg tttcctgagt cagattcgcg 661 aaaaactgat tcagagaatt taccgcggga tcgagccgac tttgccaaac tggttcgcgg 721 tcacaaagac cagaaatggc gccggaggcg ggaacaaggt ggtggatgag tgctacatcc 781 ccaattactt gctccccaaa acccagcctg agctccagtg ggcgtggact aatatggaac 841 agtatttaag cgcctgtttg aatctcacgg agcgtaaacg gttggtggcg cagcatctga 901 cgcacgtgtc gcagacgcag gagcagaaca aagagaatca gaatcccaat tctgatgcgc 961 cggtgatcag atcaaaaact tcagccaggt acatggagct ggtcgggtgg ctcgtggaca 1021 aggggattac ctcggagaag cagtggatcc aggaggacca ggcctcatac atctccttca 1081 atgcggcctc caactcgcgg tcccaaatca aggctgcctt ggacaatgcg ggaaagatta 1141 tgagcctgac taaaaccgcc cccgactacc tggtgggcca gcagcccgtg gaggacattt 1201 ccagcaatcg gatttataaa attttggaac taaacgggta cgatccccaa tatgcggctt 1261 ccgtctttct gggatgggcc acgaaaaagt tcggcaagag gaacaccatc tggctgtttg 1321 ggcctgcaac taccgggaag accaacatcg cggaggccat agcccacact gtgcccttct 1381 acgggtgcgt aaactggacc aatgagaact ttcccttcaa cgactgtgtc gacaagatgg 1441 tgatctggtg ggaggagggg aagatgaccg ccaaggtcgt ggagtcggcc aaagccattc 1501 tcggaggaag caaggtgcgc gtggaccaga aatgcaagtc ctcggcccag atagacccga 1561 ctcccgtgat cgtcacctcc aacaccaaca tgtgcgccgt gattgacggg aactcaacga 1621 ccttcgaaca ccagcagccg ttgcaagacc ggatgttcaa atttgaactc acccgccgtc 1681 tggatcatga ctttgggaag 
gtcaccaagc aggaagtcaa agactttttc cggtgggcaa 1741 aggatcacgt ggttgaggtg gagcatgaat tctacgtcaa aaagggtgga gccaagaaaa 1801 gacccgcccc cagtgacgca gatataagtg agcccaaacg ggtgcgcgag tcagttgcgc 1861 agccatcgac gtcagacgcg gaagcttcga tcaactacgc agacaggtac caaaacaaat 1921 gttctcgtca cgtgggcatg aatctgatgc tgtttccctg cagacaatgc gagagaatga 1981 atcagaattc aaatatctgc ttcactcacg gacagaaaga ctgtttagag tgctttcccg 2041 tgtcagaatc tcaacccgtt tctgtcgtca aaaaggcgta tcagaaactg tgctacattc 2101 atcatatcat gggaaaggtg ccagacgctt gcactgcctg cgatctggtc aatgtggatt 2161 tggatgactg catctttgaa caataaatga tttaaatcag gtatggctgc cgatggttat 2221 cttccagatt ggctcgagga cactctctct gaaggaataa gacagtggtg gaagctcaaa 2281 cctggcccac caccaccaaa gcccgcagag cggcataagg acgacagcag gggtcttgtg 2341 cttcctgggt acaagtacct cggacccttc aacggactcg acaagggaga gccggtcaac 2401 gaggcagacg ccgcggccct cgagcacgta caaagcctac gaccggcagc tcgacagcgg 2461 agacaacccg tacctcaagt acaaccacgc cgacgcggag tttcaggagc gccttaaaga 2521 agatacgtct tttgggggca acctcggacg agcagtcttc caggcgaaaa agagggttct 2581 tgaacctctg ggcctggttg aggaacctgt taagacggct ccgggaaaaa agaggccggt 2641 agagcactct cctgtggagc cagactcctc ctcgggaacc ggaaaggcgg gccagcagcc 2701 tgcaagaaaa agattgaatt ttggtcagac tggagacgca gactcagtac ctgaccccca 2761 gcctctcgga cagccaccag cagccccctc tggtctggga actaatacga tggctacagg 2821 cagtggcgca ccaatggcag acaataacga gggcgccgac ggagtgggta attcctccgg 2881 aaattggcat tgcgattcca catggatggg cgacagagtc atcaccacca gcacccgaac 2941 ctgggccctg cccacctaca acaaccacct ctacaaacaa atttccagcc aatcaggagc 3001 ctcgaacgac aatcactact ttggctacag caccccttgg gggtattttg acttcaacag 3061 attccactgc cacttttcac cacgtgactg gcaaagactc atcaacaaca actggggatt 3121 ccgacccaag agactcaact tcaagctctt taacattcaa gtcaaagagg tcacgcagaa 3181 tgacggtacg acgacgattg ccaataacct taccagcacg gttcaggtgt ttactgactc 3241 ggagtaccag ctcccgtacg tcctcggctc ggcgcatcaa ggatgcctcc cgccgttccc 3301 agcagacgtc ttcatggtgc cacagtatgg atacctcacc ctgaacaacg ggagtcaggc 3361 agtaggacgc tcttcatttt actgcctgga 
gtactttcct tctcagatgc tgcgtaccgg 3421 aaacaacttt accttcagct acacttttga ggacgttcct ttccacagca gctacgctca 3481 cagccagagt ctggaccgtc tcatgaatcc tctcatcgac cagtacctgt attacttgag 3541 cagaacaaac actccaagtg gaaccaccac gcagtcaagg cttcagtttt ctcaggccgg 3601 agcgagtgac attcgggacc agtctaggaa ctggcttcct ggaccctgtt accgccagca 3661 gcgagtatca aagacatctg cggataacaa caacagtgaa tactcgtgga ctggagctac 3721 caagtaccac ctcaatggca gagactctct ggtgaatccg gccatggcaa gccacaagga 3781 cgatgaagaa aagttttttc ctcagagcgg ggttctcatc tttgggaagc aaggctcaga 3841 gaaaacaaat gtgaacattg aaaaggtcat gattacagac gaagaggaaa tcggaacaac 3901 caatcccgtg gctacggagc agtatggttc tgtatctacc aacctccaga gaggcaacag 3961 acaagcagct accgcagatg tcaacacaca aggcgttctt ccaggcatgg tctggcagga 4021 cagagatgtg taccttcagg ggcccatctg ggcaaagatt ccacacacgg acggacattt 4081 tcacccctct cccctcatgg gtggattcgg acttaaacac cctcctccac agattctcat 4141 caagaacacc ccggtacctg cgaatccttc gaccaccttc agtgcggcaa agtttgcttc 4201 cttcatcaca cagtactcca cgggacacgg tcagcgtgga gatcgagtgg gagctgcaga 4261 aggaaaacag caaacgctgg aatcccgaaa ttcagtacac ttccaactac aacaagtctg 4321 ttaatcgtgg acttaccgtg gatactaatg gcgtgtattc agagcctcgc cccattggca 4381 ccagatacct gactcgtaat ctgtaattgc ttgttaatca ataaaccgtt taattcgttt 4441 cagttgaact ttggtctctg cgtatttctt tcttatctag tttccatggc tacgtagata 4501 agtagcatgg cgggttaatc attaactaca aggaacccct agtgatggag ttggccactc 4561 cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc cgacgcccgg 4621 gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg gccaa // LOCUS AA2LEFT 2116 bp DNA VRL 27-APR-1993 DEFINITION adeno-associated virus 2 left half 45% of genome. ACCESSION J01902 VERSION J01902.1 GI:209622 KEYWORDS . SOURCE adeno-associated virus 2 from human hela cells. ORGANISM adeno-associated virus 2 Viruses; ssDNA viruses; Parvoviridae; Parvovirinae; Dependovirus. REFERENCE 1 (bases 1 to 2116) AUTHORS Lusby,E.W. and Berns,K.I. 
TITLE mapping of the 5' termini of two adeno-associated virus 2 rnas in the left half of the genome JOURNAL J. Virol. 41, 518-526 (1982) MEDLINE 82192580 FEATURES Location/Qualifiers source 1..2116 /organism="adeno-associated virus 2" /db_xref="taxon:10804" BASE COUNT 526 a 539 c 615 g 436 t ORIGIN 1 ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 61 cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 121 gccaactcca tcactagggg ttcctggagg ggtggagtcg tgacgtgaat tacgtcatag 181 ggttagggag gtcctgtatt agaggtcacg tgagtgtttt gcgacatttt gcgacaccat 241 gtggtcacgc tgggtattta agcccgagtg agcacgcagg gtctccattt tgaagcggga 301 ggtttgaacg cgcagccgcc atgccggggt tttacgagat tgtgattaag gtccccagcg 361 accttgacgg gcatctgccc ggcatttctg acagctttgt gaactgggtg gccgagaagg 421 aatgggagtt gccgccagat tctgacatgg atctgaatct gattgagcag gcacccctga 481 ccgtggccga gaagctgcag cgcgactttc tgacggaatg gcgccgtgtg agtaaggccc 541 cggaggccct tttctttgtg caatttgaga agggagagag ctacttccac atgcacgtgc 601 tcgtggaaac caccggggtg aaatccatgg ttttgggacg tttcctgagt cagattcgcg 661 aaaaactgat tcagagaatt taccgcggga tcgagccgac tttgccaaac tggttcgcgg 721 tcacaaagac cagaaatggc gccggaggcg ggaacaaggt ggtggatgag tgctacatcc 781 ccaattactt gctccccaaa acccagcctg agctccagtg ggcgtggact aatatggaac 841 agtatttaag cgcctgtttg aatctcacgg agcgtaaacg gttggtggcg cagcatctga 901 cgcacgtgtc gcagacgcag gagcagaaca aagagaatca gaatcccaat tctgatgcgc 961 cggtgatcag atcaaaaact tcagccaggt acatggagct ggtcgggtgg ctcgtggaca 1021 aggggattac ctcggagaag cagtggatcc aggaggacca ggcctcatac atctccttca 1081 atgcggcctc caactcgcgg tcccaaatca aggctgcctt ggacaatgcg ggaaagatta 1141 tgagcctgac taaaaccgcc cccgactacc tggtgggcca gcagcccgtg gaggacattt 1201 ccagcaatcg gatttataaa attttggaac taaacgggta cgatccccaa tatgcggctt 1261 ccgtctttct gggatgggcc acgaaaaagt tcggcaagag gaacaccatc tggctgtttg 1321 ggcctgcaac taccgggaag accaacatcg cggaggccat agcccacact gtgcccttct 1381 acgggtgcgt aaactggacc aatgagaact ttcccttcaa cgactgtgtc gacaagatgg 1441 tgatctggtg 
ggaggagggg aagatgaccg ccaaggtcgt ggagtcggcc aaagccattc 1501 tcggaggaag caaggtgcgc gtggaccaga aatgcaagtc ctcggcccag atagacccga 1561 ctcccgtgat cgtcacctcc aacaccaaca tgtgcgccgt gattgacggg aactcaacga 1621 ccttcgaaca ccagcagccg ttgcaagacc ggatgttcaa atttgaactc acccgccgtc 1681 tggatcatga ctttgggaag gtcaccaagc aggaagtcaa agactttttc cggtgggcaa 1741 aggatcacgt ggttgaggtg gagcatgaat tctacgtcaa aaagggtgga gccaagaaaa 1801 gacccgcccc cagtgacgca gatataagtg agcccaaacg ggtgcgcgag tcagttgcgc 1861 agccatcgac gtcagacgcg gaagcttcga tcaactacgc agacaggtac caaaacaaat 1921 gttctcgtca cgtgggcatg aatctgatgc tgtttccctg cagacaatgc gagagaatga 1981 atcagaattc aaatatctgc ttcactcacg gacagaaaga ctgtttagag tgctttcccg 2041 tgtcagaatc tcaacccgtt tctgtcgtca aaaaggcgta tcagaaactg tgctacattc 2101 atcatatcat gggaaa // LOCUS AA2LTR1 145 bp DNA VRL 27-APR-1993 DEFINITION Adeno-associated virus 2 left terminal sequence. ACCESSION K01624 VERSION K01624.1 GI:209623 KEYWORDS replication; terminal repeat. SEGMENT 1 of 2 SOURCE Adeno-associated virus 2H DNA, (clone pSM620 [2]), from KB or HeLa cells. ORGANISM adeno-associated virus 2H Viruses; ssDNA viruses; Parvoviridae; Parvovirinae; Dependovirus. REFERENCE 1 (bases 1 to 145) AUTHORS Lusby,E., Fife,K.H. and Berns,K.I. TITLE Nucleotide sequence of the inverted terminal repetition in adeno-associated virus DNA JOURNAL J. Virol. 34, 402-409 (1980) MEDLINE 80185149 REFERENCE 2 (bases 1 to 145) AUTHORS Lefebvre,R.B., Riva,S. and Berns,K.I. TITLE Conformation takes precedence over sequence in adeno-associated virus DNA replication JOURNAL Mol. Cell. Biol. 4, 1416-1419 (1984) MEDLINE 85061247 COMMENT Both [1] and [2] present the opposite strand from the one presented here. The focus of both papers is the method of replication of the virus. [1] notes that the initial tt is present only 30% of the time; it is shortened to t in 50% of the population and missing altogether in 15% of the population. 
            There is further sequence heterogeneity which can be explained by
            assuming that the terminal 125 bases, which form an imperfect
            palindrome, are replaced by their inverted complement during
            replication. [2] found that deletion of the 9 terminal bases on
            the right and the 113 terminal bases on the left of AAV 2 genome
            did not stop DNA replication. Further deletion of an 11-base
            symmetrical sequence (bases 89 to 99) in the right terminal
            repetition inhibits DNA replication. Substitution of either an
            8-base (cagatctg) or 12-base (cgcggatccgcg) symmetrical sequence
            unrelated to the original 11-base sequence restores DNA
            replication. All of this can be explained by assuming that the
            125 base palindrome mentioned above form a t-shaped secondary
            structure which provides a primer for DNA polymerase during
            replication.
FEATURES             Location/Qualifiers
     source          1..145
                     /organism="adeno-associated virus 2H"
                     /db_xref="taxon:10805"
BASE COUNT       21 a     52 c     49 g     23 t
ORIGIN      2 bases upstream of HaeIII site.
        1 ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc
       61 cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg
      121 gccaactcca tcactagggg ttcct
//
LOCUS       AA2LTR2       145 bp    DNA             VRL       27-APR-1993
DEFINITION  Adeno-associated virus 2 right terminal sequence.
ACCESSION   K01625
VERSION     K01625.1  GI:209624
KEYWORDS    replication; terminal repeat.
SEGMENT     2 of 2
SOURCE      Adeno-associated virus 2H DNA (clone pSM620 [2]), from KB or HeLa
            cells.
  ORGANISM  adeno-associated virus 2H
            Viruses; ssDNA viruses; Parvoviridae; Parvovirinae; Dependovirus.
REFERENCE   1  (bases 1 to 145)
  AUTHORS   Lusby,E., Fife,K.H. and Berns,K.I.
  TITLE     Nucleotide sequence of the inverted terminal repetition in
            adeno-associated virus DNA
  JOURNAL   J. Virol. 34, 402-409 (1980)
  MEDLINE   80185149
REFERENCE   2  (bases 1 to 145)
  AUTHORS   Lefebvre,R.B., Riva,S. and Berns,K.I.
  TITLE     Conformation takes precedence over sequence in adeno-associated
            virus DNA replication
  JOURNAL   Mol. Cell. Biol.
            4, 1416-1419 (1984)
  MEDLINE   85061247
COMMENT     The focus of both papers is the method of replication of the virus.
            [1] notes that the initial tt is present only 30% of the time; it
            is shortened to t in 50% of the population and missing altogether
            in 15% of the population. There is further sequence heterogeneity
            which can be explained by assuming that the terminal 125 bases,
            which form an imperfect palindrome, are replaced by their inverted
            complement during replication. [2] found that deletion of the 9
            terminal bases on the right and the 113 terminal bases on the left
            of AAV 2 genome did not stop DNA replication. Further deletion of
            an 11-base symmetrical sequence (bases 89 to 99) in the right
            terminal repetition inhibits DNA replication. Substitution of
            either an 8-base (cagatctg) or 12-base (cgcggatccgcg) symmetrical
            sequence unrelated to the original 11-base sequence restores DNA
            replication. All of this can be explained by assuming that the
            125 base palindrome mentioned above form a t-shaped secondary
            structure which provides a primer for DNA polymerase during
            replication.
FEATURES             Location/Qualifiers
     source          1..145
                     /organism="adeno-associated virus 2H"
                     /db_xref="taxon:10805"
BASE COUNT       24 a     49 c     52 g     20 t
ORIGIN      22 bases upstream of HaeIII sites.
        1 aggaacccct agtgatggag ttggccactc cctctctgcg cgctcgctcg ctcactgagg
       61 ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc
      121 gagcgcgcag agagggagtg gccaa
//
LOCUS       AA2REPORI     145 bp ss-DNA             VRL       27-APR-1993
DEFINITION  Adeno-associated virus origin of replication (genome 3' terminus).
ACCESSION   M10681
VERSION     M10681.1  GI:209626
KEYWORDS    .
SOURCE      Adeno associated virus 2H (AAV2 H) DNA.
  ORGANISM  adeno-associated virus 2H
            Viruses; ssDNA viruses; Parvoviridae; Parvovirinae; Dependovirus.
REFERENCE   1  (bases 1 to 145)
  AUTHORS   Berns,K.I., Hauswirth,W.W., Fife,K.H. and Lusby,E.
  TITLE     Adeno-associated virus DNA replication
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol.
            43, 781-787 (1979)
  MEDLINE   80023388
FEATURES             Location/Qualifiers
     source          1..145
                     /organism="adeno-associated virus 2H"
                     /db_xref="taxon:10805"
BASE COUNT       23 a     49 c     52 g     21 t
ORIGIN
        1 aggaacccct agtgatggag ttggccactc cctctctgcg cgctcgctcg ctcactgagg
       61 ccgcccgggc aaagcccggg cgtcgggcga cctttggtcg cccggcctca gtgagcgagc
      121 gagcgcgcag agagggagtg gccaa
//
LOCUS       AAFVMAF      3171 bp ss-RNA             VRL       27-APR-1993
DEFINITION  Avian musculoaponeurotic fibrosarcoma virus AS42-specific fusion
            protein mRNA, 3' end, and env protein mRNA, 5' end.
ACCESSION   M26769
VERSION     M26769.1  GI:209627
KEYWORDS    AS42-specific fusion protein; c-myc proto-oncogene; env protein;
            leucine zipper.
SOURCE      Avian musculoaponeurotic fibrosarcoma virus, cDNA to viral RNA.
  ORGANISM  Avian musculoaponeurotic fibrosarcoma virus
            Viruses; Retroid viruses; Retroviridae; Avian type C retroviruses.
REFERENCE   1  (bases 1 to 3171)
  AUTHORS   Kawai,S.
  JOURNAL   Unpublished (1989)
REFERENCE   2  (bases 721 to 3000)
  AUTHORS   Nihizawa,M., Kataoka,K., Goto,N., Fujiwara,K.T. and Kawai,S.
  TITLE     v-maf, a viral oncogene that encodes a 'leucine zipper' motif
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 7711-7715 (1989)
  MEDLINE   90046665
COMMENT     Draft entry and computer-readable copy of sequence [2] kindly
            submitted by S.Kawai, 10-AUG-1989.
FEATURES Location/Qualifiers source 1..3171 /organism="Avian musculoaponeurotic fibrosarcoma virus" /db_xref="taxon:11959" CDS <1..1947 /note="AS42-specific fusion protein" /codon_start=1 /protein_id="AAA42377.1" /db_xref="GI:209628" /translation="PALTDWARIREELASTGPPVVAMPVVIKTEGPAWTPLEPKLITR LADTVRTKGLRSPITMAEVEALMSSPLLPHDVTNLMRVILGPAPYALWMDAWGVQLQT VIAAATRDPRHPANGQGRGERTNLDRLKGLADGMVGNPDGQAALLRPGELVAITASAL QAFREVARLAEPTDPWAEITQGPSESFVDFANRLIKAVEGSDLPPTARAPVIIDCFRQ KSQPDIQQLIRAAPSTLTTPGEIIKYVLDRQKTAPLTDQGIAAAMSSAIQPLVMAVVN RERMASELAMSGSDLPTSPLAMEYVNDFDLMKFEVKKEPVETDRIISQCGRLIAGGSL SSTPMSTPCSSVPPSPSFSAPSPGSGTDQKTHLEDYYWMTGYPQQLNPEALGFSPEDA VEALINSSHHPLPGAFDGYARGQQLAAAAGGSVPAEEMGSAAAVVSAVIAAAAAQGGA PHYHHHHHHPHHGGGGGGGGHPHGAAPGSAPPSSASSSAAGSGGGGGGGGGGAGGLHH PHHGGGGGGGGLHFDDRFSDEQLVTMSMRELNRQLRGVSKEEVIRLKQKRRTLKNRGY AQSCRFKRVQQRHVLESEKNQLLQQVEHLKQEISRLVRERDAYKEKYEKLVSNGFREN GSSSDNPSSPEFFMYPRESSTTVM" mRNA 2898..>3171 /note="env mRNA" BASE COUNT 863 a 839 c 848 g 621 t ORIGIN 1 ccggccctga ctgactgggc aaggatcagg gaggagcttg cgagtacagg tccgcccgtg 61 gtggccatgc ctgtagtgat taagacagag ggacccgcct ggacccctct ggagccaaaa 121 ttgatcacaa gactggctga tacggtcagg accaagggct tacgatcccc gatcactatg 181 gcagaagtgg aagcgcttat gtcctccccg ctgctgccgc atgacgtcac gaatctaatg 241 agagttattt taggacctgc cccatatgcc ttatggatgg acgcttgggg agtccaactg 301 cagacggtta tagcggcagc cactcgcgac ccccggcacc cggcgaatgg ccaagggcgg 361 ggggaacgga ccaatttaga tcgtttaaag ggattggcgg atggaatggt tggcaatcca 421 gatggtcagg ctgccttatt aagaccaggg gaactggttg ccattacagc gtcggctctc 481 caggcgttta gagaggtcgc tcggttggcg gaacccacgg acccgtgggc ggaaatcaca 541 cagggaccat ctgagtcctt tgtggatttt gctaatcgtc ttataaaagc ggtcgagggc 601 tcagatctcc cgcctaccgc gcgggctccg gtgatcattg actgctttag gcagaagtca 661 cagccagata ttcagcagct tatacgggca gcaccctcca cactgaccac cccaggagaa 721 ataatcaaat atgtgctaga taggcagaag actgcccctc ttacggatca aggcatagcc 781 gcggccatgt cgtctgctat ccagccctta gttatggcag tagtcaatag agaaaggatg 841 gcatcagaac tggcaatgag cggctccgac 
ctgcccacca gtcccctggc catggaatat 901 gttaatgact tcgatctgat gaagtttgaa gtgaaaaagg agccggtgga gaccgatcgc 961 attatcagcc agtgcggccg cttgatcgcc gggggatcgc tctcttccac cccgatgagc 1021 acgccctgca gctcggtgcc cccgtccccc agcttctcgg cgcccagccc cggctccggc 1081 accgaccaga agacccacct ggaagactac tactggatga cgggctaccc gcagcagctc 1141 aaccccgagg cgctgggctt cagccccgag gacgcggtgg aggcgctgat caacagcagc 1201 caccacccgc tgcccggcgc cttcgatggc tatgctagag ggcagcagct ggccgcggcc 1261 gccggcggct cggtgccggc cgaggagatg ggctcggcgg ccgccgtggt gtcggcggtg 1321 atcgccgcgg cggcggcgca gggcggcgcg ccccactacc accaccacca ccaccacccg 1381 caccacggcg gcggcggcgg cggcggcgga cacccccacg gcgcggcgcc gggcagcgcg 1441 ccgccctcct ccgcctcctc ctcggccgcg ggctccggag gcggcggcgg cggcggcggc 1501 ggaggcgccg gggggctgca ccacccgcac cacggaggag gcggcggcgg cggcggcctg 1561 cacttcgacg accgcttctc cgacgagcag ctggtcacca tgtcgatgcg ggagctgaac 1621 cggcagctgc ggggcgtcag caaggaagag gtgatccggc tgaagcagaa gaggaggacc 1681 ctcaaaaaca ggggctatgc ccagtcctgc cgcttcaaga gggtccagca gcggcacgtc 1741 ctggagtcgg agaagaacca gctgctgcag caagtggagc acctaaagca ggagatctcc 1801 aggctggtcc gggagaggga cgcctacaag gaaaagtacg agaagctggt cagcaatggc 1861 ttccgagaaa acggatccag cagcgacaac ccctcctctc cagagttttt catgtacccg 1921 agagaatctt ctacaacggt gatgtgaaaa tccccgttac ctgaggtcag aacaaaagaa 1981 aatgtgatgc tggactgatg atggcttagg acaacgttct ggcacgaaga cagctacaaa 2041 gcacaaaata caaaatacaa aaaaaaaaaa aaaaaaaaaa aagaactcag aaagaaacaa 2101 gaaaaaaaca gcattctctt ggtatcaacg agaagctctt tatttttcct caagttttct 2161 tttctaaaaa aaaaaaaaaa aaaaaaaaaa aaaaattact aagacatttg cagaaatgag 2221 acaaactttc tcttcaggtg acggtattca agcaagcatg ttgttttatt aaatttaaag 2281 aaaaatagag aaatttggga tacagtatca ggagaactga gaaactgtga attttccatg 2341 aaaaaaaaac cacaacaaac tccttaaaac tgtgaactga gaaatggatg gaaataccac 2401 ttaaccccct gattacaccc tgtgcgttct ttaggagaca acatcatctt tgctaactct 2461 atgatattag ggaatggttt acaaaatatc ttttattata atcttttaag acaaaagaaa 2521 tatttaatta agctgtgatt ttgacatttc tgttcagaag 
tcttcttgca ccctctgttt 2581 ttagatagca tggatgtatt tgcagccagc agtcaagggc aggagagatt tctttcctta 2641 ataatctttt tattcgccac accctacaaa atccttgcag ggaaaaggaa tcaacaccgc 2701 taccatatcc cgatcgatac ctacagccta acagacagta attatcaaaa taaagaggaa 2761 atcccatatt gggctacata cctaacccat taatcaaaac aacagagaat atctgtggca 2821 ccgcgtttct gcaatagcac aaagcatgcc tccttgggtt acactttgaa caggctataa 2881 tcaaaagaag agaggcacgc agcccccgcc tgggttctcc tgaaacccag tgcataagga 2941 gaggaggtaa atgggttaac caatcacggg agattaatga aacggagccg ttcagcttta 3001 cggtgaactg tacagctagt aatttgcgta atgccagtgg gtgctgcgga aaaacaggta 3061 tgatccttcc aggggtatgg atcgacagca caaaaggtaa tttcaccaaa ccaaaagcgc 3121 taccacccgg aattttcctc atttgtgggg atcgcgcatg gcaaggaatt c // LOCUS AALRRVT4 524 bp RNA VRL 05-APR-1992 DEFINITION Ross river virus T48 genomic RNA 3' untranslated sequence. ACCESSION X04311 VERSION X04311.1 GI:58421 KEYWORDS . SOURCE Ross River virus. ORGANISM Ross River virus Viruses; ssRNA positive-strand viruses, no DNA stage; Togaviridae; Alphavirus. REFERENCE 1 (bases 1 to 524) AUTHORS Faragher,S.G. and Dalgarno,L. TITLE Regions of conservation and divergence in the 3' untranslated sequences of genomic RNA from Ross River virus isolates JOURNAL J. Mol. Biol. 
190 (2), 141-148 (1986) MEDLINE 87086753 FEATURES Location/Qualifiers source 1..524 /organism="Ross River virus" /strain="T48" /db_xref="taxon:11029" BASE COUNT 216 a 87 c 81 g 139 t 1 others ORIGIN 1 taagctttag ttcaaagggc catataaacc cctgaatagt aacaaaatat aaaaattaca 61 aaatatgtag ttcaaagggc tacactaccc ctgattagta acaaaataga aaaccacaaa 121 atatgtagtt aagtattata agatgtgtag ttcaaagggc natatcaccc ctgattagta 181 acaaaatata aaaacaaaaa tatgtagtta agtactaacc aacaagtaga caaatagatg 241 ctaaccatat atataaccag ctatagtata ctatatttag ctaagcagtt gcagtagtaa 301 gaatgtagtt caaagggcta tacaacccct gaatagtaac aaaatacaaa aatactaata 361 aaaatttaaa aatcactaga aatccaatca ttaaattatt aattggctag ccgaactcta 421 aggagatgta ggcgtccgaa ctctgcggag atgtaggact aaattctgcc gaaccccata 481 acaccgggga cgtaggcgtc taatttgttt ttttaatatt ttac //