ID HSIGHAF standard; mRNA; HUM; 1089 BP. XX AC J00231; XX SV J00231.1 XX DT 13-JUN-1985 (Rel. 06, Created) DT 04-MAR-2000 (Rel. 63, Last updated, Version 8) XX DE Human Ig gamma3 heavy chain disease OMM protein mRNA. XX KW C-region; gamma heavy chain disease protein; KW gamma3 heavy chain disease protein; heavy chain disease; hinge exon; KW immunoglobulin gamma-chain; immunoglobulin heavy chain; KW secreted immunoglobulin; V-region. XX OS Homo sapiens (human) OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Primates; Catarrhini; Hominidae; Homo. XX RN [1] RP 1-1089 RX MEDLINE; 82247835. RX PUBMED; 6808505. RA Alexander A., Steinmetz M., Barritault D., Frangione B., Franklin E.C., RA Hood L., Buxbaum J.N.; RT "gamma Heavy chain disease in man: cDNA sequence supports partial gene RT deletion model"; RL Proc. Natl. Acad. Sci. U.S.A. 79(10):3260-3264(1982). XX DR GDB; 119339; IGHG3. DR GOA; P01860. DR IMGT/LIGM; J00231; J00231. DR SWISS-PROT; P01860; GC3_HUMAN. XX CC The protein isolated from patient OMM is a gamma heavy chain CC disease (HCD) protein. It has a large 5' internal deletion CC consisting of most of the variable region and the entire ch1 CC domain. [1] suggests that the protein abnormality is from a partial CC gene deletion rather than from defective splicing. XX FH Key Location/Qualifiers FH FT source 1..1089 FT /db_xref="taxon:9606" FT /mol_type="mRNA" FT /organism="Homo sapiens" FT /map="14q32.33" FT mRNA <1..1089 FT /note="gamma3 mRNA" FT CDS 23..964 FT /codon_start=1 FT /db_xref="GOA:P01860" FT /db_xref="SWISS-PROT:P01860" FT /note="OMM protein (Ig gamma3) heavy chain" FT /gene="IGHG3" FT /protein_id="AAA52805.1" FT /translation="MKXLWFFLLLVAAPRWVLSQVHLQESGPGLGKPPELKTPLGDTTH FT TCPRCPEPKSCDTPPPCPRCPEPKSCDTPPPCPRCPEPKSCDTPPPCPXCPAPELLGGP FT SVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPXVQFKWYVDGVEVHNAKTKLREEQYN FT STFRVVSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPXXXXXXXXXXXXE FT EMTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYNTTPPMLDSDGSFFLYSKLTVDKS FT RWQQGNIFSCSVMHEALHNRYTQKSLSLSPGK" FT sig_peptide 26..79 FT /note="OMM protein signal peptide" FT /gene="IGHG3" FT mat_peptide 80..961 FT /note="OMM protein mature peptide" FT /gene="IGHG3" XX SQ Sequence 1089 BP; 240 A; 358 C; 271 G; 176 T; 44 other; cctggacctc ctgtgcaaga acatgaaaca nctgtggttc ttccttctcc tggtggcagc 60 tcccagatgg gtcctgtccc aggtgcacct gcaggagtcg ggcccaggac tggggaagcc 120 tccagagctc aaaaccccac ttggtgacac aactcacaca tgcccacggt gcccagagcc 180 caaatcttgt gacacacctc ccccgtgccc acggtgccca gagcccaaat cttgtgacac 240 acctccccca tgcccacggt gcccagagcc caaatcttgt gacacacctc ccccgtgccc 300 nnngtgccca gcacctgaac tcttgggagg accgtcagtc ttcctcttcc ccccaaaacc 360 caaggatacc cttatgattt cccggacccc tgaggtcacg tgcgtggtgg tggacgtgag 420 ccacgaagac ccnnnngtcc agttcaagtg gtacgtggac ggcgtggagg tgcataatgc 480 caagacaaag ctgcgggagg agcagtacaa cagcacgttc cgtgtggtca gcgtcctcac 540 cgtcctgcac caggactggc tgaacggcaa ggagtacaag tgcaaggtct ccaacaaagc 600 cctcccagcc cccatcgaga aaaccatctc caaagccaaa ggacagcccn nnnnnnnnnn 660 nnnnnnnnnn nnnnnnnnnn nnnnngagga gatgaccaag aaccaagtca gcctgacctg 720 cctggtcaaa ggcttctacc ccagcgacat cgccgtggag tgggagagca atgggcagcc 780 ggagaacaac tacaacacca cgcctcccat gctggactcc gacggctcct tcttcctcta 840 cagcaagctc accgtggaca agagcaggtg gcagcagggg aacatcttct catgctccgt 900 gatgcatgag gctctgcaca accgctacac gcagaagagc ctctccctgt ctccgggtaa 960 atgagtgcca tggccggcaa gcccccgctc cccgggctct cggggtcgcg cgaggatgct 1020 tggcacgtac cccgtgtaca tacttcccag gcacccagca tggaaataaa gcacccagcg 1080 ctgccctgg 1089 // ID HSFOS standard; genomic DNA; HUM; 6210 BP. XX AC K00650; M16287; XX SV K00650.1 XX DT 26-JUL-1991 (Rel. 28, Created) DT 02-JUL-1999 (Rel. 60, Last updated, Version 3) XX DE Human fos proto-oncogene (c-fos), complete cds. XX KW c-myc proto-oncogene; fos oncogene; proto-oncogene. XX OS Homo sapiens (human) OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Primates; Catarrhini; Hominidae; Homo. XX RN [1] RP 1-4165 RX MEDLINE; 83221560. RX PUBMED; 6574479. RA van Straaten F., Muller R., Curran T., Van Beveren C., Verma I.M.; RT "Complete nucleotide sequence of a human c-onc gene: deduced amino acid RT sequence of the human c-fos protein"; RL Proc. Natl. Acad. Sci. U.S.A. 80(11):3183-3187(1983). XX RN [2] RX MEDLINE; 86028185. RX PUBMED; 2414012. RA Treisman R.; RT "Transient accumulation of c-fos RNA following serum stimulation requires a RT conserved 5' element and c-fos 3' sequences"; RL Cell 42(3):889-902(1985). XX RN [3] RP 4166-6210 RX MEDLINE; 87217118. RA Verma I.M., Deschamps J., Van Beveren C., Sassone-Corsi P.; RT "Human fos gene"; RL Cold Spring Harb. Symp. Quant. Biol. 51:0-0(0). XX DR EPD; EP11145; HS_FOS. DR GDB; 119917; FOS. DR GOA; P01100. DR SWISS-PROT; P01100; FOS_HUMAN. DR TRANSFAC; R00458; HS$CFOS_01. DR TRANSFAC; R00459; HS$CFOS_02. DR TRANSFAC; R00460; HS$CFOS_03. DR TRANSFAC; R00461; HS$CFOS_04. DR TRANSFAC; R00463; HS$CFOS_06. DR TRANSFAC; R00464; HS$CFOS_07. DR TRANSFAC; R00465; HS$CFOS_08. DR TRANSFAC; R00466; HS$CFOS_09. DR TRANSFAC; R00467; HS$CFOS_10. DR TRANSFAC; R00468; HS$CFOS_11. DR TRANSFAC; R00470; HS$CFOS_13. DR TRANSFAC; R00471; HS$CFOS_14. DR TRANSFAC; R01640; HS$CFOS_15. DR TRANSFAC; R01889; HS$CFOS_16. DR TRANSFAC; R03425; HS$CFOS_17. DR TRANSFAC; R04046; HS$CFOS_18. DR TRANSFAC; R04047; HS$CFOS_19. DR TRANSFAC; T00123; T00123. XX CC [2] sites; promoter region. CC C-fos is the human cellular homolog of the v-fos oncogene of CC Finkel-Biskis-Jinkins murine osteosarcoma virus (FBJ-MuSV). [2] It CC was found that both human and murine c-fos genes contained an CC enhancer-like element in their 5' noncoding regions that was CC necessary for increased transcription following serum activation. CC The FBJ-MuSV v-fos oncogene contains a deletion relative to murine CC and human c-fos proto-oncogenes that causes complete divergence of CC the COOH terminal protein sequences encoded. That deletion CC corresponds to positions 3182-3285 inclusive of this sequence. The CC FBJ-MuSV v-fos sequence is more closely related to murine than CC human c-fos sequences. The FBJ-MuSV v-fos coding sequence ends at CC a 'tag' stop codon coresponding to positions 3434-2436 of this CC sequence [1]. [1] notes two alu repeats beginning aproximately 500 CC and 1700 nucleotides downstream of the last base in this sequence. CC A TATA box is located at positions 701-707. Two potential CC polyadenylation signals are present in the 3' untranslated region. XX FH Key Location/Qualifiers FH FT source 1..6210 FT /db_xref="taxon:9606" FT /mol_type="genomic DNA" FT /organism="Homo sapiens" FT /map="14q24.3" FT misc_feature 402..453 FT /note="transcriptional activator region [2]" FT prim_transcript 734..>3329 FT /note="c-fos mRNA [1]" FT gene 889..1029 FT /gene="FOS" FT CDS join(889..1029,1783..2034,2466..2573,2688..3329) FT /codon_start=1 FT /db_xref="GOA:P01100" FT /db_xref="SWISS-PROT:P01100" FT /note="c-fos protein" FT /protein_id="AAA52471.1" FT /translation="MMFSGFNADYEASSSRCSSASPAGDSLSYYHSPADSFSSMGSPVN FT AQDFCTDLAVSSANFIPTVTAISTSPDLQWLVQPALVSSVAPSQTRAPHPFGVPAPSAG FT AYSRAGVVKTMTGGRAQSIGRRGKVEQLSPEEEEKRRIRRERNKMAAAKCRNRRRELTD FT TLQAETDQLEDEKSALQTEIANLLKEKEKLEFILAAHRPACKIPDDLGFPEEMSVASLD FT LTGGLPEVATPESEEAFTLPLLNDPEPKPSVEPVKSISSMELKTEPFDDFLFPASSRPS FT GSETARSVPDMDLSGSFYAADWEPLHSGSLGMGPMATELEPLCTPVVTCTPSCTAYTSS FT FVFTYPEADSFPSCAAAHRKGSSSNEPSSDSLSSPTLLAL" FT exon <889..1029 FT /note="c-fos protein; G00-119-917" FT /number=1 FT /gene="FOS" FT intron 1030..1782 FT /note="c-fos intron A" FT exon 1783..2034 FT /number=2 FT intron 2035..2465 FT /note="c-fos intron B" FT exon 2466..2573 FT /number=3 FT intron 2574..2687 FT /note="c-fos intron C" FT exon 2688..>3329 FT /note="c-fos protein" FT /number=4 XX SQ Sequence 6210 BP; 1497 A; 1571 C; 1619 G; 1523 T; 0 other; gcaggaacag tgctagtatt gctcgagccc gagggctgga ggttagggga tgaaggtctg 60 cttccacgct ttgcactgaa ttagggctag aattggggat gggggtaggg gcgcattcct 120 tcgggagccg aggcttaagt cctcggggtc ctgtactcga tgccgtttct cctatctctg 180 agcctcagaa ctgtcttcag tttccgtaca agggtaaaaa ggcgctctct gccccatccc 240 ccccgacctc gggaacaagg gtccgcattg aaccaggtgc gaatgttctc tctcattctg 300 cgccgttccc gcctcccctc ccccagccgc ggcccccgcc tccccccgca ctgcaccctc 360 ggtgttggct gcagcccgcg agcagttccc gtcaatccct ccccccttac acaggatgtc 420 catattagga catctgcgtc agcaggtttc cacggccttt ccctgtagcc ctggggggag 480 ccatccccga aacccctcat cttggggggc ccacgagacc tctgagacag gaactgcgaa 540 atgctcacga gattaggaca cgcgccaagg cgggggcagg gagctgcgag cgctggggac 600 gcagccgggc ggccgcagaa gcgcccaggc ccgcgcgcca cccctctggc gccaccgtgg 660 ttgagcccgt gacgtttaca ctcattcata aaacgcttgt tataaaagca gtggctgcgg 720 cgcctcgtac tccaaccgca tctgcagcga gcaactgaga agccaagact gagccggcgg 780 ccgcggcgca gcgaacgagc agtgaccgtg ctcctaccca gctctgcttc acagcgccca 840 cctgtctccg cccctcggcc cctcgcccgg ctttgcctaa ccgccacgat gatgttctcg 900 ggcttcaacg cagactacga ggcgtcatcc tcccgctgca gcagcgcgtc cccggccggg 960 gatagcctct cttactacca ctcacccgca gactccttct ccagcatggg ctcgcctgtc 1020 aacgcgcagg taaggctggc ttcccgtcgc cgcggggccg ggggcttggg gtcgcggagg 1080 aggagacacc gggcgggacg ctccagtaga tgagtagggg gctcccttgt gcctggaggg 1140 aggctgccgt ggccggagcg gtgccggctc gggggctcgg gacttgctct gagcgcacgc 1200 acgcttgcca tagtaagaat tggttccccc ttcgggaggc aggttcgttc tgagcaacct 1260 ctggtctgca ctccaggacg gatctctgac attagctgga gcagacgtgt cccaagcaca 1320 aactcgctaa ctagagcctg gcttcttcgg ggaggtggca gaaagcggca atcccccctc 1380 ccccggcagc ctggagcacg gaggagggat gagggaggag ggtgcagcgg gcgggtgtgt 1440 aaggcagttt cattgataaa aagcgagttc attctggaga ctccggagcg gcgcctgcgt 1500 cagcgcagac gtcagggata tttataacaa accccctttc aagcaagtga tgctgaaggg 1560 ataacgggaa cgcagcggca ggatggaaga gacaggcact gcgctgcgga atgcctggga 1620 ggaaaagggg gagacctttc atccaggatg agggacattt aagatgaaat gtccgtggca 1680 ggatcgtttc tcttcactgc tgcatgcggc actgggaact cgccccacct gtgtccggaa 1740 cctgctcgct cacgtcggct ttccccttct gttttgttct aggacttctg cacggacctg 1800 gccgtctcca gtgccaactt cattcccacg gtcactgcca tctcgaccag tccggacctg 1860 cagtggctgg tgcagcccgc cctcgtctcc tctgtggccc catcgcagac cagagcccct 1920 caccctttcg gagtccccgc cccctccgct ggggcttact ccagggctgg cgttgtgaag 1980 accatgacag gaggccgagc gcagagcatt ggcaggaggg gcaaggtgga acaggtgagg 2040 aactctagcg tactcttcct gggaatgtgg gggctgggtg ggaagcagcc ccggagatgc 2100 aggagcccag tacagaggat gaagccactg atggggctgg ctgcacatcc gtaactggga 2160 gccctggctc caagcccatt ccatcccaac tcagactctg agtctcaccc taagaagtac 2220 tctcatagtt tcttccctaa gtttcttacc gcatgctttc agactgggct cttctttgtt 2280 ctcttgctga ggatcttatt ttaaatgcaa gtcacaccta ttctgcaact gcaggtcaga 2340 aatggtttca cagtggggtg ccaggaagca gggaagctgc aggagccagt tctactgggg 2400 tgggtgaatg gaggtgatgg cagacacttt tactgaatgt cggtcttttt ttgtgattat 2460 tctagttatc tccagaagaa gaagagaaaa ggagaatccg aagggaaagg aataagatgg 2520 ctgcagccaa atgccgcaac cggaggaggg agctgactga tacactccaa gcggtaggta 2580 ctctgtgggt tgctcctttt taaaacttaa gggaaagttg gagattgagc ataagggccc 2640 ttgagtaaga ctgtgtctta tgctttcctt tatccctctg tatacaggag acagaccaac 2700 tagaagatga gaagtctgct ttgcagaccg agattgccaa cctgctgaag gagaaggaaa 2760 aactagagtt catcctggca gctcaccgac ctgcctgcaa gatccctgat gacctgggct 2820 tcccagaaga gatgtctgtg gcttcccttg atctgactgg gggcctgcca gaggttgcca 2880 ccccggagtc tgaggaggcc ttcaccctgc ctctcctcaa tgaccctgag cccaagccct 2940 cagtggaacc tgtcaagagc atcagcagca tggagctgaa gaccgagccc tttgatgact 3000 tcctgttccc agcatcatcc aggcccagtg gctctgagac agcccgctcc gtgccagaca 3060 tggacctatc tgggtccttc tatgcagcag actgggagcc tctgcacagt ggctccctgg 3120 ggatggggcc catggccaca gagctggagc ccctgtgcac tccggtggtc acctgtactc 3180 ccagctgcac tgcttacacg tcttccttcg tcttcaccta ccccgaggct gactccttcc 3240 ccagctgtgc agctgcccac cgcaagggca gcagcagcaa tgagccttcc tctgactcgc 3300 tcagctcacc cacgctgctg gccctgtgag ggggcaggga aggggaggca gccggcaccc 3360 acaagtgcca ctgcccgagc tggtgcatta cagagaggag aaacacatct tccctagagg 3420 gttcctgtag acctagggag gaccttatct gtgcgtgaaa cacaccaggc tgtgggcctc 3480 aaggacttga aagcatccat gtgtggactc aagtccttac ctcttccgga gatgtagcaa 3540 aacgcatgga gtgtgtattg ttcccagtga cacttcagag agctggtagt tagtagcatg 3600 ttgagccagg cctgggtctg tgtctctttt ctctttctcc ttagtcttct catagcatta 3660 actaatctat tgggttcatt attggaatta acctggtgct ggatattttc aaattgtatc 3720 tagtgcagct gattttaaca ataactactg tgttcctggc aatagtgtgt tctgattaga 3780 aatgaccaat attatactaa gaaaagatac gactttattt tctggtagat agaaataaat 3840 agctatatcc atgtactgta gtttttcttc aacatcaatg ttcattgtaa tgttactgat 3900 catgcattgt tgaggtggtc tgaatgttct gacattaaca gttttccatg aaaacgtttt 3960 attgtgtttt taatttattt attaagatgg attctcagat atttatattt ttattttatt 4020 tttttctacc ttgaggtctt ttgacatgtg gaaagtgaat ttgaatgaaa aatttaagca 4080 ttgtttgctt attgttccaa gacattgtca ataaaagcat ttaagttgaa tgcgaccaac 4140 cttgtgctct tttcattctg gaagtcttgt aagtttctga aaggtattat tggagaccag 4200 tttgtcaaga agggtagctg ctggaggggg acacaccctc tgtctgatcc cttatcaaag 4260 aggacaagga aactatagag ctgattttag aatattttac aaatacatgc cttccattgg 4320 aatgctaaga ttttctactg cttctgggga cgggaaaccg ctgtgtaaca gcttttgtgg 4380 gaatacattt tttctgtttc agtactcgca gggggaaata tttaaatttt gttgtgctaa 4440 tattaaattc agatgttttg atcttaaagg aaccctttaa gcaaacagaa cctagctttg 4500 tacagactat tttaactttt tattctcaca aaatcacgtg gagggttatt ctacttcaaa 4560 gatgagcaaa ttgaagaatg gttagaataa acaactttct tgatattccg ttatcggcat 4620 tagaatcttc ctgctcgtta tcgtatccag caggctgaac tgcctcttga tacttggtta 4680 aaaaaaattt tcaggccggg cgcggtggcc catgcctgta atcctagcac tttgggaggc 4740 cgaggcaggc ggatcacctg aggtcgggag ttcgagacca gcctgaccaa catggagaaa 4800 ccccgtcttt actaaaaata caaaattagc ctggtgtggt ggtgcatgcc tgtaatccta 4860 gctacttgag aggctgagac aggaaaatca cttgaactcg ggaggcggat gttgcagcga 4920 actgagattg cgccattgca ctccagcctg ggcaacaaga ttgaaactct gtttaaaaaa 4980 aaaagttttc actaatgtgt acattttttt gtactctttt attctcgaaa gggaaggagg 5040 gctattgccc tatcccttat taataaatgc attgtggttt ctggtttctc taataccata 5100 tgcccttcat tcagtttata gtgggcggaa gtgggggaga aaaagttgct cagaaatcaa 5160 aagatatctc aaacagcaca aataatggct gatcgttctg caaacaaaaa gttacataat 5220 agctcaagaa ggagaagtca acatgactct gaacaagctt taacttagaa actttatcat 5280 cttaaggaag aacgtgacct ttgtccagga cgtctctggt aatggggcac ttacacacac 5340 atgcacacgt acaaaccaca gggaaaggag accgcccttc tgcctctgct cgcgagtatc 5400 acgcaggcac catgcactat gttttcacac acactgggtg gaagaagagc ttcagcgcca 5460 gtcttctaat gctttggtga taatgaaaat cactgggtgc ttatggggtg tcatattcaa 5520 tcgagttaaa agttttaatt caaaatgaca gttttactga ggttgatgtt ctcgtctatg 5580 atatctctgc ccctcccata aaaatggaca tttaaaagca acttaccgct ctttagatca 5640 ctcctatatc acacaccact tggggtgctg tttctgctag acttgtgatg acagtggcct 5700 taggatccct gtttgctgtt caaagggcaa atattttata gcctttaaat atacctaaac 5760 taaatacaga attaatataa ctaacaaaca cctggtctga aataacaagg tgatctaccc 5820 tggaaggaac ccagctggtg ggccaggagc ggtggctcac acctgtaatt ccagcacttt 5880 gggaggctga gacaggagga tcactggagt ccaggagttt gagaccagcc tgggcaacat 5940 ggcaaaaccc agtgtgcttc tgttgtccca gctacactac tcaggaggct gaggcaggag 6000 tatgacttga gcctgggagg gggaggttgc agagaactga tattgcacca ccactgcact 6060 ccagcctggg tgacacagca aaaccctatc tcaaaaaaaa aaaaaaaaaa aaggaaccca 6120 gctggttcct gtaggtgtgc aataataaca accagaggaa gaaaaggaag acgatttccc 6180 agatgaagaa gggcagctgg accttcggac 6210 // ID BUM standard; genomic RNA; VRL; 200 BP. XX AC J02231; XX SV J02231.1 XX DT 29-AUG-2003 (Rel. 77, Created) DT 29-AUG-2003 (Rel. 77, Last updated, Version 1) XX DE La Crosse virus isolate l74 m-rna 3' sequence. XX KW . XX OS La Crosse virus OC Viruses; ssRNA negative-strand viruses; Bunyaviridae; Orthobunyavirus; OC California encephalitis virus group. XX RN [1] RP 1-200 RX MEDLINE; 82216937. RX PUBMED; 7086954. RA Clerx-van Haaster C.M., Akashi H., Auperin D.D., Bishop D.H.L.; RT "Nucleotide sequence analyses and predicted coding of bunyavirus genome RNA RT species"; RL J. Virol. 41(1):119-128(1982). XX FH Key Location/Qualifiers FH FT source 1..200 FT /db_xref="taxon:11577" FT /mol_type="genomic RNA" FT /organism="La Crosse virus" XX SQ Sequence 200 BP; 68 A; 34 C; 47 G; 51 T; 0 other; agtagtgtac taccaagtat agataacgtt taaatattaa agttttggat caaagccaaa 60 gatgattcgc atgctggtgc tgattgtagt tacagctgca agcccagtgt atcagagatg 120 tttccaagat ggggctatag tgaagcaaaa cccatccaaa gaggcagtca cagaagtgtc 180 cctaaaagat gatgttagca 200 //