;ID   ATLINE1_4   DNA   ; ATH   ; 6700 BP
;XX
;DE   ATLINE1_4, non-LTR retrotransposon - a fossil.
;XX
;AC   AC006248
;XX
;DT   30-NOV-2001 (Rel. 6.3, Created)
;DT   30-NOV-2001 (Rel. 6.3, Last updated, Version 1)
;XX
;KW   non-LTR retrotransposon; L1 superfamily; poly(A) tail; ORF1; ORF2; 
;KW   reverse transcriptase; ATLINE1_4.
;XX
;OS   thale cress
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
;OC   Dilleniidae; Capparales; Brassicaceae.
;XX
;RN   [1]  (bases 1 to 6700)
;RA   Kapitonov,V. and Jurka,J.
;RT   ATLINE1_4, a non-LTR retrotransposon.
;RL   Repbase Reports 1:(3) p. 33 (2001)
;XX
;CC   ATLINE1_4 is a non-LTR retrotransposon. Its individual copies are 
;CC   ~88% identical to each other. There are only 3 copies of 
;CC   ATLINE1_4 present in the genome. ATLINE1_4 belongs to the L1
;CC   superfamily of non-LTR retrotransposons, its copies are flanked
;CC   by ~15-bp target site duplications. 
;CC   Two proteins, ATLINE1_4p1 and ATLINE1_4p2, are encoded by ORF1 
;CC   (position 389-2380) and ORF2 (position 2551-6609), respectively. 
;CC   ATLINE1_4p2 contains the reverse transcriptase and endonuclease 
;CC   domains.
;CC   ATLINE1_4p1 (664 aa):
;CC   MSKRIRPSWYRESPPKQPPFAFEPEEEDDVVILPQVDNSALLARLHLSLVGRMFHQGGRSTKALLSFLPK
;CC   ENIWDVEGRVRGVSLGDARFQFFFESEVDLQKVLNKRPCHFNKWSFALERWEPHVGTSFPNIMTFWVRTE
;CC   GIPAEFWDEEVLRNFGNSLGLVRRVDPSKGRILISVTADVPLRFNKNAQLPSGTVVKVKLSYEKLFRWCS
;CC   YCRRICHELEQCPLLDAEQKAVLSAEESQRNLRLSLRDGDSSQARLPLQSFPESNRSRSERNHLPLLNGP
;CC   PSSRPYHSRGVIRREENNLASRSGRNRSHRQPYPASLNQYPAEHASKLNRPHGAVSHKRPASPVVTRNAG
;CC   VDDGRKRRFGDSFSKEKSSAPPNKQSSPLLSDSQLTLSDTVLAPSRAKQITSSPTYVRERPFRLNLSKKA
;CC   SALEKGKGKVVEHPTPLLGESSVVGSSAKKSLNFEPSEPAPKNLDTPISVTSKSLEPQLEKRKSWYDMTV
;CC   EEDEATARSLESGPDTILAAKFSQVVSFASPSVVSPSLSVLPSPAAPEDEWNESLNPLSEALNLDWTEED
;CC   EAAYHLADDLDVDADDLLSEELQESLQDQSGLVIPGSHVAPISELQPEKLKSLIVSSLQSEEVGPSIAIP
;CC   RRKDSKKKVVPHSRKAQLNSGLCLNLASKKIHHI
;CC   ATLINE1_4p1 (1354 aa, two false frame-shifts are corrected based on
;CC   alignments with related proteins):
;CC   MRLISWNCQGVGPKTTSRRLEEMCRMYSPGFLFLSETKNDLVYLQNVQVSLGFDCLKTVEPIGNSGGLAL
;CC   FYSRDYPVKFIYVCDRLIDIETIIDGNRVFITFVYGDPVVQYRELVWKRLTRIGIVRSEPWFMIGDFNEI
;CC   IGNHEKRGGKKRSESSFLPFCCMIENCGMIDFPSTGSLFSWVGKRSCGVAGRKRRDLIKCRLDRAMGNEE
;CC   WHSIYSHTNVEYLQHRGSDHKPLLASIQNKPYRPYKHFIFDKRWINKPGFKESVQEGWAFPSRGEGVPFF
;CC   QKIKNCRQTISIWKKSNKTNTEKLILELHSQLDLAYEDENFSTEDLLALKWKLCQAYRDEEIFWKLKSRE
;CC   IWLQLGDMNTKFFHASVHKQRRARNKILGLLNQDGLWVDNEVGVEHLAENYFETLFTTSDPQVFDSALQE
;CC   VPVLITEEMNKSLTKVISPEEVKRALFSLNPDKAPGPDGMTAFFYQHYWDLTGPDLIKLVQNFHSTGFFD
;CC   ERLNETNICLIPKTERPRKMAEFRPISLCNVSYKVISKVLSSRLKRLLPELISETQSAFVAERLITDNIL
;CC   IAQENFHALRTNPACKKKYMAIKTDMSKAYDRVEWSFLRALMLKMGFAQKWVDWIIFCISSVSYKILLNG
;CC   SPKGFIKPSRGIRQGDPISPFLFILCTEALVAKLKDAEWHGRIQGLQISRASPSTSHLLFADDSLFFCKA
;CC   DPLQGKEIIDILRLYGEASGQQLNPDKSSVMFGHEVDNSIRNTIKVSLGIHKDGGMGSYLGLPKQIHGSK
;CC   TQVFSFVRDRLQKRINGWTSKFLSKGGKEILIKSVAQALPTYVMSCFLLPKAIRSKLSSVVANFWWKTRE
;CC   ESNGIHWIAWDKLCTPFSDGGLGFRTLEEFNLVLLAKQLWRLIRFPNSLLSRVLRGRYFRYSDPIQIGKA
;CC   NRPSFGWRSIMAAKPLLLSGLRRTIGSGMLTRVWEDPWIPSFPPRPAKSILNIRDTHLYVNDLIDPVTKQ
;CC   WKLGRLQELVDPSDIPLILGIRPSRTYKSDDFSWSFTKSGNYTVKSGYWAARDLSRPTCDLPFQGPSVSA
;CC   LQAQVWKIKTTRKFKHFEWQCLSGCLATNQRLFSRHIGTEKVCPRCGAEEESINHLLFLCPPSRQIWALS
;CC   PIPSSEYIFPRNSLFYNFDFLLSRGKEFDIAEDIMEIFPWILWYIWKSRNRFIFENVIESPQVILDFAIQ
;CC   EANVWKQANSKEVATEYPPPQVVPANLPPTRNVCQFDASWHLKDTLSGHGWVLVDQDIVLLLGLKSARKS
;CC   LSPLHAEVDSLLWAMECMISLGVSDCSFASDSADFISLLENPSEWPTFVAELATFSSLVCFFPSFSIKFF
;CC   SRIYNVRADCLSKKARARNSLFSM
;XX
;DR   Positions 35574  28875  Accession No AC006248   GenBank (rel. 124.0)
;XX
;SQ   Sequence 6700 BP; 1793 A; 1441 C; 1452 G; 2014 T; 0 other;
ATLINE1_4
gacggattatattataccaaaatgttatctttctttcttgactcttttctctttcaggcacagtctccga
gataactgaaagaggcaaatgggttcaaaattagtagatctgaactaatttttttagctattttgggaaa
ttttccctcttttcttgccggagtctcctttttctcttctcccctcatggttttcttctctgtttctttt
agccacagactttgcttctcatctgctctctctcaagaattcttctgctgagcttttgtgaatgcaaaga
aaggctctcatctcctctgaagtttctcctgcaatagcagagtttttgcttgtttctagcttttagctaa
gtctgtgtgtgttattacacatctctcctctgcttaccatgtcgaagagaattcgcccaagctggtacag
agagtcccctccaaaacaacctccttttgcttttgaacctgaagaagaagatgatgttgtcattctccct
caagtggataattcagcgctccttgctcgtctccatctcagcctcgtggggagaatgttccaccaaggag
gacgcagtactaaagcccttctctcttttctccctaaggaaaacatatgggatgtggagggtcgagtccg
aggcgtttctctcggagatgctcgatttcaattcttttttgaatctgaagttgacctccaaaaagttctc
aacaagagaccttgtcacttcaacaagtggtctttcgcattggagagatgggaaccccatgttggaacct
ccttcccaaacatcatgactttttgggttagaactgaaggcatccctgctgagttttgggatgaagaagt
gttaagaaattttggtaactctcttggtttagttcggagagttgatccctctaaaggaagaattctaatc
tctgtgacagcagatgttcctctaagattcaacaagaatgcacaactcccctctggaacagttgtcaaag
tgaaactttcttatgaaaagctctttcgctggtgctcgtattgtcgccgtatttgtcacgaactagagca
gtgtcctttactcgatgcagaacaaaaagctgttctctcagctgaggaatctcaacgtaaccttcgactc
tctcttagagatggagatagttctcaagctcgacttcctctacaatcttttccagagtctaaccgttcta
gatctgagagaaatcacttacccttgctgaatggtcctccttcctcaagaccatatcactctcgtggagt
catcagaagggaagaaaacaacttagcttcaaggtctggtcgcaatcgatcccaccgacagccttaccct
gcttccctcaatcagtaccctgctgagcatgcttccaagctgaatcgacctcatggtgctgtctcccaca
aacgcccagcctcccctgttgtcactagaaacgccggtgttgatgacggaaggaaacgtcgttttggtga
ttctttctccaaggaaaaaagctctgcacctcctaacaaacaatcatcaccactcctctcggattcacaa
ttgacgctttcagacacggtcttggctccctcaagggctaaacagataacctcttccccgacttatgtca
gggaaagaccattccggttaaatctctctaagaaggcttccgctttagagaaaggtaaaggaaaggttgt
ggagcatcctactccgttgctaggtgaatcctctgttgttgggagctcggctaagaaatctcttaacttt
gaaccctctgaaccggctccgaagaaccttgatacgcccatctctgtaacaagcaagtcgctggagccac
aactggaaaaacgaaagagttggtatgatatgactgtagaagaagatgaagctactgcgagaagtcttga
gtccggtccggatacaatacttgcagccaaattctcgcaggtggtctctttcgcaagtccatctgtcgtc
tctccatctctctctgttcttccatctcctgctgctccggaggatgagtggaatgagtctctcaatccgc
tctcggaagctctaaatttggattggactgaagaagacgaggctgcttaccatcttgccgacgacctaga
tgtggatgctgacgatttgttgagtgaggagcttcaggagagtcttcaggatcaatccggtttggtgatt
ccggggagccatgtagcacccatctctgagcttcagccggaaaaattgaagagcttgatcgtgagctcgc
ttcaatcggaggaggtggggccgtctattgctatccctaggaggaaggactcaaagaagaaagttgtgcc
ccattccagaaaagcccaactaaattcgggcctttgtttgaacttggcttcaaaaaaaattcatcacatt
tagggtttatctcccgagaaaaggctatccaacaaatccggcccatctaggccgagatcctctaggagaa
ccacgacgggtaaaagaaatggagtgccgtctacttccaagactaagagtccggttttgagtgttgtttc
ggaagggatcccaccttccgacaacaaaacatgaggcttatttcgtggaactgtcaaggggtgggaccga
aaactacatcccgtcgactggaagagatgtgtcggatgtactctccaggttttctttttttatcggaaac
taagaatgatcttgtgtatcttcagaatgtacaagtttcattaggctttgattgtttaaaaactgttgaa
cctatcggtaacagtggtggtcttgctcttttttattctcgagactacccggttaagtttatctatgtat
gtgatcgtttgattgatattgaaacaattattgatggaaaccgcgtatttattacttttgtttacggtga
tccggtagttcaataccgtgaattagtttggaaacgtttgacccgtattggtattgttcggtctgagcca
tggtttatgatcggtgattttaatgagattatcggtaatcatgaaaaaagaggaggaaaaaagagatctg
aatcctctttcctccctttctgttgtatgattgaaaactgtggaatgattgatttcccctctacagggag
tttattttcttgggtcggaaaacgaagttgtggagtggcgggccgaaagagacgtgacctcattaagtgt
cggttagatagagctatgggaaatgaagaatggcatagtatttattcccacacgaatgtggagtacctgc
agcataggggttcagatcataaacccctacttgcatccatacaaaacaaaccttatcgcccttataaaca
ttttatttttgataaacgatggataaataaaccaggttttaaggaatccgtccaagaaggatgggctttc
ccctctcgaggagagggtgttccgttttttcagaagattaaaaattgtagacaaactatatctatctgga
aaaaatccaacaaaacaaatacggagaagcttattcttgagctacattcccaacttgatttggcgtatga
agatgagaatttctctactgaagaccttcttgccctaaaatggaagctctgccaagcgtacagagacgag
gagattttttggaaacttaagagtagggagatctggctccaacttggagacatgaacacaaaattttttc
atgcgtccacaagcaaaggagggctaggaataaaattctaggtttgctaaatcaagatggcctttgggtg
gataatgaggtgggagttgaacatcttgcagagaattactttgagactctttttaccacgtcagacccac
aagtttttgattccgctctccaggaagtccccgtgctaattactgaagaaatgaataaatctctcacaaa
ggtgatctcaccagaggaagtcaagcgagctcttttttcgctaaatcctgataaagctccagggccggat
gggatgacagctttcttttatcagcactattgggaccttacggggcctgatcttattaaacttgtccaaa
atttccattccaccggtttctttgatgaacggctgaatgagaccaatatctgtttgatcccaaaaacaga
aagacctaggaaaatggcagaatttcgccctatcagtctttgcaatgtcagttacaaagttatatccaaa
gttctaagttctcgtctaaaacgtcttcttccggaattaatatccgagacgcaatctgcctttgtagcgg
agcgcttgattactgataacattcttatcgctcaagaaaattttcatgcccttcgaacaaatccagcctg
caagaagaaatacatggccattaaaactgatatgagcaaggcttatgatcgagttgaatggtctttcttg
cgagctctaatgttgaaaatgggtttcgctcagaaatgggtagattggataattttctgcatctcttcgg
tatcttacaaaattcttttaaatggatcacctaaagggtttatcaaaccatcgagaggtattcgtcaggg
cgaccctatctctccgtttctgtttatcttgtgtacagaagctcttgtagcaaagttgaaagatgcagaa
tggcatggccgcattcaaggacttcagatttcgagagctagcccatcaacatcccatcttctttttgctg
atgatagcctatttttctgtaaagctgaccctttacaaggtaaagagattattgatattctccgcctata
tggggaggcttcaggacaacagttgaatcccgacaagtcctcggtaatgttcggtcatgaggtggataac
tctatcagaaataccattaaagtatctcttgggattcataaggatgggggtatgggatcttatttgggac
tcccaaaacaaattcatggctcgaagactcaagttttttcctttgtaagagatagacttcaaaaaagaat
aaatggatggacgtccaaattcttgtctaaaaggaggcaaagagatacttatcaagtctgtggcgcaggc
tcttccgacatatgttatgtcgtgtttccttctccccaaagctatccgctcaaaactaagtagtgttgtg
gctaatttctggtggaagacaagggaggaaagtaatggtattcattggatagcttgggacaagctttgta
caccattctctgatggtggtctcgggtttagaacgttggaggaatttaatcttgtgttactagcaaagca
actttggcgcctaattagatttccaaattctctcctaagcagggtgttgaggggaaggtactttaggtat
agtgatcctattcaaattggaaaggctaatcgtccttcgtttggatggagaagcattatggctgctaaac
cactccttctctcgggtctaaggagaacaataggatctggaatgctaacccgagtgtgggaagacccttg
gatcccttcttttccaccaaggcctgcaaaaagtattcttaacataagggacacacacctctatgttaat
gatttaattgatcccgtgacaaagcaatggaaattaggtagacttcaagagttggttgacccttccgata
ttcctttgattttaggaattcggccgagtcgtacttacaaaagtgatgatttcagctggtcatttaccaa
atctgggaattatactgtaaaatccggttattgggccgcgagagatctctctcgtcctacttgtgacctc
ccttttcagggacctagtgtttcggcgcttcaggcacaagtatggaaaattaaaactacgcgtaaattta
aacatttcgagtggcaatgcctttcgggctgtctagcgacaaaccaaagacttttctcccgtcacattgg
taccgaaaaagtttgccctcgatgtggtgcagaggaagagtctataaaccatcttctttttctctgtccg
ccttcgagacaaatctgggccttgtcccctatcccgtcttcggagtacatttttccccggaattctctct
tctataattttgatttcttactctcgcgaggaaaagaatttgatatagcagaagatattatggagatctt
tccatggatactttggtatatttggaagagtcgtaaccggtttattttcgaaaacgttattgaatccccg
caagttattctggattttgcaattcaagaagcaaatgtttggaaacaagcaaattcaaaggaggtggcta
cggagtaccctcctcctcaggtggtccctgctaatcttccccctactcggaatgtttgccagtttgacgc
atcttggcatctaaaggacaccttgagtggacatggatgggtcttggtagaccaagacatcgtcctactt
ttgggccttaagagtgcacggaagagtctatcaccactacacgctgaagtagattccttgttgtgggcaa
tggaatgcatgatctccttaggtgtctcggattgctctttcgcatctgacagtgctgattttatctctct
tcttgaaaatccttcggagtggcctacctttgttgctgagttggcgacttttagttcccttgtctgtttt
ttcccttcttttagtattaagttcttttctcgtatttacaatgtgcgggccgattgcctttcaaaaaaag
cacgagcccgcaatagtcttttttccatgtaagcaagtcggttcctgattggctctctctaaaaaagagt
atctttccaattacttaatagaaatggtgtttcgacggaagaaaaaaaaa1