cis-Decoder: cDT-libraries


Mammalian common cDT-libraries

    A cDT, is a short sequence element of 6 bp or greater that is a perfect match to sequences within CSBs that are present in two or more enhancers. A cDT-library represents a collection of cDTs that are shared by the various enhancers examined. Two types of cDT-libraries have been generated in this study. First, a 'tissue-specific library' contains cDTs that are shared by a group of enhancers that regulate similar expression patterns but are absent from a second set of enhancers that direct expression in tissues outside of the first group. Second, a 'common cDT-library' contains cDTs that were shared between sets of enhancers of divergently regulated genes. A subset of common libraries included 'enriched' libraries that had a 3 fold greater representation from one enhancer type (e.g. neural) than from a second type (e.g. mesodermal).

    All libraries were generated from readouts of the CSB-aligner. Making enhancer-type specific libraries requires two different CSB-libraries generated from functionally different enhancers, a library from the tissue of interest (e.g. neural), and a second library that serves as an 'out-group' (e.g. mesodermal). For the generation of a neural cDT-library, neural CSBs in both forward and reverse directions were copy/pasted into both upper and lower windows of CSB-aligner. The resulting cDTs from this alignment are listed in the 'Result of CSB alignment table' of the CSB-aligner output, in the column titled 'Motif.' Since this cDT list contains multiple copies of different cDTs, the extra copies are removed using the Java applet Puzzamatic 1.0, a freeware created by Ron Surratt. The cDT list that contains all unique cDTs is then alphabetized and sorted by size also using Puzzamatic 1.0. The cDTs, constituting a raw neural cDT-library, were then copy/pasted into a Microsoft Word document. A second CSB-alignment is then performed with the neural CSBs in the top window of CSB-aligner, and mesodermal CSBs in the lower window. The cDTs from this alignment were freed of extra copies as above. These cDTs constituted an unedited common neural/mesodermal cDT-library. The unedited neural and common cDT-libraries are combined and cDTs common to the two libraries (present in the first and second alignments) are both removed using the JavaScript program cDT-cleaner, thus leaving only the neural-specific sequences. Neural enriched and common cDTs were curated from the unedited shared cDT-library.

    Frequency of sequence matches for each cDT was assessed using cDT-cataloger). For example the cDT AAAAAA is annotated n8;m8-AAAAAA to indicate that there were 4 hits on a mammalian neural CSB-library and 8 hit on a mammalian mesodermal CSB library. Presence of a full list of cDTs and their reverse-complement is assured using the cDT-cataloger program, especially the answer table that is generated, 'Results of cDT cataloger', which allows sorting of lists and generation of reverse-complements of cDTs.

Mammalian common cDT-libraries

Use cDT-scanner to scan an EvoPrint or a group of CSBs with copy/pasted cDT-library.
Return to cDT libraries Home Page

Mammalian Common cDTs [cDTs align with neural (n) and mesodermal (m) CSBs but not preferentially to one type]

n8;m8-AAAAAA
n3;m5-AAAAGC
n6;m8-AAAATA
n4;m9-AAACAA
n3;m6-AAACAG
n6;m4-AAACTT
n5;m3-AAAGAT
n4;m2-AAAGCC
n7;m3-AAAGGA
n5;m3-AAAGGC
n4;m7-AAAGTT
n4;m2-AAATAG
n3;m8-AAATAT
n5;m3-AAATCC
n6;m3-AAATGC
n3;m4-AAATGT
n5;m3-AAATTA
n8;m10-AACAAA
n4;m2-AACTCC
n4;m7-AACTTT
n5;m5-AAGAAA
n2;m1-AAGACA
n3;m3-AAGATA
n4;m3-AAGTCA
n6;m4-AAGTTT
n11;m5-AATAAA
n6;m6-AATAAT
n4;m10-AATATT
n8;m4-AATCAA
n3;m7-AATCAG
n3;m4-AATCCA
n7;m3-AATGAA
n2;m1-AATGGA
n13;m5-AATTAG
n4;m3-AATTGA
n7;m11-ACAAAA
n6;m3-ACAATA
n3;m3-ACACAA
n3;m3-ACAGAG
n2;m1-ACATCT
n3;m3-ACATTT
n3;m3-ACCTCC
n3;m3-AGAAAA
n4;m4-AGAAAG
n4;m3-AGACAA
n2;m2-AGACAT
n2;m1-AGATGT
n5;m3-AGCAGA
n4;m4-AGGGGA
n4;m5-AGGGGG
n3;m2-AGGTTA
n4;m4-AGTTTA
n3;m5-ATAAAG
n8;m5-ATAAAT
n2;m1-ATAACA
n3;m8-ATATTT
n4;m3-ATCAAG
n3;m4-ATCAGC
n3;m6-ATCAGG
n3;m4-ATCTGC
n5;m3-ATCTTT
n2;m2-ATGACA
n3;m4-ATGTCA
n2;m2-ATGTCT
n8;m3-ATTAAA
n12;m6-ATTAAT
n5;m5-ATTAGC
n5;m6-ATTATT
n6;m3-ATTGTG
n3;m4-ATTTAC
n8;m5-ATTTAT
n3;m4-ATTTTA
n5;m3-ATTTTC
n6;m3-CAAAGG
n4;m3-CAACAA
n3;m4-CAAGGG
n4;m3-CAATAA
n6;m5-CAATCA
n4;m2-CAATGA
n6;m7-CAATTA
n2;m4-CAATTG
n4;m3-CACAAA
n6;m3-CACAAT
n6;m9-CACACA
n2;m1-CACTGA
n5;m5-CAGACA
n3;m4-CAGATG
n5;m2-CAGGAG
n4;m2-CAGGGC
n3;m4-CATCTG
n3;m4-CATTTC
n4;m3-CCAGAC
n5;m3-CCCCAC
n4;m2-CCCCCA
n4;m6-CCCCCT
n4;m3-CCCCTC
n7;m8-CCCTCC
n3;m4-CCCTTG
n11;m6-CCTCCC
n3;m6-CCTGAT
n5;m4-CCTGGC
n4;m2-CCTTCA
n3;m2-CCTTGA
n5;m3-CCTTTC
n5;m3-CCTTTG
n4;m2-CTATTT
n13;m5-CTAATT
n9;m8-CTCCCC
n5;m2-CTCCTG
n5;m2-CTCTCC
n3;m4-CTCTGT
n3;m5-CTGACA
n4;m7-CTGATT
n5;m2-CTGCAG
n3;m6-CTGTTT
n3;m3-CTTGAT
n3;m5-CTTTAT
n7;m5-CTTTCA
n4;m5-CTTTCT
n5;m2-GAAAAA
n6;m3-GAAAAT
n5;m3-GAAAGA
n5;m3-GAAAGG
n3;m4-GAAATG
n5;m3-GAATAA
n5;m5-GACAGC
n6;m5-GAGAAA
n4;m4-GAGGGG
n2;m5-GATAAC
n3;m4-GCAGAT
n6;m3-GCATTT
n5;m4-GCCAGG
n5;m6-GCCCCC
n4;m2-GCCCTG
n5;m3-GCCTTT
n6;m5-GCTAAT
n3;m5-GCTGAT
n5;m5-GCTGTC
n3;m5-GCTTTT
n4;m5-GGAGAC
n5;m2-GGAGAG
n3;m5-GGAGGA
n7;m9-GGAGGG
n3;m3-GGAGGT
n4;m2-GGAGTT
n5;m3-GGATTT
n4;m2-GGCTTT
n4;m3-GGGAGA
n11;m6-GGGAGG
n9;m8-GGGGAG
n5;m6-GGGGGC
n3;m4-GTAAAT
n4;m5-GTCTCC
n4;m3-GTCTGG
n2;m5-GTGACA
n5;m3-GTGGGG
n5;m2-GTTAAA
n2;m5-GTTATC
n5;m3-GTTTAA
n3;m3-GTTTTA
n4;m6-TAAAAA
n3;m3-TAAAAC
n3;m4-TAAAAT
n4;m4-TAAACA
n4;m4-TAAACT
n6;m3-TAAATA
n3;m2-TAACCT
n5;m3-TAATAA
n6;m3-TAATGA
n6;m7-TAATTG
n5;m3-TAATTT
n5;m2-TACAAA
n3;m3-TAGAAA
n3;m3-TATCTT
n4;m5-TATTAA
n6;m3-TATTGT
n6;m3-TATTTA
n6;m8-TATTTT
n3;m2-TCAAGG
n4;m3-TCAATT
n2;m4-TCACAA
n2;m1-TCAGTG
n6;m3-TCATTA
n4;m2-TCATTG
n2;m1-TCCATT
n4;m4-TCCCCA
n4;m4-TCCCCT
n3;m5-TCCTCC
n7;m3-TCCTTT
n3;m5-TCTCCA
n4;m3-TCTCCC
n5;m3-TCTGCT
n5;m3-TCTTTC
n7;m5-TGAAAG
n4;m2-TGAAGG
n3;m4-TGACAT
n4;m3-TGACTT
n4;m2-TGAGAA
n6;m5-TGATTG
n3;m5-TGGAGA
n3;m4-TGGATT
n4;m4-TGGGGA
n4;m3-TGGGGG
n3;m3-TGGTAA
n5;m4-TGTAAA
n2;m5-TGTCAC
n3;m5-TGTCAG
n4;m2-TGTCAT
n5;m5-TGTCTG
n2;m1-TGTCTT
n6;m9-TGTGTG
n2;m1-TGTTAT
n4;m4-TGTTTA
n5;m5-TTAAAA
n5;m3-TTAAAC
n4;m5-TTAATA
n3;m3-TTACCA
n5;m3-TTATTA
n5;m3-TTATTC
n4;m4-TTATTG
n4;m3-TTCAAA
n7;m3-TTCATT
n4;m2-TTCTCA
n8;m4-TTGATT
n4;m3-TTGTCT
n2;m4-TTGTGA
n3;m3-TTGTGT
n4;m3-TTGTTG
n4;m1-TTGTTT
n5;m2-TTTAAC
n8;m3-TTTAAT
n5;m4-TTTACA
n11;m5-TTTATT
n3;m3-TTTCTA
n6;m5-TTTCTC
n5;m5-TTTCTT
n4;m3-TTTGAA
n5;m2-TTTGTA
n4;m3-TTTGTG
n8;m10-TTTGTT
n6;m5-TTTTAA
n3;m4-TTTTCT
n7;m11-TTTTGT
n4;m6-TTTTTA
n5;m2-TTTTTC
n8;m8-TTTTTT
n4;m5-ACAAAAG
n2;m1-ACAATGT
n2;m1-ACATTGT
n2;m1-AGCTGTA
n2;m2-AGGGGGG
n5;m2-ATTAGCA
n2;m1-ATTCTCA
n2;m1-ATTGAGA
n2;m1-ATTTGGC
n2;m2-CAGGAGG
n2;m1-CCCCACC
n2;m2-CCCCCCT
n2;m2-CCTCCTG
n4;m5-CTTTTGT
n2;m1-GAGCCAC
n2;m1-GCCAAAT
n2;m1-GGTGGGG
n2;m1-GTGGCTC
n2;m1-TACAGCT
n2;m1-TCTCAAT
n2;m1-TGAGAAT
n5;m2-TGCTAAT
n2;m1-CCAAATGG
n2;m1-CCATTTGG
n2;m2-TGTGTGTG
n2;m2-CACACACA
n2;m1-CTTTCTCTC
n2;m1-GAGAGAAAG
Contact information
Please feel free to contact Dr. Thomas Brody at brodyt@ninds.nih.gov or Dr. Ward Odenwald at odenwaldw@ninds.nih.gov if you have any questions about EvoPrinter or its applications.

Discover multispecies conserved sequences using EvoPrinter.
Return to cis-Decoder home.