Progetto Genoma Umano Perché il Progetto Genoma? La determinazione e la conoscenza dell intera sequenza genomica sembrano essere la condizione necessaria per comprendere la completa biologia di un determinato organismo
Timeline of large-scale genomic analyses
LE TAPPE DEL PROGETTO GENOMA 1953 James Watson e Francis Crick determinano la struttura del DNA (La doppia elica) 1977 Gli scienziati americani Allan Maxam and Walter Gilbert e l'inglese Frederick Sanger mettono a punto 2 diversi metodi per sequenziare il DNA, cioè per "leggere" la successione di basi nucleotidiche che lo compongono. Il metodo di Sanger, oggi automatizzato, è quello tuttora utilizzato. 1985 Lo scienziato americano Kary Mullis inventa la PCR, una tecnica che permette di moltiplicare artificialmente il DNA, anche se presente in quantità minima. 1986 Il premio Nobel Renato Dulbecco e Leroy Hood lanciano l'idea di sequenziare l'intero genoma Umano. 1990 Negli Stati Uniti nasce ufficialmente lo Human Genome Project (HGP), sotto la guida di James Watson. Negli anni successivi Regno Unito, Giappone, Francia, Germania, Cina si uniscono al progetto formando un consorzio pubblico internazionale. In Italia il progetto genoma nasce nel 1987 ma si interrompe nel 1995. 1992 Craig Venter lascia l'nih e il progetto pubblico. Fonderà una compagnia privata, la Celera Genomics, portando avanti un progetto genoma parallelo. 1993 Francis Collins e John Sulston diventano direttori rispettivamente del National Human Genome Research Center negli USA e del Sanger Center in Inghilterra, i 2 principali centri coinvolti nel HGP.
1999 (Dicembre) Pubblicata su Nature la sequenza completa del cromosoma 22. 2000 (Maggio) pubblicata su Nature la sequenza completa del cromosoma 21. 2000 (Giugno) Francis Collins e Craig Venter annunciano congiuntamente di aver completato la "bozza" del genoma Umano. 2001 La bozza completa del genoma umano (che gli inglesi chiamano working draft) è pubblicata su Nature (quella del consorzio pubblico) e su Science (quella della Celera). 2003 (Luglio) Viene pubblicata su NCBI la prima versione del hg (NCBI34/hg16) Celera Genomics (Applera, Applied Biosystems) Istituzioni pubbliche in: USA, UK, China Francia Germania Italia
Whitehead Institute Center for Genome Research, Cambridge, MA
Public approach
Livelli di copertura di cloni e sequenze
Whole genome shotgun sequencing Private approach
La grandezza totale del genoma umano aploide è di 3.070.000.000 basi di cui 2.843.000.000 sono di eucromatina
UCSC Genome Browser Screenshot from University of California at Santa Cruz http:// genome.ucsc.edu
Screen shot from the EnsEMBL project of European Bioinformatic Institute and Sanger Centre http://www.ensembl.org
Human Genome G e n e s ( e x o n s ) 1. 5-2 % C N C s 1-3 % Conserved Non Coding Regions 9 5. 0 % ~3,080,000,000 bp; ~25,000 genes sea3093
1 245,203,898 218,712,898 2 243,315,028 237,043,673 3 199,411,731 193,607,218 4 191,610,523 186,580,523 5 180,967,295 177,524,972 6 170,740,541 166,880,540 7 158,431,299 154,546,299 8 145,908,738 141,694,337 9 134,505,819 115,187,714 10 135,480,874 130,710,865 11 134,978,784 130,709,420 12 133,464,434 129,328,332 13 114,151,656 95,511,656 14 105,311,216 87,191,216 15 100,114,055 81,117,055 16 89,995,999 79,890,791 17 81,691,216 77,480,855 18 77,753,510 74,534,531 19 63,790,860 55,780,860 20 63,644,868 59,424,990 21 46,976,537 33,924,742 22 49,476,972 34,352,051 X 152,634,166 147,686,664 Y 50,961,097 22,761,097
Comparison of genetic and physical distance
Distribuzione del contenuto di GC (media 41%) Le isocore sono regioni abbastanza omogenee con percentuale crescente di contenuto di GC (L1, L2, H1, H2, H3) Nel 98% dei casi, i cloni che mappano sulle bande G (Giemsa) più scure hanno un basso contenuto di GC (37%) 80% dei cloni che mappano sulle bande G più chiare hanno un alto contenuto di GC
Istogramma del contenuto di GC (media su 20-kb)
Cromosomi umani con i colori dei corrispondenti cromosomi di topo Blocchi di sintenia: regioni del genoma dove l'ordine dei geni viene conservato, come risultato della discendenza da un antenato comune
Dimensione dei geni Gli esoni interni hanno una dimensione media di 145 bp rispetto alle 218bp del verme C. Elegans Il gene della distrofina è il più lungo (2.300.000 bp) Il gene titin ha la più lunga sequenza codificante (80.780), il più grande numero di esoni (324), il più lungo esone singolo (17.106bp)
SNPs single nucleotide polymorphisms Variazioni puntiformi della sequenza tra due copie del genoma Per un SNP un individuo può essere Omozigote T/T o C/C Eterozigote T/C
Come hanno origine gli SNP? In prima istanza da mutazioni Come si mantengono? Mediante deriva genetica nel caso di alleli neutrali Con la selezione positiva o selezione bilanciante in caso di alleli che determinano un vantaggio del fenotipo. La distribuzione di SNP nei genomi è molto variabile da genoma a genoma, da cromosoma a cromosoma, e anche all interno dei cromosomi Nell uomo in media c è uno SNP ogni 1000 bp In Drosophila e mais c è uno SNP ogni 50-100 bp Nell uomo c è uno SNP ogni 100 bp nella regione dell HLA e altre regioni dove per molte kb non si incontrano SNP
ACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAG CTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCG ACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAG CTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACA CACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTC GCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCT CTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGAT ATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGC TCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACG TAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACCGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCT CCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGAACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGA CCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCG ATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACA GCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACG TGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTC GCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACA CAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTA GCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACAC ACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCT GACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATA TAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCT CCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGC TAGCTAGCTCCTCTCGAGCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCT CGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGAC ACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGT GCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCG CGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACAC AGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAG CTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACA CAGATATATAGCGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGACGAGACGTAGGGCTCTCGATATAGCTCGCGACACA CACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGC ACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCT CGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATAT ATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTC GAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCG CTCGAGATAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATA GCTCGCGACACACACAGATATATAGCGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGACGAGACGTAGGGCTCTCGATA TAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCT CCGACACAGCTCGCACACCGCTCGAGACCTTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCT CGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTC GACGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGAT ATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCT GACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTG ACACGTGCTAGCTAGCTCCTCTCGACGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGA TATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATTATAGCTCGCGACACACACAGATATATAGCGTA GGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGACGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCG CTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACC TTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTTATAGCTCGCGACACACACAG ATATATAGCGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGACGAGACGTAGGGCTCTCGATATAGCTCGCGACACACAC AGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACA CCGCTCGAGACCTTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGACGTAGG GCTCTCGATATAGCTCGCGACACACACAGATATATAGCGGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGATAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGC TCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGTAGGGCTCTCGATATAGCT CGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGACGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCC GACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTTAGCTAGCTCCTCTCG AGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATAT AGCGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGACGAGACGTAGGGCTCTCGATATAGCTCGCCTCGCGACACACACA GATATATAGCGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGACGAGACGTAGGGCTCTCGATATAGCTCGCGACACACA CAGATATATAGCGCTCACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGT AGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCT CCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTG ACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTC GATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAAC AGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGA Variazioni della sequenza del DNA
The Human Gene Mutation Database (HGMD) Gross deletions Complex rearrang/s Gross Ins & Dupl Repeat variations Small Ins/Del Small Insertions 511 681 155 990 3779 4421 Small Deletions 10996 Single base pair substitutions Regulatory Splicing Nonsense Missense 915 6428 7742 30412 0 4000 8000 12000 16000 20000 24000 28000 32000 67030 mutations in 2478 genes sea3112
Mutazioni dei codoni umani (mutazioni independenti nei geni F8, F9, L1CAM, OTC, BTK) 400 Numbers normalised to 1,000 300 200 100 0 Phe F Leu L Ile I Met M Val V Ser S Pro P Thr T Ala A Tyr Y His H Gln Q Asn N Lys K Asp D Glu E Cys C Trp W Arg R Gly G 5082 Codons in 5 chrx genes 2446 Mutations in 5 chrx genes
Mutazioni nonsenso 15% 16% UAA UAG UGA 69% Su 731 mutazioni independenti SEA 3063 in 9 patologie del cromosoma X
SNPs nel genoma umano http://www.ncbi.nlm.nih.gov/snp/ DB SNP built 128 (23 Oct 2007) Totale 11.751.216 Totale codificanti 111.003 Codificanti sinonimi 46.621 Codificanti nonsinonimi 64.382
Sequenze ripetute Sono almeno il 50% del genoma ripetizioni derivate da trasposoni pseudogeni processati ripetizioni semplici di k-mers (A; CA; CGG) n duplicazioni segmentali blocchi di sequenze ripetuti in tandem
Long interspersed elements (LINEs) Short interspersed elements (SINEs) LTR retrotrasposons DNA trasposons
rosso = elementi ripetuti, blu = esoni
Copy Number Variation: CNV It is a widespread and common phenomenon among humans and was first uncovered following the completion of the HPG. Alteration in DNA Copy Number: amplification and deletion. Abnormal quantity of appearance of a genomic region in the genome: single gene - whole chromosome Some variations among normal individuals Can cause defects in human development Contributors to cancer Can effect function and gene expression
CNVRs contain different classes of functional elements. many CNVs preferentially lie outside genes. genes that are involved in cell-adhesion functions, sensory perception of smell and response to chemical stimuli are enriched within CNVs. Conversely, cell signalling and proliferation, as well as kinase-and phosphorylationrelated categories were underrepresented among CNVs. Interestingly, ultraconserved elements are strongly excluded from these regions.
ACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATA CTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCT ACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGC CTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCG CACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAG GCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCT CTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACA ATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACAC TCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAG TAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACCGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGC CCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGAACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACC CCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTC ATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAA GCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGAC TGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAG GCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCG CAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAG GCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACA ACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGA GACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCG TAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACA CCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACG TAGCTAGCTCCTCTCGAGCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATA CGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCC ACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACA GCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGC CGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGA AGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGC CTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACAC CAGATATATAGCGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGACGAGACGTAGGGCTCTCGATATAGCTCGCGAC CACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCT ACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCC CGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGA ATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCG GAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACA CTCGAGATAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGAT GCTCGCGACACACACAGATATATAGCGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGACGAGACGTAGGGCTCTCG TAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACA CCGACACAGCTCGCACACCGCTCGAGACCTTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCC CGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCT GACGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACA ATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGA GACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGAC ACACGTGCTAGCTAGCTCCTCTCGACGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCT TATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATTATAGCTCGCGACACACACAGATATATAGC GGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGACGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATA CTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAG TTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTTATAGCTCGCGACACACA ATATATAGCGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGACGAGACGTAGGGCTCTCGATATAGCTCGCGACACA AGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGC CCGCTCGAGACCTTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGACGT GCTCTCGATATAGCTCGCGACACACACAGATATATAGCGGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGATAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATAT TCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGTAGGGCTCTCGATATA CGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGACGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGC GACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTTAGCTAGCTCCTC AGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATA AGCGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGACGAGACGTAGGGCTCTCGATATAGCTCGCCTCGCGACACAC GATATATAGCGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGACGAGACGTAGGGCTCTCGATATAGCTCGCGACAC CAGATATATAGCGCTCACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGA AGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGC CCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGAC ACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCT GATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGA AGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGA 1 CNV= Copy Number Variation 10% del genoma umano
ACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAG CTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCG ACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAG CTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACA CACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTC GCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCT CTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGAT ATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGC TCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACG TAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACCGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCT CCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGAACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGA CCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCG ATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACA GCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACG TGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTC GCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACA CAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTA GCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACAC ACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCT GACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATA TAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCT CCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGC TAGCTAGCTCCTCTCGAGCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCT CGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGAC ACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGT GCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCG CGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACAC AGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAG CTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACA CAGATATATAGCGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGACGAGACGTAGGGCTCTCGATATAGCTCGCGACACA CACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGC ACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACTATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCC TGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACC TGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGAT ATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGC TCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACA CGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGC TCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACA GCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACG TGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTC GCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACA CAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTA GCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAG CTCCTCTCGACGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACAC ACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCG CACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTC TCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATA TATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGGACGTAGGGCTCTCGATATAGCTCGCGACACACACAG ATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGATAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTG ACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGA CACGTGCTAGCTAGCTCCTCTCGACGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGAT ATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCG CACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGC ACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGACGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCC TCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCC TGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCT GAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGACGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGA CCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGAC ACACACAGATATTATAGCTCGCGACACACACAGATATATAGCGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGACGAGA CGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGC GCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGT GCTAGCTAGCTCCTCTCGAGACGTTATAGCTCGCGACACACACAGATATATAGCGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTC CTCTCGACGAGACGTAGGGCTCTCGATATAGCTCGCGACACACACAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTGACCTGACACGTGCTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGACACACA CAGATATATAGCGCTCCCTGAAACAGCTCCGACACAGCTCGCACACCGCTCGAGACCTTAGCTAGCTCCTCTCGAGACGTAGGGCTCTCGATATAGCTCGCGA 1 2 CNV= Copy Number Variation 10% del genoma umano
E una volta che il genoma è sequenziato? E iniziata l era post-genomica:
Post-Genomica genomica e post-genomica funzionale: cioè la comprensione su vasta scala del trascrittoma, dei network biologici e delle interazioni funzionali tra prodotti genici proteomica farmacogenomica
Tecniche di genomica e post-genomica funzionale
THE END