La lecture à portée de main
Découvre YouScribe en t'inscrivant gratuitement
Je m'inscrisDécouvre YouScribe en t'inscrivant gratuitement
Je m'inscrisDescription
Sujets
Informations
Publié par | profil-zyak-2012 |
Publié le | 01 janvier 2008 |
Nombre de lectures | 42 |
Langue | English |
Poids de l'ouvrage | 6 Mo |
Extrait
Thèse présentée pour obtenir le titre de
Docteur de l’Université Louis Pasteur Strasbourg 1
Discipline : Sciences du Vivant
Spécialité : Bioinformatique
par
Ravi Kiran Reddy KALATHUR
Approche systématique et intégrative pour le stockage, l’analyse et la
visualisation des données d’expression génique acquises par des techniques
à haut débit, dans des tissus neuronaux
(An integrated systematic approach for storage, analysis and visualization of
gene expression data from neuronal tissues acquired through high-throughput
techniques)
Soutenue publiquement le 15 Janvier 2008 devant le jury :
Directeur de thèse Olivier POCH, IGBMC, Illkirch
Rapporteur interne Brigitte KIEFFER, IGBMC, Illkirch
Rapporteur externe Christian GRIMM, UZH, Zurich Pascal BARBRY, IPMC, Valbonne
Examinateur Jean-Marie WURTZ, IGBMC, Illkirch
Membre invité Thierry LÉVEILLARD, INSERM unit 592, Paris Dedicated to my Mother
Acknowledgements
I would like to thank the members of my jury Pascal Barbry, Christian Grimm, Jean-
Marie Wurtz, Brigitte Kieffer and Thierry Léveillard for taking time out of their
schedules to review my thesis.
I would also like to thank the director of the IGBMC – Dino Moras and Jean-Claude
Thierry for their support.
At the outset, I would like to thank my guide and mentor Olivier Poch for
giving me the opportunity to work in his laboratory over the past three years. He has
been a source of inspiration and very patient with me, especially over the past few
months. Working with Olivier has been a positive and enriching experience for me
and will remain with me for a long time to come.
I am immensely grateful to Julie for patiently proofreading our articles and
chapters of my thesis and I am indebted to her for this.
All the members of my laboratory have been a great source of strength and support to
me. A special thanks to Wolfgang Raffelsberger for his constant guidance. Nicolas
Wicker and I worked together towards two of my projects and he has helped me with
my thesis to a great extent.
I would like to thank Nicolas and Guillaume for their extensive help with
respect to the RETINOBASE project. I would also like to thank Raymond Ripp for
helping me with RETINOBASE and guiding me throughout the writing of this
chapter.
I would like to thank Radhouene for helping me to translate my resume in
French and Anne for helping me with thesis formatting and for asking me “ça va!”
every morning.
Laetitia and Naomi were the first end users of the database that I designed
during my thesis; I appreciate their comments and suggestions.
My stay in the lab would not be complete if I did not mention the help from
my other colleagues including Luc, Vero, Annaick, Yann, Laurent-Philippe, Yannick-
Noel, Fred, Laurent, Odile, Ngoc-Hoan, David, Sophie and Emmanuel.
Once again I would like to sincerely thank Olivier Poch and his team, without
their help this thesis would not have been possible.
I would like to thank Thierry Léveillard for his valuable suggestions regarding
RETINOBASE.
Thanks! Serge for maintaining our servers. I would like to thank Latetia for
helping me out with my documents to the university and travel arrangements.
This entire project has been supported by RETNET funded through the European
Union research programme MRTN-CN-2003-504003. I would like to thank all the
members of RETNET for their valuable suggestions and guidance.
On a personal front I would like to thank Raj and Ullas and their families for
creating for me a home away from home and for always being there for me. All my
friends here at the IGBMC who have been a stronghold and a great support to me, I
am glad I have friends like you. Murugan, Beena, Jai, Amit and Harshal thanks for
being around!
Last but not least my heartfelt gratitude goes out to my dear parents, my
brother and sister-in-law for always being there for me. This would not have been
possible without you, and so I dedicate this work to you.
List of abbreviations
Abbreviations (Computer and Statistics)
AIC: Akaike Information Criterion
BIC: Bayesian Information Criterion
CEM: Competitive Expectation Maximisation
CGED: Cancer Gene Expression Database
CSS: Cascading Style Sheets
DBMS: DataBase Management System
EM: Expectation Maximisation
FTP: File Transfer Protocol
HTML: Hypertext Markup Language
LDAP (netscape): Lightweight Directory Access Protocol
MLA: Maximum Likelihood Approximation
ODBMS: Object Database Management System
PHP: Hypertext Preprocessor
RDBMS: Relational Database Management System
SQL: Structured Query Language
Abbreviations (Biology and Bioinformatics)
CEL: Contains information about each probe on the chip is extracted from the image
data by the Affymetrix image analysis software.
dChip: DNA-Chip Analyzer
DDBJ: DNA Data Bank of Japan
DNA: Deoxyribonucleic acid
EMBL: The European Molecular Biology Laboratory
EST: Expressed Sequence Tag
FASABI: Functional And Statistical of Biological Data
GEO: Gene Expression Omnibus
GO: Gene Ontology
GOLD: Genomes OnLine Database
KEGG: Kyoto Encyclopedia of Genes and Genomes
MAS 5.0: Affymetix Microarray Suite 5.0
MIAME: Minimum Information About a Microarray Experiment
OMIM: Online Mendelian Inheritance in Man
PCR: Polymerase Chain Reaction
RD: Retinal Disease
RETNET: European Retinal Research Training Network
RISC: RNA-induced silencing complex
RMA: Robust Multi-array Analysis
RNA: Ribonucleic acid
SAGE: Seiral Analysis of Gene Expression
SIEGE: Smoke Induced Epithelial Gene Expression
SNP: Single Nucleotide polymorphism
SPR/SPRR: Small Proline-Rich Proteins
SR: Serin/Arginine
UTR: UnTranslated Region
List of contents
Chapter 1. Avant-Propos.............................................................................................1
Introduction..................................................4
Chapter 2. Biology and Bioinformatics......................................5
2.1 Central dogma of molecular biology...5
2.2 Timeline of major events in informatics, molecular biology and bioinformatics6
2.3 Genome sequencing...........................................................................................11
2.4 Interdependency between computational and experimental techniques............12
Chapter 3. Biological Databases...............14
3.1 General introduction to databases......................................................................14
3.1.1 Object Database Management System (ODBMS)......14
3.1.2 Relational Database Management System (RDBMS)15
3.1.1.1 Advantages of relational model...........................................................16
3.2 Biological databases................................17
3.2.1 Specific features of biological databases....................18
3.2.2 Database challenges20
3.3 Classification of biological databases................................................................21
3.4 Important databases for molecular biology........................23
3.4.1 Nucleic acid sequence databases.23
3.4.2 Protein sequence databases: the UniProt databases....24
3.4.3 Protein structure database ...........................................................................25
3.4.4 Pathway database: KEGG database............................27
3.4.5 Gene Ontology database.............28
3.4.5.1 DAVID bioinformatic resources ..........................................................30
3.5. Sequence Retrieval Software (SRS).................................31
3.5.1 Entrez -National Center for Biotechnology Information (NCBI)...............32
3.6 UCSC genome browser for gene localization....................33
3.7 Microarray databases .........................................................................................35
3.7.1 Types of Microarray databases...................................35
3.7.1.1 General repositories.............35
3.7.1.1.1 Gene Expression Omnibus (GEO)................35
3.7.1.1.2 ArrayExpress.................................................................................37
3.7.1.2 Microarray database system.................................................................38
3.7.1.2.1 BioArray Software Environment (BASE)....38
3.7.1.2.2 Mediante........................39
Chapter 4. Transcriptome .........................................................41
4.1 Regulation of gene expression...........................................41
4.1.1 Transcriptional control in eukaryotic cells.................42
4.1.2 Posttranscriptional controls.........................................44
4.1.2.1 Capping................................................................45