RIKEN Center for Integrative Medical Sciences
facilityYokohama, Japan
Research output, citation impact, and the most-cited recent papers from RIKEN Center for Integrative Medical Sciences (Japan). Aggregated across the NobleBlocks index of 300M+ scholarly works.
Top-cited papers from RIKEN Center for Integrative Medical Sciences
The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.
Regulatory T cells engage in the maintenance of immunological self-tolerance by actively suppressing self-reactive lymphocytes. Little is known, however, about the molecular mechanism of their development. Here we show that Foxp3, which encodes a transcription factor that is genetically defective in an autoimmune and inflammatory syndrome in humans and mice, is specifically expressed in naturally arising CD4+ regulatory T cells. Furthermore, retroviral gene transfer of Foxp3 converts naïve T cells toward a regulatory T cell phenotype similar to that of naturally occurring CD4+ regulatory T cells. Thus, Foxp3 is a key regulatory gene for the development of regulatory T cells.
The innate immune system in drosophila and mammals senses the invasion of microorganisms using the family of Toll receptors, stimulation of which initiates a range of host defense mechanisms. In drosophila antimicrobial responses rely on two signaling pathways: the Toll pathway and the IMD pathway. In mammals there are at least 10 members of the Toll-like receptor (TLR) family that recognize specific components conserved among microorganisms. Activation of the TLRs leads not only to the induction of inflammatory responses but also to the development of antigen-specific adaptive immunity. The TLR-induced inflammatory response is dependent on a common signaling pathway that is mediated by the adaptor molecule MyD88. However, there is evidence for additional pathways that mediate TLR ligand-specific biological responses.
Eukaryotic cells make many types of primary and processed RNAs that are found either in specific subcellular compartments or throughout the cells. A complete catalogue of these RNAs is not yet available and their characteristic subcellular localizations are also poorly understood. Because RNA represents the direct output of the genetic information encoded by genomes and a significant proportion of a cell’s regulatory capabilities are focused on its synthesis, processing, transport, modification and translation, the generation of such a catalogue is crucial for understanding genome function. Here we report evidence that three-quarters of the human genome is capable of being transcribed, as well as observations about the range and levels of expression, localization, processing fates, regulatory regions and modifications of almost all currently annotated and thousands of previously unannotated RNAs. These observations, taken together, prompt a redefinition of the concept of a gene. A description is given of the ENCODE effort to provide a complete catalogue of primary and processed RNAs found either in specific subcellular compartments or throughout the cell, revealing that three-quarters of the human genome can be transcribed, and providing a wealth of information on the range and levels of expression, localization, processing fates and modifications of known and previously unannotated RNAs. These authors describe the ENCODE (Encyclopedia of DNA Elements) effort to provide a complete catalogue of primary and processed RNAs found either in specific sub-cellular compartments or throughout the cell. They show that three-quarters of the human genome can be transcribed, and provide a wealth of information about the range and levels of expression, localization, processing fates and modifications of both known and previously unannotated RNAs. Collectively, these observations suggest that the current concept of a gene should be revisited.
The human genome contains many thousands of long noncoding RNAs (lncRNAs). While several studies have demonstrated compelling biological and disease roles for individual examples, analytical and experimental approaches to investigate these genes have been hampered by the lack of comprehensive lncRNA annotation. Here, we present and analyze the most complete human lncRNA annotation to date, produced by the GENCODE consortium within the framework of the ENCODE project and comprising 9277 manually annotated genes producing 14,880 transcripts. Our analyses indicate that lncRNAs are generated through pathways similar to that of protein-coding genes, with similar histone-modification profiles, splicing signals, and exon/intron lengths. In contrast to protein-coding genes, however, lncRNAs display a striking bias toward two-exon transcripts, they are predominantly localized in the chromatin and nucleus, and a fraction appear to be preferentially processed into small RNAs. They are under stronger selective pressure than neutrally evolving sequences-particularly in their promoter regions, which display levels of selection comparable to protein-coding genes. Importantly, about one-third seem to have arisen within the primate lineage. Comprehensive analysis of their expression in multiple human organs and brain regions shows that lncRNAs are generally lower expressed than protein-coding genes, and display more tissue-specific expression patterns, with a large fraction of tissue-specific lncRNAs expressed in the brain. Expression correlation analysis indicates that lncRNAs show particularly striking positive correlation with the expression of antisense coding genes. This GENCODE annotation represents a valuable resource for future studies of lncRNAs.
Abstract Somatic mutations in cancer genomes are caused by multiple mutational processes, each of which generates a characteristic mutational signature 1 . Here, as part of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium 2 of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA), we characterized mutational signatures using 84,729,690 somatic mutations from 4,645 whole-genome and 19,184 exome sequences that encompass most types of cancer. We identified 49 single-base-substitution, 11 doublet-base-substitution, 4 clustered-base-substitution and 17 small insertion-and-deletion signatures. The substantial size of our dataset, compared with previous analyses 3–15 , enabled the discovery of new signatures, the separation of overlapping signatures and the decomposition of signatures into components that may represent associated—but distinct—DNA damage, repair and/or replication mechanisms. By estimating the contribution of each signature to the mutational catalogues of individual cancer genomes, we revealed associations of signatures to exogenous or endogenous exposures, as well as to defective DNA-maintenance processes. However, many signatures are of unknown cause. This analysis provides a systematic perspective on the repertoire of mutational processes that contribute to the development of human cancer.
CD4(+) T regulatory cells (T(regs)), which express the Foxp3 transcription factor, play a critical role in the maintenance of immune homeostasis. Here, we show that in mice, T(regs) were most abundant in the colonic mucosa. The spore-forming component of indigenous intestinal microbiota, particularly clusters IV and XIVa of the genus Clostridium, promoted T(reg) cell accumulation. Colonization of mice by a defined mix of Clostridium strains provided an environment rich in transforming growth factor-β and affected Foxp3(+) T(reg) number and function in the colon. Oral inoculation of Clostridium during the early life of conventionally reared mice resulted in resistance to colitis and systemic immunoglobulin E responses in adult mice, suggesting a new therapeutic approach to autoimmunity and allergy.
Interferons (IFNs) are critical for protection from viral infection, but the pathways linking virus recognition to IFN induction remain poorly understood. Plasmacytoid dendritic cells produce vast amounts of IFN-alpha in response to the wild-type influenza virus. Here, we show that this requires endosomal recognition of influenza genomic RNA and signaling by means of Toll-like receptor 7 (TLR7) and MyD88. Single-stranded RNA (ssRNA) molecules of nonviral origin also induce TLR7-dependent production of inflammatory cytokines. These results identify ssRNA as a ligand for TLR7 and suggest that cells of the innate immune system sense endosomal ssRNA to detect infection by RNA viruses.
Naturally occurring CD4+ regulatory T cells, the majority of which express CD25, are engaged in dominant control of self-reactive T cells, contributing to the maintenance of immunologic self-tolerance. Their depletion or functional alteration leads to the development of autoimmune disease in otherwise normal animals. The majority, if not all, of such CD25+CD4+ regulatory T cells are produced by the normal thymus as a functionally distinct and mature subpopulation of T cells. Their repertoire of antigen specificities is as broad as that of naive T cells, and they are capable of recognizing both self and nonself antigens, thus enabling them to control various immune responses. In addition to antigen recognition, signals through various accessory molecules and via cytokines control their activation, expansion, and survival, and tune their suppressive activity. Furthermore, the generation of CD25+CD4+ regulatory T cells in the immune system is at least in part developmentally and genetically controlled. Genetic defects that primarily affect their development or function can indeed be a primary cause of autoimmune and other inflammatory disorders in humans. Based on recent advances in our understanding of the cellular and molecular basis of this T cell-mediated immune regulation, this review discusses how naturally arising CD25+CD4+ regulatory T cells contribute to the maintenance of immunologic self-tolerance and negative control of various immune responses, and how they can be exploited to prevent and treat autoimmune disease, allergy, cancer, and chronic infection, or establish donor-specific transplantation tolerance.
Abstract Cancer is driven by genetic change, and the advent of massively parallel sequencing has enabled systematic documentation of this variation at the whole-genome scale 1–3 . Here we report the integrative analysis of 2,658 whole-cancer genomes and their matching normal tissues across 38 tumour types from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). We describe the generation of the PCAWG resource, facilitated by international data sharing using compute clouds. On average, cancer genomes contained 4–5 driver mutations when combining coding and non-coding genomic elements; however, in around 5% of cases no drivers were identified, suggesting that cancer driver discovery is not yet complete. Chromothripsis, in which many clustered structural variants arise in a single catastrophic event, is frequently an early event in tumour evolution; in acral melanoma, for example, these events precede most somatic point mutations and affect several cancer-associated genes simultaneously. Cancers with abnormal telomere maintenance often originate from tissues with low replicative activity and show several mechanisms of preventing telomere attrition to critical levels. Common and rare germline variants affect patterns of somatic mutation, including point mutations, structural variants and somatic retrotransposition. A collection of papers from the PCAWG Consortium describes non-coding mutations that drive cancer beyond those in the TERT promoter 4 ; identifies new signatures of mutational processes that cause base substitutions, small insertions and deletions and structural variation 5,6 ; analyses timings and patterns of tumour evolution 7 ; describes the diverse transcriptional consequences of somatic mutation on splicing, expression levels, fusion genes and promoter activity 8,9 ; and evaluates a range of more-specialized features of cancer genomes 8,10–18 .
Stimulation of Toll-like receptors (TLRs) triggers activation of a common MyD88-dependent signaling pathway as well as a MyD88-independent pathway that is unique to TLR3 and TLR4 signaling pathways leading to interferon (IFN)-beta production. Here we disrupted the gene encoding a Toll/IL-1 receptor (TIR) domain-containing adaptor, TRIF. TRIF-deficient mice were defective in both TLR3- and TLR4-mediated expression of IFN-beta and activation of IRF-3. Furthermore, inflammatory cytokine production in response to the TLR4 ligand, but not to other TLR ligands, was severely impaired in TRIF-deficient macrophages. Mice deficient in both MyD88 and TRIF showed complete loss of nuclear factor kappa B activation in response to TLR4 stimulation. These findings demonstrate that TRIF is essential for TLR3- and TLR4-mediated signaling pathways facilitating mammalian antiviral host defense.
Regulated transcription controls the diversity, developmental pathways and spatial organization of the hundreds of cell types that make up a mammal. Using single-molecule cDNA sequencing, we mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body. We find that few genes are truly ‘housekeeping’, whereas many mammalian promoters are composite entities composed of several closely separated TSSs, with independent cell-type-specific expression profiles. TSSs specific to different cell types evolve at different rates, whereas promoters of broadly expressed genes are the most conserved. Promoter-based expression analysis reveals key transcription factors defining cell states and links them to binding-site motifs. The functions of identified novel transcripts can be predicted by coexpression and sample ontology enrichment analyses. The functional annotation of the mammalian genome 5 (FANTOM5) project provides comprehensive expression profiles and functional annotation of mammalian cell-type-specific transcriptomes with wide applications in biomedical research. A study from the FANTOM consortium using single-molecule cDNA sequencing of transcription start sites and their usage in human and mouse primary cells, cell lines and tissues reveals insights into the specificity and diversity of transcription patterns across different mammalian cell types. FANTOM5 (standing for functional annotation of the mammalian genome 5) is the fifth major stage of a major international collaboration that aims to dissect the transcriptional regulatory networks that define every human cell type. Two Articles in this issue of Nature present some of the project's latest results. The first paper uses the FANTOM5 panel of tissue and primary cell samples to define an atlas of active, in vivo bidirectionally transcribed enhancers across the human body. These authors show that bidirectional capped RNAs are a signature feature of active enhancers and identify more than 40,000 enhancer candidates from over 800 human cell and tissue samples. The enhancer atlas is used to compare regulatory programs between different cell types and identify disease-associated regulatory SNPs, and will be a resource for studies on cell-type-specific enhancers. In the second paper, single-molecule sequencing is used to map human and mouse transcription start sites and their usage in a panel of distinct human and mouse primary cells, cell lines and tissues to produce the most comprehensive mammalian gene expression atlas to date. The data provide a plethora of insights into open reading frames and promoters across different cell types in addition to valuable annotation of mammalian cell-type-specific transcriptomes.
Disorders of the brain can exhibit considerable epidemiological comorbidity and often share symptoms, provoking debate about their etiologic overlap. We quantified the genetic sharing of 25 brain disorders from genome-wide association studies of 265,218 patients and 784,643 control participants and assessed their relationship to 17 phenotypes from 1,191,588 individuals. Psychiatric disorders share common variant risk, whereas neurological disorders appear more distinct from one another and from the psychiatric disorders. We also identified significant sharing between disorders and a number of brain phenotypes, including cognitive measures. Further, we conducted simulations to explore how statistical power, diagnostic misclassification, and phenotypic heterogeneity affect genetic correlations. These results highlight the importance of common genetic variation as a risk factor for brain disorders and the value of heritability-based methods in understanding their etiology.
Full-length cDNAs are essential for functional analysis of plant genes in the post-sequencing era of the Arabidopsis genome. Recently, cDNA microarray analysis has been developed for quantitative analysis of global and simultaneous analysis of expression profiles. We have prepared a full-length cDNA microarray containing approximately 7000 independent, full-length cDNA groups to analyse the expression profiles of genes under drought, cold (low temperature) and high-salinity stress conditions over time. The transcripts of 53, 277 and 194 genes increased after cold, drought and high-salinity treatments, respectively, more than fivefold compared with the control genes. We also identified many highly drought-, cold- or high-salinity- stress-inducible genes. However, we observed strong relationships in the expression of these stress-responsive genes based on Venn diagram analysis, and found 22 stress-inducible genes that responded to all three stresses. Several gene groups showing different expression profiles were identified by analysis of their expression patterns during stress-responsive gene induction. The cold-inducible genes were classified into at least two gene groups from their expression profiles. DREB1A was included in a group whose expression peaked at 2 h after cold treatment. Among the drought, cold or high-salinity stress-inducible genes identified, we found 40 transcription factor genes (corresponding to approximately 11% of all stress-inducible genes identified), suggesting that various transcriptional regulatory mechanisms function in the drought, cold or high-salinity stress signal transduction pathways.
Antisense transcription (transcription from the opposite strand to a protein-coding or sense strand) has been ascribed roles in gene regulation involving degradation of the corresponding sense transcripts (RNA interference), as well as gene silencing at the chromatin level. Global transcriptome analysis provides evidence that a large proportion of the genome can produce transcripts from both strands, and that antisense transcripts commonly link neighboring "genes" in complex loci into chains of linked transcriptional units. Expression profiling reveals frequent concordant regulation of sense/antisense pairs. We present experimental evidence that perturbation of an antisense RNA can alter the expression of sense messenger RNAs, suggesting that antisense transcription contributes to control of transcriptional outputs in mammals.
Only a small proportion of the mouse genome is transcribed into mature messenger RNA transcripts. There is an international collaborative effort to identify all full-length mRNA transcripts from the mouse, and to ensure that each is represented in a physical collection of clones. Here we report the manual annotation of 60,770 full-length mouse complementary DNA sequences. These are clustered into 33,409 'transcriptional units', contributing 90.1% of a newly established mouse transcriptome database. Of these transcriptional units, 4,258 are new protein-coding and 11,665 are new non-coding messages, indicating that non-coding RNA is a major component of the transcriptome. 41% of all transcriptional units showed evidence of alternative splicing. In protein-coding transcripts, 79% of splice variations altered the protein product. Whole-transcriptome analyses resulted in the identification of 2,431 sense-antisense pairs. The present work, completely supported by physical clones, provides the most comprehensive survey of a mammalian transcriptome so far, and is a valuable resource for functional genomics.
Lancelets (‘amphioxus’) are the modern survivors of an ancient chordate lineage, with a fossil record dating back to the Cambrian period. Here we describe the structure and gene content of the highly polymorphic ∼520-megabase genome of the Florida lancelet Branchiostoma floridae, and analyse it in the context of chordate evolution. Whole-genome comparisons illuminate the murky relationships among the three chordate groups (tunicates, lancelets and vertebrates), and allow not only reconstruction of the gene complement of the last common chordate ancestor but also partial reconstruction of its genomic organization, as well as a description of two genome-wide duplications and subsequent reorganizations in the vertebrate lineage. These genome-scale events shaped the vertebrate genome and provided additional genetic variation for exploitation during vertebrate evolution. This issue sees the publication of the draft genome sequence of an animal that has been studied by biologists for many years as a model for a primitive chordate. The amphioxus or lancelet is a small worm-like creature, usually to be found buried in sand on the sea floor. Comparative analysis of the genome of the Florida lancelet, Branchiostoma floridae, reveals 17 ancestral chordate linkage groups conserved in the modern amphioxus and vertebrate genomes despite more than half a billion years of independent evolution. From this it possible to make a virtual reconstruction of the 17 chromosomes of the last common chordate ancestor. This reconstruction conforms that two rounds of whole genome duplication have occurred during evolution of the jawed vertebrate lineage. And it illuminates the murky relationships between the three chordate groups, the tunicates, lancelets and vertebrates. The cover shows four adult amphioxus collected in Apalachee Bay, Florida, with anterior towards the top and dorsal to the right. Yellow ovals are gonads. (Photo by Nicholas Putnam, DOE Joint Genome Institute.
In this Review, Akihiko Yoshimura and collegues discuss the most recent advances in our understanding of suppressor of cytokine signalling (SOCS) proteins in the regulation of immunity, their involvement in human diseases and the therapeutic implications of targeting this family of cytokine regulators. Suppressor of cytokine signalling (SOCS) proteins are inhibitors of cytokine signalling pathways. Studies have shown that SOCS proteins are key physiological regulators of both innate and adaptive immunity. These molecules positively and negatively regulate macrophage and dendritic-cell activation and are essential for T-cell development and differentiation. Evidence is also emerging of the involvement of SOCS proteins in diseases of the immune system. In this Review we bring together data from recent studies on SOCS proteins and their role in immunity, and propose a cohesive model of how cytokine signalling regulates immune-cell function.
Bipolar disorder is a heritable mental illness with complex etiology. We performed a genome-wide association study of 41,917 bipolar disorder cases and 371,549 controls of European ancestry, which identified 64 associated genomic loci. Bipolar disorder risk alleles were enriched in genes in synaptic signaling pathways and brain-expressed genes, particularly those with high specificity of expression in neurons of the prefrontal cortex and hippocampus. Significant signal enrichment was found in genes encoding targets of antipsychotics, calcium channel blockers, antiepileptics and anesthetics. Integrating expression quantitative trait locus data implicated 15 genes robustly linked to bipolar disorder via gene expression, encoding druggable targets such as HTR6, MCHR1, DCLK3 and FURIN. Analyses of bipolar disorder subtypes indicated high but imperfect genetic correlation between bipolar disorder type I and II and identified additional associated loci. Together, these results advance our understanding of the biological etiology of bipolar disorder, identify novel therapeutic leads and prioritize genes for functional follow-up studies.
Naturally arising CD25+ CD4+ regulatory T (Treg) cells, most of which are produced by the normal thymus as a functionally mature T-cell subpopulation, play key roles in the maintenance of immunologic self-tolerance and negative control of a variety of physiological and pathological immune responses. Natural Tregs specifically express Foxp3, a transcription factor that plays a critical role in their development and function. Complete depletion of Foxp3-expressing natural Tregs, whether they are CD25+ or CD25-, activates even weak or rare self-reactive T-cell clones, inducing severe and widespread autoimmune/inflammatory diseases. Natural Tregs are highly dependent on exogenously provided interleukin (IL)-2 for their survival in the periphery. In addition to Foxp3 and IL-2/IL-2 receptor, deficiency or functional alteration of other molecules, expressed by T cells or non-T cells, may affect the development/function of Tregs or self-reactive T cells, or both, and consequently tip the peripheral balance between the two populations toward autoimmunity. Elucidation of the molecular and cellular basis of this Treg-mediated active maintenance of self-tolerance will facilitate both our understanding of the pathogenetic mechanism of autoimmune disease and the development of novel methods of autoimmune disease prevention and treatment via enhancing and re-establishing Treg-mediated dominant control over self-reactive T cells.