• LOGIN
    Login with username and password
Repository logo

BORIS Portal

Bern Open Repository and Information System

  • Publications
  • Theses
  • Research Data
  • Projects
  • Organizations
  • Researchers
  • More
  • Collections
  • Statistics
  • LOGIN
    Login with username and password
Repository logo
Unibern.ch
  1. Home
  2. Publications
  3. GENCODE: massively expanding the lncRNA catalog through capture long-read RNA sequencing.
 

GENCODE: massively expanding the lncRNA catalog through capture long-read RNA sequencing.

Options
  • Details
  • Files
BORIS DOI
10.48620/76980
Publisher DOI
10.1101/2024.10.29.620654
PubMed ID
39554180
Description
Accurate and complete gene annotations are indispensable for understanding how genome sequences encode biological functions. For twenty years, the GENCODE consortium has developed reference annotations for the human and mouse genomes, becoming a foundation for biomedical and genomics communities worldwide. Nevertheless, collections of important yet poorly-understood gene classes like long non-coding RNAs (lncRNAs) remain incomplete and scattered across multiple, uncoordinated catalogs, slowing down progress in the field. To address these issues, GENCODE has undertaken the most comprehensive lncRNAs annotation effort to date. This is founded on the manual annotation of full-length targeted long-read sequencing, on matched embryonic and adult tissues, of orthologous regions in human and mouse. Altogether 17,931 novel human genes (140,268 novel transcripts) and 22,784 novel mouse genes (136,169 novel transcripts) have been added to the GENCODE catalog representing a 2-fold and 6-fold increase in transcripts, respectively - the greatest increase since the sequencing of the human genome. Novel gene annotations display evolutionary constraints, have well-formed promoter regions, and link to phenotype-associated genetic variants. They greatly enhance the functional interpretability of the human genome, as they help explain millions of previously-mapped "orphan" omics measurements corresponding to transcription start sites, chromatin modifications and transcription factor binding sites. Crucially, our targeted design assigned human-mouse orthologs at a rate beyond previous studies, tripling the number of human disease-associated lncRNAs with mouse orthologs. The expanded and enhanced GENCODE lncRNA annotations mark a critical step towards deciphering the human and mouse genomes.
Date of Publication
2024-10-31
Publication Type
Working Paper
Language(s)
en
Contributor(s)
Kaur, Gazaldeep
Perteghella, Tamara
Carbonell-Sala, Sílvia
Gonzalez-Martinez, Jose
Hunt, Toby
Mądry, Tomasz
Jungreis, Irwin
Arnan, Carme
Lagarde, Julien
Borsari, Beatrice
Sisu, Cristina
Jiang, Yunzhe
Bennett, Ruth
Berry, Andrew
Cerdán-Vélez, Daniel
Cochran, Kelly
Vara, Covadonga
Davidson, Claire
Donaldson, Sarah
Dursun, Cagatay
González-López, Silvia
Gopal Das, Sasti
Hardy, Matthew
Hollis, Zoe
Kay, Mike
Montañés, José Carlos
Ni, Pengyu
Nurtdinov, Ramil
Palumbo, Emilio
Pulido-Quetglas, Carlos
Clinic of Medical Oncology
Suner, Marie-Marthe
Yu, Xuezhu
Zhang, Dingyao
Loveland, Jane E
Albà, M Mar
Diekhans, Mark
Tanzer, Andrea
Mudge, Jonathan M
Flicek, Paul
Martin, Fergal J
Gerstein, Mark
Kellis, Manolis
Kundaje, Anshul
Paten, Benedict
Tress, Michael L
Johnson, Rory Baldwin
Clinic of Medical Oncology
Uszczynska-Ratajczak, Barbara
Frankish, Adam
Guigó, Roderic
Additional Credits
Clinic of Medical Oncology
Publisher
Cold Spring Harbor Laboratory
ISSN
2692-8205
Access(Rights)
open.access
Show full item
BORIS Portal
Bern Open Repository and Information System
Build: dd892c [ 9.04. 8:30]
Explore
  • Projects
  • Funding
  • Publications
  • Research Data
  • Organizations
  • Researchers
  • Audiovisual Material
  • Software & other digital items
  • Events
More
  • About BORIS Portal
  • Send Feedback
  • Cookie settings
  • Service Policy
Follow us on
  • Mastodon
  • YouTube
  • LinkedIn
UniBe logo