High-throughput annotation of full-length long noncoding RNAs with capture long-read sequencing.

Lagarde, Julien; Uszczynska-Ratajczak, Barbara; Carbonell, Silvia; Pérez-Lluch, Sílvia; Abad, Amaya; Davis, Carrie; Gingeras, Thomas R; Frankish, Adam; Harrow, Jennifer; Guigo, Roderic; Johnson, Rory Baldwin

doi:10.7892/boris.116920

High-throughput annotation of full-length long noncoding RNAs with capture long-read sequencing.

BORIS DOI

10.7892/boris.116920

Date of Publication

December 2017

Publication Type

Article

Division/Institute

Universitätsklinik fü...

Contributor

Lagarde, Julien
Uszczynska-Ratajczak, Barbara
Carbonell, Silvia
Pérez-Lluch, Sílvia
Abad, Amaya
Davis, Carrie
Gingeras, Thomas R
Frankish, Adam
Harrow, Jennifer
Guigo, Roderic
Johnson, Rory Baldwin	Universitätsklinik für Medizinische Onkologie

Subject(s)

600 - Technology::610...

Series

Nature genetics

ISSN or ISBN (if monograph)

1061-4036

Publisher

Nature America

Language

English

Publisher DOI

10.1038/ng.3988

PubMed ID

29106417

Description

Accurate annotation of genes and their transcripts is a foundation of genomics, but currently no annotation technique combines throughput and accuracy. As a result, reference gene collections remain incomplete-many gene models are fragmentary, and thousands more remain uncataloged, particularly for long noncoding RNAs (lncRNAs). To accelerate lncRNA annotation, the GENCODE consortium has developed RNA Capture Long Seq (CLS), which combines targeted RNA capture with third-generation long-read sequencing. Here we present an experimental reannotation of the GENCODE intergenic lncRNA populations in matched human and mouse tissues that resulted in novel transcript models for 3,574 and 561 gene loci, respectively. CLS approximately doubled the annotated complexity of targeted loci, outperforming existing short-read techniques. Full-length transcript models produced by CLS enabled us to definitively characterize the genomic features of lncRNAs, including promoter and gene structure, and protein-coding potential. Thus, CLS removes a long-standing bottleneck in transcriptome annotation and generates manual-quality full-length transcript models at high-throughput scales.

Handle

https://boris-portal.unibe.ch/handle/20.500.12422/162258

Show full item

File(s)

File	File Type	Format	Size	License	Publisher/Copright statement	Content
105064.full.pdf	text	Adobe PDF	5.74 MB	Attribution (CC BY 4.0)		submitted	Open
ng.3988.pdf	text	Adobe PDF	7.33 MB	publisher		published	restricted

High-throughput annotation of full-length long noncoding RNAs with capture long-read sequencing.

Options