PHILADELPHIA—A new genetic clue discovered by a team co-led by a researcher at the Perelman School of Medicine at the University of Pennsylvania is shedding light on the functions of the mysterious “long non-coding RNAs” (lncRNAs). These molecules are transcribed from genes and are often abundant in cells, yet they do not code for proteins. Their functions have been almost entirely unknown—and in recent years have attracted much research and debate.
Reporting in the journal Molecular Cell, the scientists determined that one prominent lncRNA may be a “red herring,” with no evident biological role to play—whereas the DNA from which it originates does perform an important function, as an “enhancer” that stimulates the expression of an important protein-coding gene nearby.
“An implication of this finding is that many lncRNA-producing regions of DNA may not function through their lncRNA products at all, but only through the DNA itself,” Vikram R. Paralkar, MD, an instructor in the division of Hematology/Oncology at Penn’s Perelman School of Medicine, and first author of the study, said.
The discovery underscores the importance of non-protein-coding DNA in biology, and the importance of exploring its functions. Although a popular view still has it that DNA is transcribed into RNA, and the RNA is usually translated into protein, scientists recently have learned that protein-coding DNA makes up less than two percent of the mammalian genome—and is vastly outnumbered by non-protein-coding DNA.
Some non-protein-coding genes produce small RNA molecules, and many of these have known functions, often in regulating other genes or RNAs. But thousands of our genes produce lncRNAs—defined as being at least 200 nucleotides in length—and their functions remain poorly understood.
In a study published in 2014, Paralkar and colleagues identified over a thousand distinct lncRNAs in mouse and human blood cells. Most had never been described before. But many turned out to originate from areas of the genome containing known or suspected enhancers—small regions of DNA where transcription factor molecules gather, to stimulate the transcription of nearby genes.
In the new study, Paralkar and colleagues, including senior author Mitchell J. Weiss, MD, of St. Jude Children’s Research Hospital in Memphis, and researchers from The Children's Hospital of Philadelphia, examined the possible enhancer function associated with one of these mystery lncRNAs, a product of a mouse gene called Lockd.
The Lockd RNA product is particularly abundant in mouse red blood cells and some other cell types. One possibility is that this lncRNA has some undiscovered function in cells. Another possibility is that the Lockd DNA itself has some function, while the RNA transcript is without function—a genomic “red herring.”
Lockd’s transcription-starting “promoter” region contains binding sites for multiple transcription factors, and it lies immediately downstream, on the mouse genome, to Cdkn1b, a gene whose protein product plays a key role in regulating cell division.
To investigate the functions of Lockd, Paralkar and colleagues used an advanced gene-editing technique to delete the Lockd DNA from a mouse blood cell line. “When we did this, the expression of Cdkn1b was reduced by 70 percent,” Paralkar said.
Next, the researchers used a different technique to block the transcription of Lockd RNA while leaving the Lockd DNA intact. The Cdkn1b expression was not affected. “In other words, getting rid of the RNA transcript doesn’t make a difference, but getting rid of the DNA does make a difference,” Paralkar said.
The researchers found strong evidence that in the twisted, looping, double-helix structure of the genome, the promoter end of Lockd DNA comes into direct physical contact with the promoter end of its neighbor Cdkn1b, and in that way acts as an enhancer to stimulate Cdkn1b’s transcription.
Paralkar acknowledged that the Lockd RNA may one day be found to have some other function. “It’s impossible to prove absolutely that it has no function—but it seems at least that it has no obvious function in regulating its neighbor Cdkn1b,” he said. He emphasized, however, that in determining the function of non-coding DNA and RNA, both DNA-deletion and RNA-blocking experiments—as in this study—are needed to distinguish the function of DNA from its RNA product.
“One has to decouple the transcript from the DNA,” Paralkar said. “Future studies of lncRNA function should adhere to that requirement.”
He added that the discovery of this enhancer function for one example of a lncRNA gene points to the possibility that this is a broadly used mechanism in the genome, found in non-coding and perhaps even some protein-coding genes. Indeed, enhancers are theorized to be one of the key genomic features that distinguishes species such as mice and humans—which share nearly all their protein-coding genes, but relatively few of their enhancers and lncRNA-coding genes.
“The fact that mice and humans are so different may be due largely to the fact that their genes are being regulated so differently by enhancers, some of which produce RNA molecules that we detect as lncRNAs,” Paralkar said.
Other co-authors of the study were Cristian C. Taborda, Peng Huang, Rishi Prasad, and Gerd A. Blobel of the Children’s Hospital of Philadelphia; Yu Yao of St. Jude Children’s Research Hospital; Andrew V. Kossenkov of the Wistar Institute; Jing Luan of Penn Medicine; James O. J. Davies and Jim R. Hughes of Oxford University; and Ross C. Hardison of Pennsylvania State University.
Funding was provided by the National Institute of Diabetes and Digestive and Kidney Diseases (K08 1K08DK102533-01A1, R01 DK092318, R56 DK065806), the Wellcome Trust (098931/Z/12/Z, 106130/Z/14/Z) and the UK Medical Research Council (4050189188).
Penn Medicine is one of the world’s leading academic medical centers, dedicated to the related missions of medical education, biomedical research, excellence in patient care, and community service. The organization consists of the University of Pennsylvania Health System and Penn’s Raymond and Ruth Perelman School of Medicine, founded in 1765 as the nation’s first medical school.
The Perelman School of Medicine is consistently among the nation's top recipients of funding from the National Institutes of Health, with $550 million awarded in the 2022 fiscal year. Home to a proud history of “firsts” in medicine, Penn Medicine teams have pioneered discoveries and innovations that have shaped modern medicine, including recent breakthroughs such as CAR T cell therapy for cancer and the mRNA technology used in COVID-19 vaccines.
The University of Pennsylvania Health System’s patient care facilities stretch from the Susquehanna River in Pennsylvania to the New Jersey shore. These include the Hospital of the University of Pennsylvania, Penn Presbyterian Medical Center, Chester County Hospital, Lancaster General Health, Penn Medicine Princeton Health, and Pennsylvania Hospital—the nation’s first hospital, founded in 1751. Additional facilities and enterprises include Good Shepherd Penn Partners, Penn Medicine at Home, Lancaster Behavioral Health Hospital, and Princeton House Behavioral Health, among others.
Penn Medicine is an $11.1 billion enterprise powered by more than 49,000 talented faculty and staff.