Massive genome havoc in breast cancer is revealed

Long-read sequencing enabled the team to reconstruct in great detail the history of how the HER2 gene gets massively amplified in HER2-positive breast cancer cells, says Dr. Schatz. Top rectangle shows a 2 million base-pair segment of chromosome 17 occupied by the HER2 gene (also called ERBB2). A small segment of the gene, already massively amplified, breaks off and fuses with chromosome 8 (lower rectangle). On that chromosome, parts of the gene are copied as many as 1000 times, with various segments jumping around within the chromosome (green arcs). This shows why we want to identify HER2-positive patients as early as possible, to prevent the kind of chaos that we register here cumulatively, says Schatz. Credit: Schatz Lab, CSHL/JHU

In cancer cells, genetic errors wreak havoc. Misspelled genes, as well as structural variations—larger-scale rearrangements of DNA that can encompass large chunks of chromosomes—disturb carefully balanced mechanisms that have evolved to regulate cell growth. Genes that are normally silent are massively activated and mutant proteins are formed. These and other disruptions cause a plethora of problems that cause cells to grow without restraint, cancer's most infamous hallmark.

This week, scientists at Cold Spring Harbor Laboratory (CSHL) have published in Genome Research one of the most detailed maps ever made of structural variations in a cancer cell's . The map reveals about 20,000 structural variations, few of which have ever been noted due to technological limitations in a long-popular method of .

The team, led by sequencing experts Michael C. Schatz and W. Richard McCombie, read genomes of the cancer cells with so-called long-read sequencing technology. This technology reads much lengthier segments of DNA than older short-read technology. When the results are interpreted with two sophisticated software packages recently published by the team, two advantages are evident: long-read sequencing is richer in terms of both information and context. It can, for instance, make better sense of repetitive stretches of DNA letters—which pervade the genome—in part by seeing them within a physically larger context.

The team demonstrated the power of long-read technology by using it to read the genomes of cells derived from a cell line called SK-BR-3, an important model for with variations in a gene called HER2 (sometimes also called ERBB2). About 20% of breast cancers are "HER2-positive," meaning they overproduce the HER2 protein. These cancers tend to be among the most aggressive.

"Most of the 20,000 variants we identified in this cell line were missed by short-read sequencing," says Maria Nattestad, Ph.D., who performed the work with colleagues while still a member of the Schatz lab at CSHL and Johns Hopkins University. "Of particular interest, we found a highly complex set of DNA variations surrounding the HER2 gene."

In their analysis, the team combined the results of long-read sequencing with results of another kind of experiment that reads the messages, or transcripts, that are being generated by activated genes. This fuller picture yielded an extraordinarily detailed account of how structural variations disrupt the genome in cancer and sheds light on how rapidly evolve.

Schatz, Adjunct Associate Professor at CSHL and Bloomberg Distinguished Associate Professor at Johns Hopkins University, and McCombie, a CSHL Professor, say it is "essential to continue building a catalog of variant cell types using the best available technologies. Long-read sequencing is an invaluable tool to capture the complexity of structural variations, so we expect its widespread adoption for use in research and clinical practice, especially as sequencing costs further decline."

More information: Nattestad M et al, (2018) "Complex rearrangements and oncogene amplifications revealed by long-read DNA and RNA sequencing of a breast cancer cell line" Genome Research.

Provided by: Cold Spring Harbor Laboratory