
Researchers generated the first complete, gapless sequence of a human genome
On Mar. 31, 2022, researchers announced they have created a complete, gap-free sequence of the roughly 3 billion bases (or “letters”) in our DNA is critical for understanding the full spectrum of human genomic variation and for understanding the genetic contributions to certain diseases.
The work was done by the Telomere to Telomere (T2T) consortium, which included leadership from researchers at the National Human Genome Research Institute (NHGRI), part of the National Institutes of Health; University of California, Santa Cruz; and University of Washington, Seattle. NHGRI was the primary funder for the study.
Analyses of the complete genome sequence will significantly add to our knowledge of chromosomes, including more accurate maps for five chromosome arms, which opens new lines of research. This helps answer basic biology questions about how chromosomes properly segregate and divide. The T2T consortium used the now-complete genome sequence as a reference to discover more than 2 million additional variants in the human genome. These studies provide more accurate information about the genomic variants within 622 medically relevant genes.
The now-complete human genome sequence will be particularly valuable for studies that aim to establish comprehensive views of human genomic variation, or how people’s DNA differs. Such insights are vital for understanding the genetic contributions to certain diseases and for using genome sequence as a routine part of clinical care in the future. Many research groups have already started using a pre-release version of the complete human genome sequence for their research.
The full sequencing builds upon the work of the Human Genome Project, which mapped about 92% of the genome, and research undertaken since then. Thousands of researchers have developed better laboratory tools, computational methods and strategic approaches to decipher the complex sequence. Six papers encompassing the completed sequence appear in Science, along with companion papers in several other journals.
That last 8% includes numerous genes and repetitive DNA and is comparable in size to an entire chromosome. Researchers generated the complete genome sequence using a human cell line with only one copy of each chromosome, unlike most human cells, which carry two copies of each chromosome. The researchers noted that most of the newly added DNA sequences were near the repetitive telomeres (long, trailing ends of each chromosome) and centromeres (dense middle sections of each chromosome).
Over the past decade, two new DNA sequencing technologies emerged that produced much longer sequence reads. The Oxford Nanopore DNA sequencing method can read up to 1 million DNA letters in a single read with modest accuracy, while the PacBio HiFi DNA sequencing method can read about 20,000 letters with nearly perfect accuracy. Researchers in the T2T consortium used both DNA sequencing methods to generate the complete human genome sequence.
Tags:
Source: National Human Genome Research Institute
Credit:
