Bionano Genomics: Disrupting Traditional Karyotyping Using Optical Mapping And Cytogenetics ( Part 1)

Seeking Alpha Analyst Since 2020
Summary
- Saphyr: Ultra-sensitive and ultra-specific structural variation detection enabling researchers and clinicians to accelerate the search for new diagnostics and therapeutic targets and to streamline the study of changes.
- Structural variation refers to large-scale structural differences in the genomic DNA of one individual compared to another.
- 138 installed as of Dec 2019. Market opportunity to install: 9500 such systems. 2500 cytogenetics labs across the world.
- Wide medical usage including rare and undiagnosed pediatric diseases, muscular diseases,developmental delays and disorders, prostate cancer and leukemia.
- Saphyr’s performance characteristics include up to 99% sensitivity, less than 2% false positive rates and accurate diagnosis with allele fraction, or the percentage ofsample exhibiting the variant, as low as between 3-5%.
Genome analysis markets for products and services has been growing and expected to reach $23.9 by 2022. There are two major areas:
- Sequencing for Discovery Research. In discovery research across patient cohorts, sequencing is primarily used to find single nucleotide variations responsible for disease or therapeutic response. Sequencing alone, however, is significantly limited due to its inability to reveal structural variations. Our Saphyr system has been expanding this market segment by complementing sequencing to expand the scope of genome variation that can be analyzed in a study and achieve a more comprehensive view of the genome.
- Cytogenetics. To provide a clinical diagnosis, cytogenetic tests detect known variations that are linked to specific diseases or therapeutic responses. The technologies used for detecting structural variations are expensive and involve cumbersome workflows with relatively limited ability to scale to higher volumes or more complex testing panels. Sequencers tend not to be used for cytogenetics due to their inability to reliably detect structural variations. Cytogenetics laboratories are beginning to adopt Saphyr as a more effective and efficient approach to finding the structural variations relevant to cytogenetics. For this segment, Saphyr is used alone to provide comprehensive detection of structural variations and enable diagnostic calls without the need for any sequencing or cytogenetic technology.
Bionano Market Opportunity
- Disrupt the existing karyotyping, fishing and other antiquated technologies in the 2500 labs across the world today.
- With an increase in adoption of such labs and interest, the number of labs should continue to increase
In addition to the instrument sales opportunity, Saphyr instruments generate recurring revenue from chip consumables that are used on a per-sample basis. We believe each Saphyr instrument has the potential to create recurring revenue in a range of approximately $60,000 to approximately $150,000 per year, suggesting a potential annual recurring revenue opportunity of approximately $0.6 billion to approximately $1.4 billion
Existing Technologies and Their Limitations
Existing technologies fail to adequately address the need for structural variation detection because they do not overcome the inherent complexity of the genome or they are not capable of providing a cost-effective, scalable solution to meet the increasing demands of genomics research and clinical applications
The Limitations of Sequencing
As the first complete draft of the human genome was being assembled in 2000, the belief arose that most questions in genome analysis could be addressed by sequencing. Over the course of over 15 years, sequencing proliferated across the entire genome analysis community with Illumina becoming the clear sequencing industry leader. As more sequencing data emerged, it became apparent that sequencing alone would not adequately elucidate the causes of human disease.
The promise of sequencing was not fully delivered due to sequencing’s inability to address the complexities of genome composition. Nearly all genome sequencing, including next-generation sequencing, uses a method called sequencing by synthesis. Sequencing by synthesis is an in-vitro process for synthesizing a copy of DNA, one base at a time in a way that makes it possible to measure the identity of each base as it is incorporated into the growing DNA copy. Sequencing by synthesis involves cutting genomic DNA into small pieces of a few hundred base pairs each, amplifying these pieces many times and anchoring them to a solid support where they are copied by a polymerase using fluorescently labeled bases.
These copies are called sequencing reads. Illumina, which is the world leader in next-generation sequencing technology, markets systems that provide average read lengths that are 100 to 300 base pairs long. These short reads are matched by computer programs to a reference genome in a process called alignment. The reference is meant to represent the “standard” human genome in a normal, non-diseased state. It is the result of billions of dollars spent on the Human Genome Project and other initiatives begun in the late 1990s and early 2000s to put together the first complete set of human DNA code. When a patient’s genome is sequenced today, the short reads are compared against the reference as a template. Using this approach, sequencing attempts to reconstruct, or “resequence,” the genome and infer genome variations
The read lengths typical for next-generation sequencing are often too short to determine the right location and orientation of a reading frame in the genome because many of the reads from one chromosome are identical to reads from either another chromosome or even another location on the same chromosome.
When reads are indistinguishable from one another, computations cannot be performed to place the reads in the correct location in the genome. The other significant limitation with next-generation sequencing is that the genome fragments used as templates in the copying process are also very short. This fragmentation is a result of the methods used for DNA isolation from the cell and the use of polymerase chain reaction, or PCR. These short lengths disconnect and destroy most of the structural information of the original genome and make next-generation sequencing unable to reliably detect genomic variations larger than a few hundred base pairs. If the sequencing reads were accurate, on the order of hundreds of thousands of base pairs long and from templates that were even longer, they would overcome the redundancy of genome composition and every read would have a unique position in the genome.
It would then be possible to assemble a structurally accurate picture of the genome. Accurate structural variation would be revealed upon comparing structurally accurate assemblies of genomes across a population to determine the structural changes that are driving the observed pathology or physiology. The recognition of the need for greater lengths of sequence reads to determine genome structure, birthed the so-called long-read sequencing submarket. Because of the need for long-read sequencing, Pacific Biosciences of California developed a system that uses another alternative form of sequencing by synthesis, while Oxford Nanopore Technologies developed a system that uses nanopore technology. These systems provide users with average read lengths in the tens of thousands of base pairs. However, these read lengths have proven not to be long enough to reliably and comprehensively detect structural variations.
Pacific Biosciences’ polymerases cannot regularly produce reads that are the necessary hundreds of thousands of base pairs in length. In addition, Oxford Nanopore’s system has difficulty reliably feeding molecules that are, on average, hundreds of thousands of base pairs in length through each nanopore. The time and cost of providing a comprehensive whole genome analysis of a patient in a clinical setting is prohibitive when using these longer-read technologies
Limitations of current technologies
- Karyotyping Karyotyping is the gross optical examination of the chromosomes using a microscope. It is a laboratory technique, modernized in the 1960s, whereby the chromosomes from one cell are stained and visualized by a pathologist or technician to investigate the total number and structure of chromosomes. Karyotyping has many limitations. It is cell culture dependent and therefore requires live and actively dividing cells. Karyotyping has extremely low resolution and is therefore only sensitive for very large structural variations that are unambiguous to identify. Given that chromosomes are being directly viewed on a slide by a pathologist with a microscope, resolution tends to be limited to structural events that are larger than five million base pairs. When karyotyping is used to diagnose unknown genetic disease, only about 5% of karyotyping tests result in a confirmed pathogenic finding. The test is costly, and its results are subject to each pathologist’s interpretation which introduces variability in diagnostic calls and makes the methodology not amenable to automation.
- FISH FISH is a molecular cytogenetic technique that is used to detect chromosomal aberrations. FISH is based on the idea of using a specifically developed probe to detect a particular gene abnormality that is suspected to be in the genome. When the probe finds targeted variation, it binds to it and generates a fluorescent signal which is observed with a fluorescence microscope. Several characteristics of FISH limit its productivity and efficiency for use in structural variation detection. Like karyotyping, it is cell culture dependent and therefore requires live and actively dividing cells. Also, FISH is limited to known targets and cannot be used for discovery. Every FISH test performed needs to be chosen to look for a specific genetic marker that the clinician anticipates may be found based on the clinical symptoms of the patient. In addition, the test results can be ambiguous and inconclusive, and reproducibility
- Microarrays Chromosomal microarrays and SNP (single nucleotide polymorphism) arrays are tests consisting of slides that contain thousands of spots of DNA fragments that bind to the DNA of the sample. Microarrays detect gains and losses of specifically chosen DNA sequence and can also infer gene expression levels. Microarrays interrogate thousands of genes simultaneously that are known to be associated with presumed genetic disorders of interest to the user. Probe coverage is typically highly focused in regions of known clinical significance. Microarrays have limited utility as a diagnostic tool as they are only useful when there are gains and losses of base pairs within the sample’s genome that are specific to the probes that are populated on the array. Microarrays are also limited in their ability to provide specific locations of gains or losses on a chromosome, or even identify on which chromosome that the gains or losses occur. In addition, microarrays have low resolution as they cannot reliably detect structural variants smaller than 50,000 base pairs. Also, the diagnostic yield of microarrays is low. Only an estimated 20% of microarray tests provide a confirmed pathogenic finding when used to diagnose unknown genetic disease
Analyst's Disclosure: I am/we are long BNGO.
Information has been sourced from the 10-k filing of BNGO Dec 2019
Seeking Alpha's Disclosure: Past performance is no guarantee of future results. No recommendation or advice is being given as to whether any investment is suitable for a particular investor. Any views or opinions expressed above may not reflect those of Seeking Alpha as a whole. Seeking Alpha is not a licensed securities dealer, broker or US investment adviser or investment bank. Our analysts are third party authors that include both professional investors and individual investors who may not be licensed or certified by any institute or regulatory body.