-

Inocras Unveils Cancer Foundation Model Trained on Thousands of Whole Genomes

New AI model introduces learnable tokenization and genome-wide feature aggregation, advancing the use of whole genome data in precision oncology.

SAN DIEGO--(BUSINESS WIRE)--Inocras, a bioinformatics-led company harnessing the power of whole genome data and proprietary analytics to deliver curated insights that advance precision health, today announced a groundbreaking cancer foundation model trained on 2,882 whole genomes across diverse cancer types, marking a major advance in applying AI to precision oncology. Developed in collaboration with the Korea Advanced Institute of Science and Technology (KAIST), the model couples a novel learnable tokenization method with a cancer-aware embedding framework that integrates diverse genomic features into patient-level representations, capturing the molecular patterns that drive tumor biology and clinical outcomes.

At the core of this effort is DNAChunker, a proprietary model that tokenizes genomic sequences adaptively through an H-Net–based hierarchical dynamic chunking architecture. Unlike conventional methods that process DNA in fixed units, DNAChunker focuses on regions of high biological signal and compresses low-information regions - optimizing both precision and computational efficiency.

The result is a state-of-the-art, biologically informed model that achieves higher accuracy with fewer parameters and lower computational cost. In benchmark evaluations across genomic representation learning and functional prediction tasks, the model achieved state-of-the-art performance, outperforming leading DNA foundation models such as Nucleotide Transformer and DNABERT-2. When tested for clinical relevance by aggregating mutation-level embeddings generated by DNAChunker across nearly 3,000 cancer genomes, it reached 98% accuracy for HRD prediction and 84% for PAM50 subtype classification from DNA data alone on independent datasets - establishing one of the first direct links between a DNA-based foundation model and clinically relevant cancer biology.

“This represents a pivotal leap toward clinically interpretable, AI-native cancer genomics,” said Jehee Suh, CEO of Inocras. “We have moved from sequencing genomes to understanding them. That’s the inflection point AI brings to oncology. This is where data becomes diagnosis, and where genome insights start to truly shape patient care.”

Inocras’s research will be featured next week at the EMBL Cancer Genomics Conference in Heidelberg, Germany. Young Seok Ju, Ph.D., Co-founder of Inocras and one of this year’s organizers, will present “A Cancer Foundation Model from 1,364 Breast Cancer Whole Genomes for Patient Stratification” on November 11 at 16:15 CET, highlighting findings from a 1,364-patient cohort and how the DNAChunker-powered Genomic Foundation Model enables AI-driven patient stratification and molecular subtype discovery. The full conference program is available here.

The Inocras cancer foundation model marks a major step toward embedding-based patient stratification from genomic data, strengthening the company’s foundation for next-generation precision diagnostics.

About Inocras

Inocras is a bioinformatics-led company redefining precision health through whole genome data and proprietary analytics. Our oncology and rare disease platforms integrate comprehensive whole genome data with advanced automation to deliver curated and actionable insights at scale that accelerate discovery and diagnostics to improve patient care, bringing a real-world impact. Inocras operates a CLIA/CAP-certified laboratory and partners with leading hospitals, pharmaceutical companies, and research institutions worldwide. For more information, please visit inocras.com and follow the Inocras LinkedIn page.

Contacts

Media Contact:
Kristen White
Oak Street Communications
kristen@oakstreetcommunications.com

Inocras


Release Versions

Contacts

Media Contact:
Kristen White
Oak Street Communications
kristen@oakstreetcommunications.com

Social Media Profiles
More News From Inocras

Inocras, Samsung Medical Center and Seoul St. Mary’s Hospital Researchers Publish Groundbreaking Study with 1,364 Whole-Genomes of Breast Cancer in Nature

SAN DIEGO & SEOUL, South Korea--(BUSINESS WIRE)--A research consortium led by Samsung Medical Center, Seoul St. Mary’s Hospital and bioinformatics partner Inocras Inc. has published the most comprehensive whole-genome breast cancer study of its kind in Nature. Using Inocras’s proprietary whole-genome analysis platform CancerVision™, the team analyzed 1,364 breast cancer genomes, together with transcriptomic data and matched clinical records. With more than 10.9 million somatic mutations mapped,...

STREAM Consortium and Inocras Unveil Landmark National Pediatric Cancer WGS Program, funded by the Lee Kun-hee Child Cancer and Rare Disease Project

SAN DIEGO & BOSTON--(BUSINESS WIRE)--Inocras, a bioinformatics-led company harnessing the power of whole genome data and proprietary analytics to deliver curated insights that advance precision health, in collaboration with the Lee Kun-hee Child Cancer and Rare Disease Project, today announced results from the STREAM Program (Strategic TREatment And Magic for pediatric cancers). The project, established through a donation from the family of the late Samsung Chairman Lee Kun-hee, is the first na...

Inocras Expands WGS-Based Cancer Precision Diagnostics CancerVision™ to 10 Hospitals

SAN DIEGO--(BUSINESS WIRE)--Today, Inocras, a leader in whole genome sequencing (WGS) and bioinformatics, operating a CLIA/CAP-certified laboratory in San Diego, announced that its whole genome sequencing (WGS)-based cancer diagnostics platform, CancerVision, has been officially listed and is available for real-world patient care at 10 major hospitals across South Korea. The hospitals that have formally listed CancerVision for patient care include leading academic medical centers, university ho...
Back to Newsroom