Integrative Analysis of Gene Expression Data by RNA Sequencing for Differential Diagnosis of Acute Leukemia: Potential Application of Machine Learning

Jaewoong Lee, Sungmin Cho, Seong Eui Hong, Dain Kang, Hayoung Choi, Jong Mi Lee, Jae Ho Yoon, Byung Sik Cho, Seok Lee, Hee Je Kim, Myungshin Kim, Yonggoo Kim

Research output: Contribution to journalArticlepeer-review

13 Scopus citations

Abstract

BCR-ABL1–positive acute leukemia can be classified into three disease categories: B-lymphoblastic leukemia (B-ALL), acute myeloid leukemia (AML), and mixed-phenotype acute leukemia (MPAL). We conducted an integrative analysis of RNA sequencing (RNA-seq) data obtained from 12 BCR-ABL1–positive B-ALL, AML, and MPAL samples to evaluate its diagnostic utility. RNA-seq facilitated the identification of all p190 BCR-ABL1 with accurate splicing sites and a new gene fusion involving MAP2K2. Most of the clinically significant mutations were also identified including single-nucleotide variations, insertions, and deletions. In addition, RNA-seq yielded differential gene expression profile according to the disease category. Therefore, we selected 368 genes differentially expressed between AML and B-ALL and developed two differential diagnosis models based on the gene expression data using 1) scoring algorithm and 2) machine learning. Both models showed an excellent diagnostic accuracy not only for our 12 BCR-ABL1–positive cases but also for 427 public gene expression datasets from acute leukemias regardless of specific genetic aberration. This is the first trial to develop models of differential diagnosis using RNA-seq, especially to evaluate the potential role of machine learning in identifying the disease category of acute leukemia. The integrative analysis of gene expression data by RNA-seq facilitates the accurate differential diagnosis of acute leukemia with successful detection of significant gene fusion and/or mutations, which warrants further investigation.

Original languageEnglish
Article number717616
JournalFrontiers in Oncology
Volume11
DOIs
StatePublished - 23 Aug 2021

Bibliographical note

Publisher Copyright:
© Copyright © 2021 Lee, Cho, Hong, Kang, Choi, Lee, Yoon, Cho, Lee, Kim, Kim and Kim.

Keywords

  • acute leukemia
  • BCR-ABL1
  • expression
  • gene fusion
  • machine learning
  • mixed-phenotype acute leukemia
  • mutation
  • RNA sequencing

Fingerprint

Dive into the research topics of 'Integrative Analysis of Gene Expression Data by RNA Sequencing for Differential Diagnosis of Acute Leukemia: Potential Application of Machine Learning'. Together they form a unique fingerprint.

Cite this