Skip to main content
Top

12-03-2024

Efficient Classification of Hallmark of Cancer Using Embedding-Based Support Vector Machine for Multilabel Text

Authors: Shikha Verma, Aditi Sharan, Nidhi Malik

Published in: New Generation Computing

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The Hallmark of Cancers consists of various biological capabilities of the tumor cell which help the medical experts to understand the development and identification of these cells during various stages of the cancer disease. The hallmark of cancer classification is a widely accepted framework that characterizes the fundamental biological capabilities of cancer cells. This classification is based on the work of Hanahan and Weinberg, who identified 10 hallmark capabilities that collectively enable the development and progression of cancer. The hallmark of cancer classification provides a comprehensive framework for understanding the biological basis of cancer development and progression. It helps researchers to identify the key molecular and cellular pathways that are involved in the disease, which can inform the development of new diagnostic tools and therapies. Multi-label classification aims to assign a set of labels to the samples under study. This paper focuses on creating an improved model by hybridizing the biomedical domain-specific embeddings for all the extracted biomedical features on the machine learning model. The use of domain-specific embeddings adds semantics to the vector-represented text. More specifically the study has tried to improve the efficacy of the multi-label classification as compared with other state-of-art methods using BioWordVec and the MeSH embeddings. The experimental work showed a significant improvement in the performance of our model which is being trained on the machine learning algorithm Support Vector Machine (SVM). The paper also focuses on understanding the label correlation which is studied by conducting a case study with medical domain experts and is also analyzed with the proposed model.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
2.
4.
go back to reference Budhiraja, M.: Multi label text classification for untrained data through supervised learning. In: 2017 International Conference on Intelligent Computing and Control (I2C2). Presented at the 2017 International Conference on Intelligent Computing and Control (I2C2), pp. 1–3 (2017). https://doi.org/10.1109/I2C2.2017.8321804 Budhiraja, M.: Multi label text classification for untrained data through supervised learning. In: 2017 International Conference on Intelligent Computing and Control (I2C2). Presented at the 2017 International Conference on Intelligent Computing and Control (I2C2), pp. 1–3 (2017). https://​doi.​org/​10.​1109/​I2C2.​2017.​8321804
5.
go back to reference Cerri, R., da Silva, R.R.O., de Carvalho, A.C.P.L.F.: Comparing methods for multilabel classification of proteins using machine learning techniques. In: Guimarães, K.S., Panchenko, A., Przytycka, T.M. (eds.) Advances in Bioinformatics and Computational Biology, pp. 109–120. Springer, Berlin, Heidelberg (2009). https://doi.org/10.1007/978-3-642-03223-3_10 Cerri, R., da Silva, R.R.O., de Carvalho, A.C.P.L.F.: Comparing methods for multilabel classification of proteins using machine learning techniques. In: Guimarães, K.S., Panchenko, A., Przytycka, T.M. (eds.) Advances in Bioinformatics and Computational Biology, pp. 109–120. Springer, Berlin, Heidelberg (2009). https://​doi.​org/​10.​1007/​978-3-642-03223-3_​10
7.
go back to reference Doan, S., Kawazoe, A., Collier, N.: The role of roles in classifying annotated biomedical text. In: Biological, Translational, and Clinical Language Processing, pp. 17–24. Prague, Czech Republic, Association for Computational Linguistics (2007) Doan, S., Kawazoe, A., Collier, N.: The role of roles in classifying annotated biomedical text. In: Biological, Translational, and Clinical Language Processing, pp. 17–24. Prague, Czech Republic, Association for Computational Linguistics (2007)
8.
go back to reference Guo, H., Li, X., Zhang, L., Liu, J., Chen, W.: Label-aware text representation for multi-label text classification. In: ICASSP 2021–2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Presented at the ICASSP 2021–2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 7728–7732 (2021). https://doi.org/10.1109/ICASSP39728.2021.9413921 Guo, H., Li, X., Zhang, L., Liu, J., Chen, W.: Label-aware text representation for multi-label text classification. In: ICASSP 2021–2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Presented at the ICASSP 2021–2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 7728–7732 (2021). https://​doi.​org/​10.​1109/​ICASSP39728.​2021.​9413921
14.
go back to reference Verma, S., Sharan, A.: Incorporating semantics for text classification in biomedical domain. in Proceedings of the International Health Informatics Conference, Jain, S., Groppe, S., Mihindukulasooriya, N. Eds., in Lecture Notes in Electrical Engineering. Singapore: Springer Nature, 2023, pp. 185–197. https://doi.org/10.1007/978-981-19-9090-8_17 Verma, S., Sharan, A.: Incorporating semantics for text classification in biomedical domain. in Proceedings of the International Health Informatics Conference, Jain, S., Groppe, S., Mihindukulasooriya, N. Eds., in Lecture Notes in Electrical Engineering. Singapore: Springer Nature, 2023, pp. 185–197. https://​doi.​org/​10.​1007/​978-981-19-9090-8_​17
18.
go back to reference Xun, G., Jha, K., Yuan, Y., Zhang, A.: Topic discovery for biomedical corpus using MeSH Embeddings. In: 2019 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI). Presented at the 2019 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI), pp. 1–4 (2019). https://doi.org/10.1109/BHI.2019.8834559 Xun, G., Jha, K., Yuan, Y., Zhang, A.: Topic discovery for biomedical corpus using MeSH Embeddings. In: 2019 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI). Presented at the 2019 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI), pp. 1–4 (2019). https://​doi.​org/​10.​1109/​BHI.​2019.​8834559
19.
go back to reference Yang, J., Bai, L., Guo, Y.: A survey of text classification models. In: Proceedings of the 2020 2nd International Conference on Robotics, Intelligent Control and Artificial Intelligence, RICAI 2020, pp. 327–334. Association for Computing Machinery, New York, NY (2020). https://doi.org/10.1145/3438872.3439101 Yang, J., Bai, L., Guo, Y.: A survey of text classification models. In: Proceedings of the 2020 2nd International Conference on Robotics, Intelligent Control and Artificial Intelligence, RICAI 2020, pp. 327–334. Association for Computing Machinery, New York, NY (2020). https://​doi.​org/​10.​1145/​3438872.​3439101
20.
go back to reference Yu, T., Li, T., Wang, X.: Multi-label text classification with label correction under noise. In: 2021 10th International Conference on Computing and Pattern Recognition, ICCPR 2021, pp. 169–174. Association for Computing Machinery, New York, NY (2021). https://doi.org/10.1145/3497623.3497650 Yu, T., Li, T., Wang, X.: Multi-label text classification with label correction under noise. In: 2021 10th International Conference on Computing and Pattern Recognition, ICCPR 2021, pp. 169–174. Association for Computing Machinery, New York, NY (2021). https://​doi.​org/​10.​1145/​3497623.​3497650
Metadata
Title
Efficient Classification of Hallmark of Cancer Using Embedding-Based Support Vector Machine for Multilabel Text
Authors
Shikha Verma
Aditi Sharan
Nidhi Malik
Publication date
12-03-2024
Publisher
Springer Japan
Published in
New Generation Computing
Print ISSN: 0288-3635
Electronic ISSN: 1882-7055
DOI
https://doi.org/10.1007/s00354-024-00248-3

Premium Partner