Abstract
Data is essential for machine learning projects, and data accuracy is crucial for being able to trust the results obtained from the associated machine learning models. Previously, we developed machine learning models for predicting the treatment outcome for breast cancer patients who have undergone chemotherapy, along with a monitoring system for their treatment timeline that interactively shows the options and associated predictions. Available cancer datasets, such as the one used earlier, are often too small to obtain significant results, which makes it difficult to explore ways to further improve the predictive capability of the models. In this paper, we explore synthetic data generation as an alternative way to enhance our datasets. From our original dataset, we extract rules to generate fabricated data that capture the different characteristics inherent in the dataset. Additional rules can be used to capture general medical knowledge. We show how to formulate rules for our cancer treatment data, and use the IBM solver to obtain a corresponding synthetic dataset. We discuss challenges for future work.
Original language | English |
---|---|
Title of host publication | Rules and Reasoning |
Subtitle of host publication | 4th International Joint Conference, RuleML+RR 2020, Oslo, Norway, June 29–July 1, 2020, Proceedings |
Editors | Victor Gutiérrez Basulto, Tomáš Kliegr, Ahmet Soylu, Martin Giese, Dumitru Roman |
Place of Publication | Cham |
Publisher | Springer |
Pages | 168-176 |
Number of pages | 9 |
ISBN (Electronic) | 9783030579777 |
ISBN (Print) | 9783030579760 |
Publication status | Published - 2020 |
Event | 4th International Joint Conference on Rules and Reasoning (RuleML+RR 2020) - Online, Oslo, Norway. Duration: 29 Jun 2020 → 1 Jul 2020. Conference number: 4. https://2020.declarativeai.net/ |
Publication series
Name | Lecture Notes in Computer Science (Programming and Software Engineering) |
---|---|
Publisher | Springer |
Volume | 12173 LNCS |
ISSN (Print) | 0302-9743 |
Conference
Conference | 4th International Joint Conference on Rules and Reasoning (RuleML+RR 2020) |
---|---|
Abbreviated title | RuleML+RR 2020 |
Country/Territory | Norway |
City | Oslo |
Period | 29/06/20 → 1/07/20 |
Internet address | https://2020.declarativeai.net/ |
Keywords
- Cancer data
- Synthetic data
- Constraint solvers
- Fabrication rules
Fingerprint
Dive into the research topics of 'On defining rules for cancer data fabrication'. Together they form a unique fingerprint.