
Big Data in Pediatric Oncology and Rare Diseases
Through its Sandpit funding program, Wübben Stiftung Wissenschaft supports the development of innovative and highly relevant research approaches. The first call for proposals focused on projects that unlock new ways of using big data in clinical research. Two promising initiatives have now been selected for funding: a project to build a synthetic health data repository called SHARE, and a pioneering project for home-based health monitoring in pediatric oncology.
The future of childhood cancer treatment: a smart, medically safe home 
The aim of this Sandpit project is to enable proactive, personalized treatment of children with cancer in their home environment. “Looking after children with cancer is emotionally taxing,” says project spokesperson Dominik Schöndorf from University Hospital Saarland. “Parents are often unsure when they require medical assistance, and especially when they need to take their child to hospital outside of scheduled appointments. Our project is intended to prevent unnecessary stays in hospital.” The Sandpit project aims to put in place infrastructure that will make use of big data, artificial intelligence, and machine learning to predict the medical needs of children before an emergency situation arises. The idea is that the patient’s home environment will provide a constant stream of health information so that warning signs can be spotted at an early stage. These could be changes in vital signs, altered behavior, or environmental factors. It is hoped that intelligent networking between home and hospital will prioritize quality of care for patients and quality of life for families. 
Building SHARE: the Synthetic Health Data Repository
An international team is planning to build a synthetic health data repository for rare diseases. The aim is to make standardized data freely available to researchers to train AI models and to conduct data-driven research, facilitating research in the field of healthcare and pharmaceuticals. “The need for large volumes of data for AI models in healthcare is increasing exponentially,” says project spokesperson Jannik Schaaf from Goethe University Frankfurt. “Particularly for rare diseases, of which there are more than 7,000, there are hardly any statistically relevant patient datasets and only a few AI models.” Synthetic data is generated artificially and can reflect statistical properties of real data. This type of data is vital for clinical research, especially in the area of rare diseases. Another problem the project aims to address concerns global comparability because it is often not possible to access the data underlying existing trained AI models. SHARE is designed to close these gaps and provide standardized data for researchers worldwide.
Wübben Stiftung Wissenschaft’s Sandpit program is a three-day event format that brings together participants in interdisciplinary yet unfamiliar teams to identify pioneering research avenues or questions. Further information about the program can be found here.