Inicio  /  Information  /  Vol: 14 Par: 2 (2023)  /  Artículo
ARTÍCULO
TITULO

Challenges Encountered and Lessons Learned when Using a Novel Anonymised Linked Dataset of Health and Social Care Records for Public Health Intelligence: The Sussex Integrated Dataset

Elizabeth Ford    
Richard Tyler    
Natalie Johnston    
Vicki Spencer-Hughes    
Graham Evans    
Jon Elsom    
Anotida Madzvamuse    
Jacqueline Clay    
Kate Gilchrist and Melanie Rees-Roberts    

Resumen

Background: In the United Kingdom National Health Service (NHS), digital transformation programmes have resulted in the creation of pseudonymised linked datasets of patient-level medical records across all NHS and social care services. In the Southeast England counties of East and West Sussex, public health intelligence analysts based in local authorities (LAs) aimed to use the newly created ?Sussex Integrated Dataset? (SID) for identifying cohorts of patients who are at risk of early onset multiple long-term conditions (MLTCs). Analysts from the LAs were among the first to have access to this new dataset. Methods: Data access was assured as the analysts were employed within joint data controller organisations and logged into the data via virtual machines following approval of a data access request. Analysts examined the demographics and medical history of patients against multiple external sources, identifying data quality issues and developing methods to establish true values for cases with multiple conflicting entries. Service use was plotted over timelines for individual patients. Results: Early evaluation of the data revealed multiple conflicting within-patient values for age, sex, ethnicity and date of death. This was partially resolved by creating a ?demographic milestones? table, capturing demographic details for each patient for each year of the data available in the SID. Older data (=5 y) was found to be sparse in events and diagnoses. Open-source code lists for defining long-term conditions were poor at identifying the expected number of patients, and bespoke code lists were developed by hand and validated against other sources of data. At the start, the age and sex distributions of patients submitted by GP practices were substantially different from those published by NHS Digital, and errors in data processing were identified and rectified. Conclusions: While new NHS linked datasets appear a promising resource for tracking multi-service use, MLTCs and health inequalities, substantial investment in data analysis and data architect time is necessary to ensure high enough quality data for meaningful analysis. Our team made conceptual progress in identifying the skills needed for programming analyses and understanding the types of questions which can be asked and answered reliably in these datasets.

 Artículos similares

       
 
Hui-Jun Kim, Jung-Soon Kim and Sung-Hee Kim    
The existing question-and-answer screening test has a limitation in that test accuracy varies due to a high learning effect and based on the inspector?s competency, which can have consequences for rapid-onset cognitive-related diseases. To solve this pro... ver más
Revista: Applied Sciences

 
Eugenia I. Toki, Jenny Pange, Giorgos Tatsis, Konstantinos Plachouras and Ioannis G. Tsoulos    
Autism Spectrum Disorder is known to cause difficulties in social interaction and communication, as well as repetitive patterns of behavior, interests, or hobbies. These challenges can significantly affect the individual?s daily life. Therefore, it is cr... ver más
Revista: Applied Sciences

 
Gerardo José Ginovart-Panisello, Ignasi Iriondo, Tesa Panisello Monjo, Silvia Riva, Jordi Casadó Cancer and Rosa Ma Alsina-Pagès    
Acoustic studies on poultry show that chicken vocalizations can be a real-time indicator of the health conditions of the birds and can improve animal welfare and farm management. In this study, hens vaccinated against infectious laryngotracheitis (ILT) w... ver más
Revista: Applied Sciences

 
Ichchha Pradeep Sharma, Tam V. Nguyen, Shruti Ajay Singh and Tom Ongwere    
This paper focuses on addressing the complex healthcare needs of patients struggling with discordant chronic comorbidities (DCCs). Managing these patients within the current healthcare system often proves to be a challenging process, characterized by evo... ver más
Revista: Information

 
Leon Kopitar, Iztok Fister, Jr. and Gregor Stiglic    
Introduction: Type 2 diabetes mellitus is a major global health concern, but interpreting machine learning models for diagnosis remains challenging. This study investigates combining association rule mining with advanced natural language processing to im... ver más
Revista: Information