Analytical challenges in the era of big data
Alvin D. Jeffery, PhD, RN-BC, CCRN-K, FNP-BC
- Sigma Affiliation
- Iota at-Large
Visits vs Downloads
Visitors - World Map
Top Visiting Countries
Top Visiting Cities
Visits (last 6 months)
Downloads (last 6 months)
Popular Works for Jeffery, Alvin Dean by View
Popular Works for Jeffery, Alvin Dean by Download
Purpose: The popularity of “big data” along with an increasing capacity for real-time predictive analytics holds significant promise for nurses and other clinicians to gain new insights and develop novel decision support tools from our large clinical datasets. Unfortunately, these large datasets are not the panacea that some big data proponents would taut. For nurses with vast subject matter expertise in a clinical area who desire to leverage big data for solving practical problems, road blocks quickly surface in the form of acquisition and management of data, missing data, meeting assumptions of statistical models, and model evaluation for statistical and clinical performance. This talk will engage the audience in addressing these issues using an exemplar of the development of a prediction model for in-hospital cardiopulmonary arrest.
Methods: The following 4 topics will be addressed:
Data Acquisition and Management: From ethics approval to ensuring individual patient privacy to preventing undesired user access, collecting and storing “big data” is no simple task. The presenter will provide: (a) an overview of key concepts, (b) an exemplar for constructing a data acquisition and management team, and (c) several resources for learning more independently.
Missing Data: Almost all large datasets contain some amount of missing data. Regardless of the amount, finding the cause of missingness is of paramount importance. Approaches to determining a cause will be introduced, and disadvantages of complete case analysis will be described. Advantages and disadvantages of median imputation, multiple imputation, and machine learning imputation will be compared.
Statistical Model Assumptions: There are a variety of statistical models available, and with recent advances in machine learning methods, more approaches to retrieve information from the data are available to a wide array of users. An overview of the purpose and requirements of traditional modeling (e.g., logistic and linear regression) and machine learning approaches (e.g., random forests and cluster analyses) will be provided.
Model Evaluation: Determining how well a model performs on the current data and how well it is expected to perform on future data is essential in determining whether or not the model is helpful for clinical care. Internal (e.g., bootstrapping and cross-validation) versus external validation (e.g., split sample and chronological validation) techniques will be presented along with their respective advantages and disadvantages.
Results: Our in-hospital cardiopulmonary arrest prediction model required a team-based approach to solving the aforementioned challenges, and the audience will hear not only how we chose to solve the problems but also other approaches we considered. From the perspective of data acquisition/management, we found the best approach to be the inclusion of database and informatics specialists who used structured query language to extract the relevant data and then store it on a secure, organizational server. Following a simulation study, we discovered the missing data problem was best resolved by creating a multiple imputation model that included the outcome variable. Statistical model assumptions were best met by not assuming linearity while not permitting too many spline knots. Model evaluation comprised internal bootstrap validation for the regression models and split-sample validation for the machine learning methods.
Conclusion: Arriving at clinically meaningful insights contained within large datasets requires multifaceted expertise and teamwork. Nurses and other clinicians are the best members of the team to identify a problem that “big data” can help solve. To ensure a clinically meaningful solution surfaces from big data efforts, nurses should be aware of common challenges in big data research. As nurses become more knowledgeable, they position themselves to be leaders in these research teams and advocates for implementation of novel findings.
Event Theme: Influencing Global Health Through the Advancement of Nursing Scholarship
Items submitted to a conference/event were evaluated/peer-reviewed at the time of abstract submission to the event. No other peer-review was provided prior to submission to the Henderson Repository.
|Abstract Review Only: Reviewed by Event Host
Clinical Decision Support;
All rights reserved by the author(s) and/or publisher(s) listed in this item record unless relinquished in whole or part by a rights notation or a Creative Commons License present in this item record.
All permission requests should be directed accordingly and not to the Sigma Repository.
All submitting authors or publishers have affirmed that when using material in their work where they do not own copyright, they have obtained permission of the copyright holder prior to submission and the rights holder has been acknowledged as necessary.
Showing items related by title, author, creator and subjects.
Impact of real-time prediction model-enhanced clinical decision support systems on nursing sensitive patient outcomes: A review of the literature Jeffery, Alvin Dean (2016-03-17)Session presented on Saturday, July 25, 2015: Background: The popularity of 'big data' along with an increasing capacity for real-time predictive analytics to augment clinical decision support systems (CDSS) within electronic ...
Leveraging statistical simulations to gain insights from data: A new type of simulation for nurses Jeffery, Alvin Dean (2017-07-27)This presentation will provide several examples of statistical simulation studies in order to demonstrate the benefit of these techniques for nurse scientists working with quantitative data.
Statistical modeling approaches and user-centered design for nursing decision support tools predicting in-hospital cardiopulmonary arrest Jeffery, Alvin DeanThis doctoral research explored strategies for the design and statistical development of probability-based nursing decision support tools within the clinical context of in-hospital cardiopulmonary arrest (IHCPA). IHCPA ...
Jeffery, Alvin Dean; Jarvis, Robin L.; Word-Allen, Amy J. (Sigma Theta Tau International, 2018)This item record pertains to an excerpt from Staff Educator's Guide to Clinical Orientation (2nd ed.), provided free of charge by the publisher. About this book: Staff Educator's Guide to Clinical Orientation guides you ...
Jeffery, Alvin Dean; Myers, Lynnea; Werthman, Jennifer Ann; Nimmagadda, Heather Lynn (2016-03-17)Session presented on Saturday, July 25, 2015: As the global healthcare landscape continues to evolve from factors such as technological changes, socioeconomic shifts, and chronic disease emergence, a well-equipped cadre ...