  On the Application of Advanced Machine Learning Methods to Analyze Enhanced, Multimodal Data from Persons Infected with COVID-19

Weng, C., Gautam, A., & Huson, D. (2021). On the Application of Advanced Machine Learning Methods to Analyze Enhanced, Multimodal Data from Persons Infected with COVID-19. Computation, 9: 4. doi:10.3390/computation9010004.

 Creators:
Weng, C, Author
Gautam, A (1), Author
Huson, DH, Author
Affiliations:
(1) IMPRS From Molecules to Organisms, Max Planck Institute for Developmental Biology, Max Planck Society, ou_3376131

Content

Free keywords: -
 Abstract: The current COVID-19 pandemic, caused by the rapid worldwide spread of the SARS-CoV-2 virus, is having severe consequences for human health and the world economy. The virus affects individuals differently: many infected patients show only mild symptoms, while others become critically ill. To lessen the impact of the pandemic, one problem is to determine which factors play an important role in the progression of a patient's disease. Here, we construct an enhanced, structured COVID-19 dataset from more than one source, using natural language processing to add local weather conditions and country-specific research sentiment. The enhanced structured dataset contains 301,363 samples and 43 features, and we apply both machine learning and deep learning algorithms to it to forecast patients' survival probability. In addition, we import alignment sequence data to improve the performance of the model. Applying Extreme Gradient Boosting (XGBoost) to the enhanced structured dataset achieves 97% accuracy in predicting patient survival, with climatic factors, followed by age, being the most important features. Similarly, applying a Multi-Layer Perceptron (MLP) achieves 98% accuracy. This work suggests that it is useful to enhance the available data, which consist mostly of basic patient information, with additional, potentially important features such as weather conditions. The explored models suggest that textual weather descriptions can improve outcome forecasts.

Details

Language(s):
 Dates: 2021-01
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: DOI: 10.3390/computation9010004
 Degree: -


Source 1

Title: Computation
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: Basel : MDPI
Pages: 15
Volume / Issue: 9
Sequence Number: 4
Start / End Page: -
Identifier: ISSN: 2079-3197
CoNE: https://pure.mpg.de/cone/journals/resource/2079-3197