Rxivist logo

Automated Feature Extraction from Population Wearable Device Data Identified Novel Loci Associated with Sleep and Circadian Rhythms

By Xinyue Li, Hongyu Zhao

Posted 01 Apr 2020
bioRxiv DOI: 10.1101/2020.03.31.017608 (published DOI: 10.1371/journal.pgen.1009089)

Wearable devices have been increasingly used in research to provide continuous physical activity monitoring, but how to effectively extract features remains challenging for researchers. To analyze the generated actigraphy data in large-scale population studies, we developed computationally efficient methods to derive sleep and activity features through a Hidden Markov Model-based sleep/wake identification algorithm, and circadian rhythm features through a Penalized Multi-band Learning approach adapted from machine learning. Unsupervised feature extraction is useful when labeled data are unavailable, especially in large-scale population studies. We applied these two methods to the UK Biobank wearable device data and used the derived sleep and circadian features as phenotypes in genome-wide association studies. We identified 53 genetic loci with p<5*10-8 including genes known to be associated with sleep disorders and circadian rhythms as well as novel loci associated with Body Mass Index, mental diseases and neurological disorders, which suggest shared genetic factors of sleep and circadian rhythms with physical and mental health. Further cross-tissue enrichment analysis highlights the important role of the central nervous system and the shared genetic architecture with metabolism-related traits and the metabolic system. Our study demonstrates the effectiveness of our unsupervised methods for wearable device data when additional training data cannot be easily acquired, and our study further expands the application of wearable devices in population studies and genetic studies to provide novel biological insights.

Download data

  • Downloaded 238 times
  • Download rankings, all-time:
    • Site-wide: 138,003
    • In genetics: 5,437
  • Year to date:
    • Site-wide: 107,981
  • Since beginning of last month:
    • Site-wide: 92,180

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide