Search

Thesis
Peer Reviewed

Deep Learning for Puzzles and Circadian Rhythms

Agostinelli, Forest
Advisor(s): Baldi, Pierre

UC Irvine Electronic Theses and Dissertations (2019)

The combination of deep learning with reinforcement learning and the application of deep learning to the sciences is a relatively new and flourishing field. We show how deep reinforcement learning techniques can learn to solve problems, often in the most efficient way possible, when faced with many possibilities but little information by designing an algorithm that can learn to solve seven different combinatorial puzzles, including the Rubik's cube. Furthermore, we show how deep learning can be applied to the field of circadian rhythms. Circadian rhythms are fundamental for all forms of life. Using deep learning, we can gain insight into circadian rhythms on the molecular level. Finally, we propose new deep learning algorithms that yield significant performance improvements on computer vision and high energy physics tasks.

Cover page: Deep Learning for Puzzles and Circadian Rhythms

Creative Commons 'BY' version 4.0 license

Article
Peer Reviewed

CircadiOmics: circadian omic web portal

UC Irvine Previously Published Works (2022)

Circadian rhythms are a foundational aspect of biology. These rhythms are found at the molecular level in every cell of every living organism and they play a fundamental role in homeostasis and a variety of physiological processes. As a result, biomedical research of circadian rhythms continues to expand at a rapid pace. To support this research, CircadiOmics (http://circadiomics.igb.uci.edu/) is the largest annotated repository and analytic web server for high-throughput omic (e.g. transcriptomic, metabolomic, proteomic) circadian time series experimental data. CircadiOmics contains over 290 experiments and over 100 million individual measurements, across >20 unique tissues/organs, and 11 different species. Users are able to visualize and mine these datasets by deriving and comparing periodicity statistics for oscillating molecular species including: period, amplitude, phase, P-value and q-value. These statistics are obtained from BIO_CYCLE and JTK_CYCLE and are intuitively aggregated and displayed for comparison. CircadiOmics is the most up-to-date and cutting-edge web portal for searching and analyzing circadian omic data and is used by researchers around the world.

Cover page: CircadiOmics: circadian omic web portal

Article
Peer Reviewed

What time is it? Deep learning approaches for circadian rhythms

UC Irvine Previously Published Works (2016)

Motivation

Circadian rhythms date back to the origins of life, are found in virtually every species and every cell, and play fundamental roles in functions ranging from metabolism to cognition. Modern high-throughput technologies allow the measurement of concentrations of transcripts, metabolites and other species along the circadian cycle creating novel computational challenges and opportunities, including the problems of inferring whether a given species oscillate in circadian fashion or not, and inferring the time at which a set of measurements was taken.

Results

We first curate several large synthetic and biological time series datasets containing labels for both periodic and aperiodic signals. We then use deep learning methods to develop and train BIO_CYCLE, a system to robustly estimate which signals are periodic in high-throughput circadian experiments, producing estimates of amplitudes, periods, phases, as well as several statistical significance measures. Using the curated data, BIO_CYCLE is compared to other approaches and shown to achieve state-of-the-art performance across multiple metrics. We then use deep learning methods to develop and train BIO_CLOCK to robustly estimate the time at which a particular single-time-point transcriptomic experiment was carried. In most cases, BIO_CLOCK can reliably predict time, within approximately 1 h, using the expression levels of only a small number of core clock genes. BIO_CLOCK is shown to work reasonably well across tissue types, and often with only small degradation across conditions. BIO_CLOCK is used to annotate most mouse experiments found in the GEO database with an inferred time stamp.

Availability and implementation

All data and software are publicly available on the CircadiOmics web portal: circadiomics.igb.uci.edu/

Contacts

fagostin@uci.edu or pfbaldi@uci.edu

Supplementary information

Supplementary data are available at Bioinformatics online.

Article
Peer Reviewed

What time is it? Deep learning approaches for circadian rhythms

UC Irvine Previously Published Works (2016)

Article
Peer Reviewed

CircadiOmics: circadian omic web portal

UC Irvine Previously Published Works (2018)

Circadian rhythms play a fundamental role at all levels of biological organization. Understanding the mechanisms and implications of circadian oscillations continues to be the focus of intense research. However, there has been no comprehensive and integrated way for accessing and mining all circadian omic datasets. The latest release of CircadiOmics (http://circadiomics.ics.uci.edu) fills this gap for providing the most comprehensive web server for studying circadian data. The newly updated version contains high-throughput 227 omic datasets corresponding to over 74 million measurements sampled over 24 h cycles. Users can visualize and compare oscillatory trajectories across species, tissues and conditions. Periodicity statistics (e.g. period, amplitude, phase, P-value, q-value etc.) obtained from BIO_CYCLE and other methods are provided for all samples in the repository and can easily be downloaded in the form of publication-ready figures and tables. New features and substantial improvements in performance and data volume make CircadiOmics a powerful web portal for integrated analysis of circadian omic data.

Article
Peer Reviewed

Hippocampal ensembles represent sequential relationships among an extended sequence of nonspatial events

UC Irvine Previously Published Works (2022)

The hippocampus is critical to the temporal organization of our experiences. Although this fundamental capacity is conserved across modalities and species, its underlying neuronal mechanisms remain unclear. Here we recorded hippocampal activity as rats remembered an extended sequence of nonspatial events unfolding over several seconds, as in daily life episodes in humans. We then developed statistical machine learning methods to analyze the ensemble activity and discovered forms of sequential organization and coding important for order memory judgments. Specifically, we found that hippocampal ensembles provide significant temporal coding throughout nonspatial event sequences, differentiate distinct types of task-critical information sequentially within events, and exhibit theta-associated reactivation of the sequential relationships among events. We also demonstrate that nonspatial event representations are sequentially organized within individual theta cycles and precess across successive cycles. These findings suggest a fundamental function of the hippocampal network is to encode, preserve, and predict the sequential order of experiences.

Cover page: Hippocampal ensembles represent sequential relationships among an extended sequence of nonspatial events

Article
Peer Reviewed

Transverse momentum dependence of inclusive primary charged-particle production in p–Pb collisions at sNN=5.02TeV

UC Berkeley Previously Published Works (2014)

The transverse momentum (Formula presented.) distribution of primary charged particles is measured at midrapidity in minimum-bias p–Pb collisions at (Formula presented.)NN = 5.02(Formula presented.) TeV with the ALICE detector at the LHC in the range. The spectra are compared to the expectation based on binary collision scaling of particle production in pp collisions, leading to a nuclear modification factor consistent with unity for (Formula presented.) larger than 2 GeV/(Formula presented.)(Formula presented.) around 4 (Formula presented.). The measurement is compared to theoretical calculations and to data in Pb–Pb collisions at (Formula presented.) TeV.

Cover page: Transverse momentum dependence of inclusive primary charged-particle production in p–Pb collisions at sNN=5.02TeV

Article
Peer Reviewed

Event-by-event mean pT fluctuations in pp and Pb–Pb collisions at the LHC

UC Berkeley Previously Published Works (2014)

Event-by-event fluctuations of the mean transverse momentum of charged particles produced in pp collisions at 0.9, 2.76 and 7 TeV, and Pb–Pb collisions at 2.76 TeV are studied as a function of the charged-particle multiplicity using the ALICE detector at the LHC. Dynamical fluctuations indicative of correlated particle emission are observed in all systems. The results in pp collisions show little dependence on collision energy. The Monte Carlo event generators PYTHIA and PHOJET are in qualitative agreement with the data. Peripheral Pb–Pb data exhibit a similar multiplicity dependence as that observed in pp. In central Pb–Pb, the results deviate from this trend, featuring a significant reduction of the fluctuation strength. The results in Pb–Pb are in qualitative agreement with previous measurements in Au–Au at lower collision energies and with expectations from models that incorporate collective phenomena.

Cover page: Event-by-event mean pT fluctuations in pp and Pb–Pb collisions at the LHC

Article
Peer Reviewed

Energy dependence of the transverse momentum distributions of charged particles in pp collisions measured by ALICE

UC Berkeley Previously Published Works (2013)

Differential cross sections of charged particles in inelastic pp collisions as a function of p_T have been measured at [Formula: see text] at the LHC. The p_T spectra are compared to NLO-pQCD calculations. Though the differential cross section for an individual [Formula: see text] cannot be described by NLO-pQCD, the relative increase of cross section with [Formula: see text] is in agreement with NLO-pQCD. Based on these measurements and observations, procedures are discussed to construct pp reference spectra at [Formula: see text] up to p_T=50 GeV/c as required for the calculation of the nuclear modification factor in nucleus-nucleus and proton-nucleus collisions.

Cover page: Energy dependence of the transverse momentum distributions of charged particles in pp collisions measured by ALICE

Article
Peer Reviewed

Inclusive photon production at forward rapidities in proton–proton collisions at s = 0.9, 2.76 and 7 TeV

UC Berkeley Previously Published Works (2015)

The multiplicity and pseudorapidity distributions of inclusive photons have been measured at forward rapidities (2.3 < η < 3.9) in proton–proton collisions at three center-of-mass energies, √s = 0.9, 2.76 and 7 TeV using the ALICE detector. It is observed that the increase in the average photon multiplicity as a function of beam energy is compatible with both a logarithmic and a power-law dependence. The relative increase in average photon multiplicity produced in inelastic pp collisions at 2.76 and 7 TeV center-of-mass energies with respect to 0.9 TeV are 37.2± 0.3% (stat) ± 8.8% (sys) and 61.2 ± 0.3 % (stat) ± 7.6% (sys), respectively. The photon multiplicity distributions for all center-of-mass energies are well described by negative binomial distributions. The multiplicity distributions are also presented in terms of KNO variables. The results are compared to model predictions, which are found in general to underestimate the data at large photon multiplicities, in particular at the highest center-of-mass energy. Limiting fragmentation behavior of photons has been explored with the data, but is not observed in the measured pseudorapidity range.

Cover page: Inclusive photon production at forward rapidities in proton–proton collisions at s = 0.9, 2.76 and 7 TeV