Article
Peer Reviewed

¡Mueve la Almohada! ¡Levante la Cara! (Move the pillow. Lift your head) An Analysis of Correction Talk in Mexican and Central American Parent Child Interaction

Bhimji, Fazila

Issues in Applied Linguistics, Volume 8, Issue 2 (1997)

The paper examines parent-child interaction in Mexican and Central American families. The paper focuses on the forms of discourse parents adopt to correct children's speech and non-verbal behavior. The majority of the time, parents employ unmodulated corrections and bald imperatives to direct children's behavior. When modulated forms of language are employed, it is done in the context of teasing. The paper also illustrates how children respond to corrections of their speech and behavior. Most of the time, children exhibit an epistemological stance, i.e., a display of knowledge, and do not necessarily model correct forms of behavior in their subsequent turns.

Article
Peer Reviewed

“Un Niño Puede Agarrar un Perro”: Children’s Use and Uptake of Directives in the Context of Play and Performance

Bhimji, Fazila

Issues in Applied Linguistics, Volume 15, Issue 1 (2006)

This paper examines the ways in which Mexican American children use directives in the context of play. There is a range of directives that young children employ as they do pretend play, teach their younger siblings new play skills, and spontaneously invent play. Much of the research discussing the use of directives among young children has not explored the range of directives they may use in mixed-age play but rather has argued that children learn to employ more complex forms as they become older. I argue that age is not the only factor leading children to use directives in complex forms. In mixed-age play, older children may simplify their directives and younger children may utter directives in complex ways to fit the play. Data are drawn from 50 hours of video-recording naturally occurring verbal and nonverbal actions among caregivers and young children in three Mexican American families living in South Central Los Angeles.

Article
Peer Reviewed

A Pattern Recognition Algorithm for Quantum Annealers

UC Berkeley Previously Published Works (2020)

The reconstruction of charged particles will be a key computing challenge for the high-luminosity Large Hadron Collider (HL-LHC) where increased data rates lead to a large increase in running time for current pattern recognition algorithms. An alternative approach explored here expresses pattern recognition as a quadratic unconstrained binary optimization (QUBO), which allows algorithms to be run on classical and quantum annealers. While the overall timing of the proposed approach and its scaling has still to be measured and studied, we demonstrate that, in terms of efficiency and purity, the same physics performance of the LHC tracking algorithms can be achieved. More research will be needed to achieve comparable performance in HL-LHC conditions, as increasing track density decreases the purity of the QUBO track segment classifier.
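As a rough illustration of what a QUBO formulation looks like, the sketch below minimizes x^T Q x over binary vectors by exhaustive search. It is not the paper's method: the matrix `Q`, the three "candidate segments," and the function names are hypothetical, chosen only to show how negative diagonal terms reward selecting a segment while positive off-diagonal terms penalize selecting two conflicting ones. An annealer would replace the brute-force loop.

```python
from itertools import product


def qubo_energy(Q, x):
    """Return x^T Q x for a binary assignment x."""
    n = len(x)
    return sum(Q[i][j] * x[i] * x[j] for i in range(n) for j in range(n))


def solve_qubo_brute_force(Q):
    """Try all 2^n binary assignments and return the lowest-energy one.

    Only feasible for tiny n; it stands in for an annealer here.
    """
    n = len(Q)
    best = min(product((0, 1), repeat=n), key=lambda x: qubo_energy(Q, x))
    return list(best), qubo_energy(Q, best)


# Hypothetical toy problem with three candidate segments:
# each segment has a bias of -1 (selecting it lowers the energy),
# segments 0 and 1 conflict (+2), segments 1 and 2 are compatible (-0.5).
Q = [
    [-1.0, 2.0, 0.0],
    [0.0, -1.0, -0.5],
    [0.0, 0.0, -1.0],
]

assignment, energy = solve_qubo_brute_force(Q)
print(assignment, energy)
```

In this toy case the minimum keeps the compatible pair (segments 1 and 2) and drops the conflicting segment 0.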

Article
Peer Reviewed

Revealing Fundamental Physics from the Daya Bay Neutrino Experiment using Deep Neural Networks

UC Irvine Previously Published Works (2016)

Experiments in particle physics produce enormous quantities of data that must be analyzed and interpreted by teams of physicists. This analysis is often exploratory, where scientists are unable to enumerate the possible types of signal prior to performing the experiment. Thus, tools for summarizing, clustering, visualizing and classifying high-dimensional data are essential. In this work, we show that meaningful physical content can be revealed by transforming the raw data into a learned high-level representation using deep neural networks, with measurements taken at the Daya Bay Neutrino Experiment as a case study. We further show how convolutional deep neural networks can provide an effective classification filter with greater than 97% accuracy across different classes of physics events, significantly better than other machine learning approaches.

Article
Peer Reviewed

Understanding the I/O Performance Gap Between Cori KNL and Haswell

LBL Publications (2017)

The Cori system at NERSC has two compute partitions with different CPU architectures: a 2,004 node Haswell partition and a 9,688 node KNL partition, which ranked as the 5th most powerful and fastest supercomputer on the November 2016 Top 500 list. The compute partitions share a common storage configuration, and understanding the IO performance gap between them is important, not only to NERSC/LBNL users and other national labs, but also to the relevant hardware vendors and software developers. In this paper, we have comprehensively analyzed the performance of single core and single node IO on the Haswell and KNL partitions, and have discovered the major bottlenecks, which include CPU frequencies and memory copy performance. We have also extended our performance tests to multi-node IO and revealed the IO cost difference caused by network latency, buffer size, and communication cost. Overall, we have developed a strong understanding of the IO gap between Haswell and KNL nodes, and the lessons learned from this exploration will guide us in designing optimal IO solutions in the many-core era.

Article

PANDA: Extreme Scale Parallel K-Nearest Neighbor on Distributed Architectures

LBL Publications (2016)

Computing k-Nearest Neighbors (KNN) is one of the core kernels used in many machine learning, data mining and scientific computing applications. Although kd-tree based O(log n) algorithms have been proposed for computing KNN, due to their inherent sequentiality, linear algorithms are being used in practice. This limits the applicability of such methods to millions of data points, with limited scalability for Big Data analytics challenges in the scientific domain. In this paper, we present parallel and highly optimized kd-tree based KNN algorithms (both construction and querying) suitable for distributed architectures. Our algorithm includes novel approaches for pruning the search space and improving load balancing and partitioning among nodes and threads. Using TB-sized datasets from three science applications (astrophysics, plasma physics, and particle physics), we show that our implementation can construct a kd-tree of 189 billion particles in 48 seconds using 50,000 cores. We also demonstrate computation of KNN of 19 billion queries in 12 seconds. We demonstrate almost linear speedup for both shared and distributed memory computers. Our algorithms outperform earlier implementations by more than an order of magnitude, thereby radically improving the applicability of our implementation to state-of-the-art Big Data analytics problems.
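For readers unfamiliar with the underlying technique, the following is a minimal serial sketch of kd-tree construction and a pruned k-nearest-neighbor query. It is not the PANDA implementation and includes none of its parallelism, load balancing, or distributed partitioning; the function names, the dictionary-based node layout, and the sample points are assumptions made for illustration.

```python
import heapq


def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def build_kdtree(points, depth=0):
    """Recursively split points at the median, cycling through the axes."""
    if not points:
        return None
    axis = depth % len(points[0])
    pts = sorted(points, key=lambda p: p[axis])
    mid = len(pts) // 2
    return {
        "point": pts[mid],
        "axis": axis,
        "left": build_kdtree(pts[:mid], depth + 1),
        "right": build_kdtree(pts[mid + 1:], depth + 1),
    }


def _search(node, query, k, heap):
    """Depth-first search keeping the k best candidates in a max-heap."""
    if node is None:
        return
    d = sq_dist(node["point"], query)
    if len(heap) < k:
        heapq.heappush(heap, (-d, node["point"]))
    elif d < -heap[0][0]:
        heapq.heapreplace(heap, (-d, node["point"]))
    diff = query[node["axis"]] - node["point"][node["axis"]]
    near, far = (node["left"], node["right"]) if diff < 0 else (node["right"], node["left"])
    _search(near, query, k, heap)
    # Prune the far subtree unless the splitting plane is closer than
    # the current k-th best distance (or fewer than k points are found).
    if len(heap) < k or diff * diff < -heap[0][0]:
        _search(far, query, k, heap)


def k_nearest(points, query, k):
    """Return the k points nearest to query, closest first."""
    heap = []
    _search(build_kdtree(list(points)), query, k, heap)
    return [p for _, p in sorted(heap, reverse=True)]


# Hypothetical sample data.
points = [(2, 3), (5, 4), (9, 6), (4, 7), (8, 1), (7, 2)]
neighbors = k_nearest(points, (9, 2), 2)
print(neighbors)
```

The pruning test in `_search` is what lets a query skip most of the tree; the abstract's point is that doing this efficiently across many nodes and threads requires additional work beyond this serial form.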

Article
Peer Reviewed

PANDA: Extreme Scale Parallel K-Nearest Neighbor on Distributed Architectures

LBL Publications (2016)

Article
Peer Reviewed

Interactive Distributed Deep Learning with Jupyter Notebooks

UC Berkeley Previously Published Works (2018)

Deep learning researchers are increasingly using Jupyter notebooks to implement interactive, reproducible workflows with embedded visualization, steering and documentation. Such solutions are typically deployed on small-scale (e.g. single server) computing systems. However, as the sizes and complexities of datasets and associated neural network models increase, high-performance distributed systems become important for training and evaluating models in a feasible amount of time. In this paper we describe our vision for Jupyter notebook solutions to deploy deep learning workloads onto high-performance computing systems. We demonstrate the effectiveness of notebooks for distributed training and hyper-parameter optimization of deep neural networks with efficient, scalable backends.

Peer Reviewed

The NERSC Cori HPC System

NERSC (2019)

Article

Experiences with the Burst Buffer at NERSC

LBL Publications (2016)

NVRAM-based Burst Buffers are an important part of the emerging HPC storage landscape. The National Energy Research Scientific Computing Center (NERSC) at Lawrence Berkeley National Laboratory recently installed one of the first Burst Buffer systems as part of its new Cori supercomputer, collaborating with Cray on the development of the DataWarp software. NERSC has over 6500 users in 750 different projects spanning a wide variety of scientific applications, including climate modeling, combustion, fusion, astrophysics, computational biology, and many more. The applications of the Burst Buffer at NERSC are therefore also considerable and diverse. We describe here experiences with the first year of the NERSC Burst Buffer. A number of research projects have had early access to the Burst Buffer and have exercised its different capabilities to enable new scientific advancements. We present in-depth performance results and lessons learned from these real applications, as well as benchmark results and system configuration experiences.
