Published in Genome Biology: HATCHet2: clone- and haplotype-specific copy number inference from bulk tumor sequencing data

By Matthew A. Myers, Brian J. Arnold, Vineet Bansal, Metin Balaban, Katelyn M. Mullen, Simone Zaccaria & Benjamin J. Raphael

Bulk DNA sequencing of multiple samples from the same tumor is becoming common, yet most methods to infer copy-number aberrations (CNAs) from this data analyze individual samples independently. We introduce HATCHet2, an algorithm to identify haplotype- and clone-specific CNAs simultaneously from multiple bulk samples. HATCHet2 extends the earlier HATCHet method by improving identification of focal CNAs and introducing a novel statistic, the minor haplotype B-allele frequency (mhBAF), that enables identification of mirrored-subclonal CNAs. We demonstrate HATCHet2’s improved accuracy using simulations and a single-cell sequencing dataset. HATCHet2 analysis of 10 prostate cancer patients reveals previously unreported mirrored-subclonal CNAs affecting cancer genes.

Read the paper: https://doi.org/10.1186/s13059-024-03267-x

Posted in Uncategorized

Published in Journal of Open-Source Software: hf_hydrodata: A Python package for accessing hydrologic simulations and observations across the United States

By Amy Defnet, William Hasling, Laura Condon, Amy Johnson, Georgios Artavanis, Amanda Triplett, William Lytle, and Reed Maxwell

The field of hydrologic modeling, or modeling of the terrestrial hydrologic cycle, is very data intensive. Models require many inputs to define topography, geology and atmospheric conditions. Additionally, in situ observations such as streamflow rate and depth to groundwater can be used to evaluate model outputs and calibrate input parameters. There are many public organizations and research groups in the United States which produce and make freely available parts of this required data. However, the data have a wide range of spatiotemporal resolutions, file types, and methods of access. This makes finding and accessing all the data required for analysis a very time-consuming part of most hydrologic studies. The hf_hydrodata package is designed to simplify this data acquisition process by providing access to a broad array of variables, all of which have been pre-processed for consistency.

Reas the paper: https://doi.org/10.21105/joss.06623

Posted in Uncategorized

Published in IEEE Transactions on Power Electronics: How MagNet: Machine Learning Framework for Modeling Power Magnetic Material Characteristics

By Haoran Li, Diego Serrano, Thomas Guillod, Shukai Wang, Evan Dogariu, Andrew Nadler, Min Luo, Vineet Bansal, Niraj K. Jha, Yuxin Chen, Charles R. Sullivan, and Minjie Chen

This article applies machine learning to power magnetics modeling. We first introduce an open-source database—MagNet—which hosts a large amount of experimentally measured excitation data for many materials across a variety of operating conditions, consisting of more than 500 000 data points in its current state. The processes for data acquisition and data quality control are explained. We then demonstrate a few neural network-based power magnetics modeling tools for modeling the core losses and B–H loops. The neural network allows multiple factors that may influence the magnetic characteristics to be modeled in a unified framework, where the nonlinear behaviors are captured with high accuracy and high generality. Neural network models are found to be effective in compressing the measurement data and predicting the material characteristics, paving the way for “neural networks as datasheets” to assist power magnetics design. Transfer learning is applied to the training of neural network models to further reduce the data size requirement while maintaining sufficient model accuracy.

Read the paper: https://doi.org/10.1109/TPEL.2023.3309232

Posted in Uncategorized

US-RSE’23

The inaugural conference from the US-RSE Association just wrapped up in Chicago! Somewhat paradoxically, the gathering felt like a culmination and a new beginning all at once. It was the culmination of years of effort tracing back to 2017 when earnest discussions towards organizing the US-RSE began. Some RSEs who had collaborated online for years finally got to meet in person for the first time! It’s also a new beginning: the US-RSE has “grown up,” in a sense, and we hope this conference is the first of many more to come in the United States.

The conference theme was “Software-Enabled Discovery and Beyond.” Participants from academia, industry, government labs, and other research institutions across the country came together to discuss topics like software-driven discovery and scholarship, software technology trends, software engineering best practices, community engagement, training resources, and the ongoing quest to build and grow the RSE profession.

Our RSE Group sent a handful of representatives and contributed the following content to the conference’s technical program:

  1. INnovative Training Enabled by a Research Software Engineering Community of Trainers (INTERSECT) – Ian Cosden and Jeffrey Carver (talk)
  2. Princeton University’s RSE Summer Internship and Fellowship Programs – Joel Bretheim, Ian Cosden, Peter Elmer, Garrett Wright, Colin Swaney, Abhishek Biswas, Henry Schreiner, Kilian Lieret, and Vineet Bansal (talk)

It was gratifying to see the RSE community in the US come together in this way and collaboratively make our first ever conference a resounding success. On the final day of the conference, we learned it will indeed return next year and it will be hosted in Albuquerque!

Posted in Uncategorized

Published in PEARC ’23: Jobstats: A Slurm-Compatible Job Monitoring Platform for CPU and GPU Clusters

Josko Plazonic, Jonathan Halverson, and Troy Comi

Job monitoring on high-performance computing clusters is important for evaluating hardware performance, troubleshooting failed jobs, identifying inefficient jobs and more. The combination of the Prometheus monitoring framework and the Grafana visualization toolkit has proven successful in recent years. This work shows how four Prometheus exporters can be configured for a Slurm cluster to provide detailed job-level information on CPU/GPU efficiencies and CPU/GPU memory usage as well as node-level Network File System (NFS) statistics and cluster-level General Parallel File System (GPFS) activity. A novel approach was devised to efficiently store a summary of this data in the Slurm database for each completed job. The open-source job monitoring platform introduced here can be used for batch, interactive and Open OnDemand jobs. Several tools are presented that use the Prometheus and Slurm databases to create dashboards, utilization reports and alerts.

Read the paper: https://doi.org/10.1145/3569951.3604396

Posted in Uncategorized

Region-specific reversal of epidermal planar polarity in the rosette fancy mouse

By Maureen Cetera, Rishabh Sharan, Gabriela Hayward-Lara, Brooke Phillips, Abhishek Biswas, Madalene Halley, Evalyn Beall, Bridgett vonHoldt, Danelle Devenport

The planar cell polarity (PCP) pathway collectively orients cells with respect to a body axis. Hair follicles of the murine epidermis provide a striking readout of PCP activity in their uniform alignment across the skin. Here, we characterize, from the molecular to tissue-scale, PCP establishment in the rosette fancy mouse, a natural variant with posterior-specific whorls in its fur, to understand how epidermal polarity is coordinated across the tissue. We find that rosette hair follicles emerge with reversed orientations specifically in the posterior region, creating a mirror image of epidermal polarity. The rosette trait is associated with a missense mutation in the core PCP gene Fzd6, which alters a consensus site for N-linked glycosylation, inhibiting its membrane localization. Unexpectedly, the Fzd6 trafficking defect does not block asymmetric localization of the other PCP proteins. Rather, the normally uniform axis of PCP asymmetry rotates where the PCP-directed cell movements that orient follicles are reversed, suggesting the PCP axis rotates 180°. Collectively, our multiscale analysis of epidermal polarity reveals PCP patterning can be regionally decoupled to produce posterior whorls in the rosette fancy mouse.

Read the paper: https://doi.org/10.1242/dev.202078

Posted in Uncategorized

Targeted viral adaptation generates a simian-tropic hepatitis B virus that infects marmoset cells

By Yongzhen Liu, Thomas R. Cafiero, Debby Park, Abhishek Biswas, Benjamin Y. Winer, Cheul H. Cho, Yaron Bram, Vasuretha Chandar, Aoife K. O’ Connell, Hans P. Gertje, Nicholas Crossland, Robert E. Schwartz & Alexander Ploss

Hepatitis B virus (HBV) only infects humans and chimpanzees, posing major challenges for modeling HBV infection and chronic viral hepatitis. The major barrier in establishing HBV infection in non-human primates lies at incompatibilities between HBV and simian orthologues of the HBV receptor, sodium taurocholate co-transporting polypeptide (NTCP). Through mutagenesis analysis and screening among NTCP orthologues from Old World monkeys, New World monkeys and prosimians, we determined key residues responsible for viral binding and internalization, respectively and identified marmosets as a suitable candidate for HBV infection. Primary marmoset hepatocytes and induced pluripotent stem cell-derived hepatocyte-like cells support HBV and more efficient woolly monkey HBV (WMHBV) infection. Adapted chimeric HBV genome harboring residues 1–48 of WMHBV preS1 generated here led to a more efficient infection than wild-type HBV in primary and stem cell derived marmoset hepatocytes. Collectively, our data demonstrate that minimal targeted simianization of HBV can break the species barrier in small NHPs, paving the path for an HBV primate model.

Read the paper: https://www.nature.com/articles/s41467-023-39148-3

Posted in Uncategorized

Published in Neuron: Integrating model development across computational neuroscience, cognitive science, and machine learning

By Padraig Gleeson, Sharon Crook, David Turner, Katherine Mantel, Mayank Raunak, Ted Willke, Jonathan D. Cohen

Neuroscience, cognitive science, and computer science are increasingly benefiting through their interactions. This could be accelerated by direct sharing of computational models across disparate modeling software used in each. We describe a Model Description Format designed to meet this challenge.

Read the paper: https://doi.org/10.1016/j.neuron.2023.03.037

Posted in Uncategorized

Structural features stabilized by divalent cation coordination within hepatitis E virus ORF1 are critical for viral replication

By Robert LeDesma, Brigitte Heller, Abhishek Biswas, Stephanie Maya, Stefania Gili, John Higgins, Alexander Ploss

Hepatitis E virus (HEV) is an RNA virus responsible for over 20 million infections annually. HEV’s open reading frame (ORF)1 polyprotein is essential for genome replication, though it is unknown how the different subdomains function within a structural context. Our data show that ORF1 operates as a multifunctional protein, which is not subject to proteolytic processing. Supporting this model, scanning mutagenesis performed on the putative papain-like cysteine protease (pPCP) domain revealed six cysteines essential for viral replication. Our data are consistent with their role in divalent metal ion coordination, which governs local and interdomain interactions that are critical for the overall structure of ORF1; furthermore, the ‘pPCP’ domain can only rescue viral genome replication in trans when expressed in the context of the full-length ORF1 protein but not as an individual subdomain. Taken together, our work provides a comprehensive model of the structure and function of HEV ORF1.

Read the paper: https://doi.org/10.7554/eLife.80529

Posted in Uncategorized

Generation and characterization of genetically and antigenically diverse infectious clones of dengue virus serotypes 1-4

By Tamura T, Zhang J, Madan V, Biswas A, Schwoerer MP, Cafiero TR, Heller BL, Wang W, Ploss A.

Dengue is caused by four genetically distinct viral serotypes, dengue virus (DENV) 1-4. Following transmission by Aedes mosquitoes, DENV can cause a broad spectrum of clinically apparent disease ranging from febrile illness to dengue hemorrhagic fever and dengue shock syndrome. Progress in the understanding of different dengue serotypes and their impacts on specific host-virus interactions has been hampered by the scarcity of tools that adequately reflect their antigenic and genetic diversity. To bridge this gap, we created and characterized infectious clones of DENV1-4 originating from South America, Africa, and Southeast Asia. Analysis of whole viral genome sequences of five DENV isolates from each of the four serotypes confirmed their broad genetic and antigenic diversity. Using a modified circular polymerase extension reaction (CPER), we generated de novo viruses from these isolates. The resultant clones replicated robustly in human and insect cells at levels similar to those of the parental strains. To investigate in vivo properties of these genetically diverse isolates, representative viruses from each DENV serotype were administered to NOD Rag1-/-, IL2rgnull Flk2-/- (NRGF) mice, engrafted with components of a human immune system. All DENV strains tested resulted in viremia in humanized mice and induced cellular and IgM immune responses. Collectively, we describe here a workflow for rapidly generating de novo infectious clones of DENV – and conceivably other RNA viruses. The infectious clones described here are a valuable resource for reverse genetic studies and for characterizing host responses to DENV in vitro and in vivo.

Read the paper: https://doi.org/10.1080/22221751.2021.2021808

Posted in Uncategorized