University of Wisconsin–Madison

UniversityMadison, Wisconsin, United States

Research output, citation impact, and the most-cited recent papers from University of Wisconsin–Madison (United States). Aggregated across the NobleBlocks index of 300M+ scholarly works.

Total works

375.8K

Citations

43.5M

h-index

1425

i10-index

505.2K

Also known as

UW–MadisonUniversidad de Wisconsin-MadisonUniversity of Wisconsin–MadisonUniversité du Wisconsin à Madison

Top-cited papers from University of Wisconsin–Madison

Fitting Linear Mixed-Effects Models Using <b>lme4</b>

Douglas M. Bates, Martin Mächler, Benjamin M. Bolker, Steve Walker

2015· Journal of Statistical Software86.5Kdoi:10.18637/jss.v067.i01

Maximum likelihood or restricted maximum likelihood (REML) estimates of the parameters in linear mixed-effects models can be determined using the lmer function in the lme4 package for R. As for most model-fitting functions in R, the model is described in an lmer call by a formula, in this case including both fixed- and random-effects terms. The formula and data together determine a numerical representation of the model from which the profiled deviance or the profiled REML criterion can be evaluated as a function of some of the model parameters. The appropriate criterion is optimized, using one of the constrained optimization functions in R, to provide the parameter estimates. We describe the structure of the model, the steps in evaluating the profiled deviance or REML criterion, and the structure of classes or types that represents such a model. Sufficient detail is included to allow specialization of these structures by users who wish to write functions to fit specialized linear mixed models, such as models incorporating pedigrees or smoothing splines, that are not easily expressible in the formula language used by lmer.

SciPy 1.0: fundamental algorithms for scientific computing in Python

Pauli Virtanen, Ralf Gommers, Travis E. Oliphant, Matt Haberland +4 more

2020· Nature Methods38.5Kdoi:10.1038/s41592-019-0686-2

SciPy is an open-source scientific computing library for the Python programming language. Since its initial release in 2001, SciPy has become a de facto standard for leveraging scientific algorithms in Python, with over 600 unique code contributors, thousands of dependent packages, over 100,000 dependent repositories and millions of downloads per year. In this work, we provide an overview of the capabilities and development practices of SciPy 1.0 and highlight some recent technical developments.

The central role of the propensity score in observational studies for causal effects

Paul R. Rosenbaum, Donald B. Rubin

1983· Biometrika31.1Kdoi:10.1093/biomet/70.1.41

The propensity score is the conditional probability of assignment to a particular treatment given a vector of observed covariates. Both large and small sample theory show that adjustment for the scalar propensity score is sufficient to remove bias due to all observed covariates. Applications include: (i) matched sampling on the univariate propensity score, which is a generalization of discriminant matching, (ii) multivariate adjustment by subclassification on the propensity score where the same subclasses are used to estimate treatment effects for all outcome variables and in all subpopulations, and (iii) visual representation of multivariate covariance adjustment by a two- dimensional plot.

MrBayes 3.2: Efficient Bayesian Phylogenetic Inference and Model Choice Across a Large Model Space

Fredrik Ronquist, Maxim Teslenko, Paul van der Mark, Daniel L. Ayres +4 more

2012· Systematic Biology28.2Kdoi:10.1093/sysbio/sys029

Since its introduction in 2001, MrBayes has grown in popularity as a software package for Bayesian phylogenetic inference using Markov chain Monte Carlo (MCMC) methods. With this note, we announce the release of version 3.2, a major upgrade to the latest official release presented in 2003. The new version provides convergence diagnostics and allows multiple analyses to be run in parallel with convergence progress monitored on the fly. The introduction of new proposals and automatic optimization of tuning parameters has improved convergence for many problems. The new version also sports significantly faster likelihood calculations through streaming single-instruction-multiple-data extensions (SSE) and support of the BEAGLE library, allowing likelihood calculations to be delegated to graphics processing units (GPUs) on compatible hardware. Speedup factors range from around 2 with SSE code to more than 50 with BEAGLE for codon problems. Checkpointing across all models allows long runs to be completed even when an analysis is prematurely terminated. New models include relaxed clocks, dating, model averaging across time-reversible substitution models, and support for hard, negative, and partial (backbone) tree constraints. Inference of species trees from gene trees is supported by full incorporation of the Bayesian estimation of species trees (BEST) algorithms. Marginal model likelihoods for Bayes factor tests can be estimated accurately across the entire model space using the stepping stone method. The new version provides more output options than previously, including samples of ancestral states, site rates, site d(N)/d(S) rations, branch rates, and node dates. A wide range of statistics on tree parameters can also be output for visualization in FigTree and compatible software.

RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome

Bo Li, Colin N. Dewey

2011· BMC Bioinformatics23.7Kdoi:10.1186/1471-2105-12-323

BACKGROUND: RNA-Seq is revolutionizing the way transcript abundances are measured. A key challenge in transcript quantification from RNA-Seq data is the handling of reads that map to multiple genes or isoforms. This issue is particularly important for quantification with de novo transcriptome assemblies in the absence of sequenced genomes, as it is difficult to determine which transcripts are isoforms of the same gene. A second significant issue is the design of RNA-Seq experiments, in terms of the number of reads, read length, and whether reads come from one or both ends of cDNA fragments. RESULTS: We present RSEM, an user-friendly software package for quantifying gene and isoform abundances from single-end or paired-end RNA-Seq data. RSEM outputs abundance estimates, 95% credibility intervals, and visualization files and can also simulate RNA-Seq data. In contrast to other existing tools, the software does not require a reference genome. Thus, in combination with a de novo transcriptome assembler, RSEM enables accurate transcript quantification for species without sequenced genomes. On simulated and real data sets, RSEM has superior or comparable performance to quantification methods that rely on a reference genome. Taking advantage of RSEM's ability to effectively use ambiguously-mapping reads, we show that accurate gene-level abundance estimates are best obtained with large numbers of short single-end reads. On the other hand, estimates of the relative frequencies of isoforms within single genes may be improved through the use of paired-end reads, depending on the number of possible splice forms for each gene. CONCLUSIONS: RSEM is an accurate and user-friendly software tool for quantifying transcript abundances from RNA-Seq data. As it does not rely on the existence of a reference genome, it is particularly useful for quantification with de novo transcriptome assemblies. In addition, RSEM has enabled valuable guidance for cost-efficient design of quantification experiments with RNA-Seq, which is currently relatively expensive.

Welcome to the Tidyverse

Hadley Wickham, Mara Averick, Jennifer Bryan, Winston Chang +4 more

2019· The Journal of Open Source Software22.0Kdoi:10.21105/joss.01686

RESUMENEvaluación del efecto de un curso nivelatorio de matemáticas en educación superior: el caso de Matemáticas Básicas La investigación evalúa los efectos de tomar un curso de nivelación obligatorio, que se ofrece una única vez (i.e.no puede repetirse) para estudiantes de pregrado, sobre la probabilidad de matricularse, el desempeño en las asignaturas universitarias de matemáticas, avance en la carrera y probabilidad de graduarse.La investigación emplea un diseño de regresión discontinua que aprovecha el hecho de que los estudiantes admitidos a la universidad que tengan en la prueba de ingreso un puntaje en matemáticas inferior a un umbral están obligados a tomar el curso de nivelación de matemáticas básicas.Se encuentra que el curso de nivelación no tiene un efecto en la probabilidad de matricularse, de desvincularse del programa ni de graduarse seis años después de haber sido admitido.Hay un efecto

A global reference for human genetic variation

Corresponding authors, Adam Auton, Gonçalo R. Abecasis, David M. Altshuler +4 more

2015· Nature20.2Kdoi:10.1038/nature15393

The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies. Results for the final phase of the 1000 Genomes Project are presented including whole-genome sequencing, targeted exome sequencing, and genotyping on high-density SNP arrays for 2,504 individuals across 26 populations, providing a global reference data set to support biomedical genetics. The 1000 Genomes Project has sought to comprehensively catalogue human genetic variation across populations, providing a valuable public genomic resource. The data obtained so far have found applications ranging from association studies and fine mapping studies to the filtering of likely neutral variants in rare-disease cohorts. The authors now report on the final phase of the project, phase 3, which covers previously uncharacterized areas of human genetic diversity in terms of the populations sampled and categories of characterized variation. The sample now includes more than 2,500 individuals from 26 global populations, with low coverage whole-genome and deep exome sequencing, as well as dense microarray genotyping. They find that while most common variants are shared across populations, rarer variants are often restricted to closely related populations. The authors also demonstrate the use of the phase 3 dataset as a reference panel for imputation to improve the resolution in genetic association studies.

A Simple, Positive Semi-Definite, Heteroskedasticity and Autocorrelation Consistent Covariance Matrix

Whitney K. Newey, Kenneth D. West

1987· Econometrica17.2Kdoi:10.2307/1913610

This paper describes a simple method of calculating a heteroskedasticity and autocorrelation consistent covariance matrix that is positive semi-definite by construction. It also establishes consistency of the estimated covariance matrix under fairly general conditions.

Embryonic Stem Cell Lines Derived from Human Blastocysts

James A. Thomson, Joseph Itskovitz‐Eldor, Sander S. Shapiro, Michelle Waknitz +3 more

1998· Science15.9Kdoi:10.1126/science.282.5391.1145

Human blastocyst-derived, pluripotent cell lines are described that have normal karyotypes, express high levels of telomerase activity, and express cell surface markers that characterize primate embryonic stem cells but do not characterize other early lineages. After undifferentiated proliferation in vitro for 4 to 5 months, these cells still maintained the developmental potential to form trophoblast and derivatives of all three embryonic germ layers, including gut epithelium (endoderm); cartilage, bone, smooth muscle, and striated muscle (mesoderm); and neural epithelium, embryonic ganglia, and stratified squamous epithelium (ectoderm). These cell lines should be useful in human developmental biology, drug discovery, and transplantation medicine.

A comprehensive set of sequence analysis programs for the VAX

John Devereux, Paul Haeberli, Oliver Smithies

1984· Nucleic Acids Research14.4Kdoi:10.1093/nar/12.1part1.387

The University of Wisconsin Genetics Computer Group (UWGCG) has been organized to develop computational tools for the analysis and publication of biological sequence data. A group of programs that will interact with each other has been developed for the Digital Equipment Corporation VAX computer using the VMS operating system. The programs available and the conditions for transfer are described.

A Proportional Hazards Model for the Subdistribution of a Competing Risk

Jason P. Fine, Malcolm H. Ray

1999· Journal of the American Statistical Association13.7Kdoi:10.1080/01621459.1999.10474144

Abstract With explanatory covariates, the standard analysis for competing risks data involves modeling the cause-specific hazard functions via a proportional hazards assumption. Unfortunately, the cause-specific hazard function does not have a direct interpretation in terms of survival probabilities for the particular failure type. In recent years many clinicians have begun using the cumulative incidence function, the marginal failure probabilities for a particular cause, which is intuitively appealing and more easily explained to the nonstatistician. The cumulative incidence is especially relevant in cost-effectiveness analyses in which the survival probabilities are needed to determine treatment utility. Previously, authors have considered methods for combining estimates of the cause-specific hazard functions under the proportional hazards formulation. However, these methods do not allow the analyst to directly assess the effect of a covariate on the marginal probability function. In this article we propose a novel semiparametric proportional hazards model for the subdistribution. Using the partial likelihood principle and weighting techniques, we derive estimation and inference procedures for the finite-dimensional regression parameter under a variety of censoring scenarios. We give a uniformly consistent estimator for the predicted cumulative incidence for an individual with certain covariates; confidence intervals and bands can be obtained analytically or with an easy-to-implement simulation technique. To contrast the two approaches, we analyze a dataset from a breast cancer clinical trial under both models. Key Words: Hazard of subdistributionMartingalePartial likelihoodTransformation model

Global Consequences of Land Use

Jonathan A. Foley, Ruth DeFries, Gregory P. Asner, Carol Barford +4 more

2005· Science13.0Kdoi:10.1126/science.1111772

Land use has generally been considered a local environmental issue, but it is becoming a force of global importance. Worldwide changes to forests, farmlands, waterways, and air are being driven by the need to provide food, fiber, water, and shelter to more than six billion people. Global croplands, pastures, plantations, and urban areas have expanded in recent decades, accompanied by large increases in energy, water, and fertilizer consumption, along with considerable losses of biodiversity. Such changes in land use have enabled humans to appropriate an increasing share of the planet's resources, but they also potentially undermine the capacity of ecosystems to sustain food production, maintain freshwater and forest resources, regulate climate and air quality, and ameliorate infectious diseases. We face the challenge of managing trade-offs between immediate human needs and maintaining the capacity of the biosphere to provide goods and services in the long term.

A Paradigm for Developing Better Measures of Marketing Constructs

Gilbert A. Churchill

1979· Journal of Marketing Research12.7Kdoi:10.2307/3150876

A critical element in the evolution of a fundamental body of knowledge in marketing, as well as for improved marketing practice, is the development of better measures of the variables with which ma...

Bioconductor: open software development for computational biology and bioinformatics

Robert Gentleman, Vincent J. Carey, Douglas M. Bates, Ben Bolstad +4 more

2004· Genome biology12.6Kdoi:10.1186/gb-2004-5-10-r80

The Bioconductor project is an initiative for the collaborative creation of extensible software for computational biology and bioinformatics. The goals of the project include: fostering collaborative development and widespread use of innovative software, reducing barriers to entry into interdisciplinary scientific research, and promoting the achievement of remote reproducibility of research results. We describe details of our aims and methods, identify current challenges, compare Bioconductor to other open bioinformatics projects, and provide working examples.

Managing Legitimacy: Strategic and Institutional Approaches

Mark C. Suchman

1995· Academy of Management Review12.5Kdoi:10.5465/amr.1995.9508080331

This article synthesizes the large but diverse literature on organizational legitimacy, highlighting similarities and disparities among the leading strategic and institutional approaches. The analysis identifies three primary forms of legitimacy: pragmatic, based on audience self-interest; moral, based on normative approval: and cognitive, based on comprehensibility and taken-for-grantedness. The article then examines strategies for gaining, maintaining, and repairing legitimacy of each type, suggesting both the promises and the pitfalls of such instrumental manipulations.

On the Einstein Podolsky Rosen paradox

J. S. Bell

1964· Physics Physique Fizika12.3Kdoi:10.1103/physicsphysiquefizika.1.195

Planetary boundaries: Guiding human development on a changing planet

Will Steffen, Katherine Richardson, Johan Rockström, Sarah Cornell +4 more

2015· Science12.3Kdoi:10.1126/science.1259855

The planetary boundaries framework defines a safe operating space for humanity based on the intrinsic biophysical processes that regulate the stability of the Earth system. Here, we revise and update the planetary boundary framework, with a focus on the underpinning biophysical science, based on targeted input from expert research communities and on more general scientific advances over the past 5 years. Several of the boundaries now have a two-tier approach, reflecting the importance of cross-scale interactions and the regional-level heterogeneity of the processes that underpin the boundaries. Two core boundaries—climate change and biosphere integrity—have been identified, each of which has the potential on its own to drive the Earth system into a new state should they be substantially and persistently transgressed.

SciPy 1.0: fundamental algorithms for scientific computing in Python

Pauli Virtanen, Ralf Gommers, Travis E. Oliphant, Matt Haberland +4 more

2019· Monash University Research Portal (Monash University)11.6Kdoi:10.17863/cam.64134

Abstract: SciPy is an open-source scientific computing library for the Python programming language. Since its initial release in 2001, SciPy has become a de facto standard for leveraging scientific algorithms in Python, with over 600 unique code contributors, thousands of dependent packages, over 100,000 dependent repositories and millions of downloads per year. In this work, we provide an overview of the capabilities and development practices of SciPy 1.0 and highlight some recent technical developments.

Evaluation, Treatment, and Prevention of Vitamin D Deficiency: an Endocrine Society Clinical Practice Guideline

Michael F. Holick, Neil Binkley, Heike A. Bischoff‐Ferrari, Catherine M. Gordon +4 more

2011· The Journal of Clinical Endocrinology & Metabolism10.5Kdoi:10.1210/jc.2011-0385

OBJECTIVE: The objective was to provide guidelines to clinicians for the evaluation, treatment, and prevention of vitamin D deficiency with an emphasis on the care of patients who are at risk for deficiency. PARTICIPANTS: The Task Force was composed of a Chair, six additional experts, and a methodologist. The Task Force received no corporate funding or remuneration. CONSENSUS PROCESS: Consensus was guided by systematic reviews of evidence and discussions during several conference calls and e-mail communications. The draft prepared by the Task Force was reviewed successively by The Endocrine Society's Clinical Guidelines Subcommittee, Clinical Affairs Core Committee, and cosponsoring associations, and it was posted on The Endocrine Society web site for member review. At each stage of review, the Task Force received written comments and incorporated needed changes. CONCLUSIONS: Considering that vitamin D deficiency is very common in all age groups and that few foods contain vitamin D, the Task Force recommended supplementation at suggested daily intake and tolerable upper limit levels, depending on age and clinical circumstances. The Task Force also suggested the measurement of serum 25-hydroxyvitamin D level by a reliable assay as the initial diagnostic test in patients at risk for deficiency. Treatment with either vitamin D(2) or vitamin D(3) was recommended for deficient patients. At the present time, there is not sufficient evidence to recommend screening individuals who are not at risk for deficiency or to prescribe vitamin D to attain the noncalcemic benefit for cardiovascular protection.

Induced Pluripotent Stem Cell Lines Derived from Human Somatic Cells

Junying Yu, Maxim A. Vodyanik, Kim Smuga-Otto, Jessica Antosiewicz‐Bourget +4 more

2007· Science10.3Kdoi:10.1126/science.1151526

Somatic cell nuclear transfer allows trans-acting factors present in the mammalian oocyte to reprogram somatic cell nuclei to an undifferentiated state. We show that four factors (OCT4, SOX2, NANOG, and LIN28) are sufficient to reprogram human somatic cells to pluripotent stem cells that exhibit the essential characteristics of embryonic stem (ES) cells. These induced pluripotent human stem cells have normal karyotypes, express telomerase activity, express cell surface markers and genes that characterize human ES cells, and maintain the developmental potential to differentiate into advanced derivatives of all three primary germ layers. Such induced pluripotent human cell lines should be useful in the production of new disease models and in drug development, as well as for applications in transplantation medicine, once technical limitations (for example, mutation through viral integration) are eliminated.

Search all NobleBlocks papers mentioning “University of Wisconsin–Madison” →