NobleBlocks
Constructor University logo

Constructor University

UniversityBremen, City state Bremen, Germany

Research output, citation impact, and the most-cited recent papers from Constructor University (Germany). Aggregated across the NobleBlocks index of 300M+ scholarly works.

Total works
11.4K
Citations
1.2M
h-index
303
i10-index
11.0K
Also known as
Constructor UniversityConstructor University BremenInternational University BremenJacobs UniversityJacobs University Bremen

Top-cited papers from Constructor University

The SILVA ribosomal RNA gene database project: improved data processing and web-based tools
Christian Quast, Elmar Pruesse, Pelin Yilmaz, Jan Gerken +4 more
2012· Nucleic Acids Research33.9Kdoi:10.1093/nar/gks1219

SILVA (from Latin silva, forest, http://www.arb-silva.de) is a comprehensive web resource for up to date, quality-controlled databases of aligned ribosomal RNA (rRNA) gene sequences from the Bacteria, Archaea and Eukaryota domains and supplementary online services. The referred database release 111 (July 2012) contains 3 194 778 small subunit and 288 717 large subunit rRNA gene sequences. Since the initial description of the project, substantial new features have been introduced, including advanced quality control procedures, an improved rRNA gene aligner, online tools for probe and primer evaluation and optimized browsing, searching and downloading on the website. Furthermore, the extensively curated SILVA taxonomy and the new non-redundant SILVA datasets provide an ideal reference for high-throughput classification of data from next-generation sequencing approaches.

Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
2014· arXiv (Cornell University)14.6Kdoi:10.48550/arxiv.1409.0473

Neural machine translation is a recently proposed approach to machine translation. Unlike the traditional statistical machine translation, the neural machine translation aims at building a single neural network that can be jointly tuned to maximize the translation performance. The models proposed recently for neural machine translation often belong to a family of encoder-decoders and consists of an encoder that encodes a source sentence into a fixed-length vector from which a decoder generates a translation. In this paper, we conjecture that the use of a fixed-length vector is a bottleneck in improving the performance of this basic encoder-decoder architecture, and propose to extend this by allowing a model to automatically (soft-)search for parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly. With this new approach, we achieve a translation performance comparable to the existing state-of-the-art phrase-based system on the task of English-to-French translation. Furthermore, qualitative analysis reveals that the (soft-)alignments found by the model agree well with our intuition.

Evaluation of general 16S ribosomal RNA gene PCR primers for classical and next-generation sequencing-based diversity studies
Anna Klindworth, Elmar Pruesse, Timmy Schweer, Jörg Peplies +3 more
2012· Nucleic Acids Research9.0Kdoi:10.1093/nar/gks808

16S ribosomal RNA gene (rDNA) amplicon analysis remains the standard approach for the cultivation-independent investigation of microbial diversity. The accuracy of these analyses depends strongly on the choice of primers. The overall coverage and phylum spectrum of 175 primers and 512 primer pairs were evaluated in silico with respect to the SILVA 16S/18S rDNA non-redundant reference dataset (SSURef 108 NR). Based on this evaluation a selection of 'best available' primer pairs for Bacteria and Archaea for three amplicon size classes (100-400, 400-1000, ≥ 1000 bp) is provided. The most promising bacterial primer pair (S-D-Bact-0341-b-S-17/S-D-Bact-0785-a-A-21), with an amplicon size of 464 bp, was experimentally evaluated by comparing the taxonomic distribution of the 16S rDNA amplicons with 16S rDNA fragments from directly sequenced metagenomes. The results of this study may be used as a guideline for selecting primer pairs with the best overall coverage and phylum spectrum for specific applications, therefore reducing the bias in PCR-based microbial diversity studies.

SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB
Elmar Pruesse, Christian Quast, Katrin Knittel, Bernhard M. Fuchs +3 more
2007· Nucleic Acids Research6.9Kdoi:10.1093/nar/gkm864

Sequencing ribosomal RNA (rRNA) genes is currently the method of choice for phylogenetic reconstruction, nucleic acid based detection and quantification of microbial diversity. The ARB software suite with its corresponding rRNA datasets has been accepted by researchers worldwide as a standard tool for large scale rRNA analysis. However, the rapid increase of publicly available rRNA sequence data has recently hampered the maintenance of comprehensive and curated rRNA knowledge databases. A new system, SILVA (from Latin silva, forest), was implemented to provide a central comprehensive web resource for up to date, quality controlled databases of aligned rRNA sequences from the Bacteria, Archaea and Eukarya domains. All sequences are checked for anomalies, carry a rich set of sequence associated contextual information, have multiple taxonomic classifications, and the latest validly described nomenclature. Furthermore, two precompiled sequence datasets compatible with ARB are offered for download on the SILVA website: (i) the reference (Ref) datasets, comprising only high quality, nearly full length sequences suitable for in-depth phylogenetic analysis and probe design and (ii) the comprehensive Parc datasets with all publicly available rRNA sequences longer than 300 nucleotides suitable for biodiversity analyses. The latest publicly available database release 91 (August 2007) hosts 547 521 sequences split into 461 823 small subunit and 85 689 large subunit rRNAs.

On the Properties of Neural Machine Translation: Encoder–Decoder Approaches
Kyunghyun Cho, Bart van Merriënboer, Dzmitry Bahdanau, Yoshua Bengio
20146.6Kdoi:10.3115/v1/w14-4012

Neural machine translation is a relatively new approach to statistical machine translation based purely on neural networks.The neural machine translation models often consist of an encoder and a decoder.The encoder extracts a fixed-length representation from a variable-length input sentence, and the decoder generates a correct translation from this representation.In this paper, we focus on analyzing the properties of the neural machine translation using two models; RNN Encoder-Decoder and a newly proposed gated recursive convolutional neural network.We show that the neural machine translation performs relatively well on short sentences without unknown words, but its performance degrades rapidly as the length of the sentence and the number of unknown words increase.Furthermore, we find that the proposed gated recursive convolutional network learns a grammatical structure of a sentence automatically.

The SILVA and “All-species Living Tree Project (LTP)” taxonomic frameworks
Pelin Yilmaz, Laura Wegener Parfrey, Pablo Yarza, Jan Gerken +4 more
2013· Nucleic Acids Research4.1Kdoi:10.1093/nar/gkt1209

SILVA (from Latin silva, forest, http://www.arb-silva.de) is a comprehensive resource for up-to-date quality-controlled databases of aligned ribosomal RNA (rRNA) gene sequences from the Bacteria, Archaea and Eukaryota domains and supplementary online services. SILVA provides a manually curated taxonomy for all three domains of life, based on representative phylogenetic trees for the small- and large-subunit rRNA genes. This article describes the improvements the SILVA taxonomy has undergone in the last 3 years. Specifically we are focusing on the curation process, the various resources used for curation and the comparison of the SILVA taxonomy with Greengenes and RDP-II taxonomies. Our comparisons not only revealed a reasonable overlap between the taxa names, but also points to significant differences in both names and numbers of taxa between the three resources.

The UNITE database for molecular identification of fungi: handling dark taxa and parallel taxonomic classifications
Rolf Henrik Nilsson, Karl-Henrik Larsson, Andy F. S. Taylor, Johan Bengtsson‐Palme +4 more
2018· Nucleic Acids Research3.6Kdoi:10.1093/nar/gky1022

UNITE (https://unite.ut.ee/) is a web-based database and sequence management environment for the molecular identification of fungi. It targets the formal fungal barcode-the nuclear ribosomal internal transcribed spacer (ITS) region-and offers all ∼1 000 000 public fungal ITS sequences for reference. These are clustered into ∼459 000 species hypotheses and assigned digital object identifiers (DOIs) to promote unambiguous reference across studies. In-house and web-based third-party sequence curation and annotation have resulted in more than 275 000 improvements to the data over the past 15 years. UNITE serves as a data provider for a range of metabarcoding software pipelines and regularly exchanges data with all major fungal sequence databases and other community resources. Recent improvements include redesigned handling of unclassifiable species hypotheses, integration with the taxonomic backbone of the Global Biodiversity Information Facility, and support for an unlimited number of parallel taxonomic classification systems.

Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
Kyunghyun Cho, Bart van Merriënboer, Çağlar Gülçehre, Dzmitry Bahdanau +3 more
2014· arXiv (Cornell University)3.4Kdoi:10.48550/arxiv.1406.1078

In this paper, we propose a novel neural network model called RNN Encoder-Decoder that consists of two recurrent neural networks (RNN). One RNN encodes a sequence of symbols into a fixed-length vector representation, and the other decodes the representation into another sequence of symbols. The encoder and decoder of the proposed model are jointly trained to maximize the conditional probability of a target sequence given a source sequence. The performance of a statistical machine translation system is empirically found to improve by using the conditional probabilities of phrase pairs computed by the RNN Encoder-Decoder as an additional feature in the existing log-linear model. Qualitatively, we show that the proposed model learns a semantically and syntactically meaningful representation of linguistic phrases.

SINA: Accurate high-throughput multiple sequence alignment of ribosomal RNA genes
Elmar Pruesse, Jörg Peplies, Frank Oliver Glöckner
2012· Bioinformatics3.2Kdoi:10.1093/bioinformatics/bts252

MOTIVATION: In the analysis of homologous sequences, computation of multiple sequence alignments (MSAs) has become a bottleneck. This is especially troublesome for marker genes like the ribosomal RNA (rRNA) where already millions of sequences are publicly available and individual studies can easily produce hundreds of thousands of new sequences. Methods have been developed to cope with such numbers, but further improvements are needed to meet accuracy requirements. RESULTS: In this study, we present the SILVA Incremental Aligner (SINA) used to align the rRNA gene databases provided by the SILVA ribosomal RNA project. SINA uses a combination of k-mer searching and partial order alignment (POA) to maintain very high alignment accuracy while satisfying high throughput performance demands. SINA was evaluated in comparison with the commonly used high throughput MSA programs PyNAST and mothur. The three BRAliBase III benchmark MSAs could be reproduced with 99.3, 97.6 and 96.1 accuracy. A larger benchmark MSA comprising 38 772 sequences could be reproduced with 98.9 and 99.3% accuracy using reference MSAs comprising 1000 and 5000 sequences. SINA was able to achieve higher accuracy than PyNAST and mothur in all performed benchmarks. AVAILABILITY: Alignment of up to 500 sequences using the latest SILVA SSU/LSU Ref datasets as reference MSA is offered at http://www.arb-silva.de/aligner. This page also links to Linux binaries, user manual and tutorial. SINA is made available under a personal use license.

Differences Between Tight and Loose Cultures: A 33-Nation Study
Michele J. Gelfand, Jana L. Raver, Lisa H. Nishii, Lisa M. Leslie +4 more
2011· Science3.2Kdoi:10.1126/science.1197754

With data from 33 nations, we illustrate the differences between cultures that are tight (have many strong norms and a low tolerance of deviant behavior) versus loose (have weak social norms and a high tolerance of deviant behavior). Tightness-looseness is part of a complex, loosely integrated multilevel system that comprises distal ecological and historical threats (e.g., high population density, resource scarcity, a history of territorial conflict, and disease and environmental threats), broad versus narrow socialization in societal institutions (e.g., autocracy, media regulations), the strength of everyday recurring situations, and micro-level psychological affordances (e.g., prevention self-guides, high regulatory strength, need for structure). This research advances knowledge that can foster cross-cultural understanding in a world of increasing global interdependence and has implications for modeling cultural change.

JSpeciesWS: a web server for prokaryotic species circumscription based on pairwise genome comparison
Michael Richter, Ramon Rosselló‐Móra, Frank Oliver Glöckner, Jörg Peplies
2015· Bioinformatics2.9Kdoi:10.1093/bioinformatics/btv681

UNLABELLED: JSpecies Web Server (JSpeciesWS) is a user-friendly online service for in silico calculating the extent of identity between two genomes, a parameter routinely used in the process of polyphasic microbial species circumscription. The service measures the average nucleotide identity (ANI) based on BLAST+ (ANIb) and MUMmer (ANIm), as well as correlation indexes of tetra-nucleotide signatures (Tetra). In addition, it provides a Tetra Correlation Search function, which allows to rapidly compare selected genomes against a continuously updated reference database with currently about 32 000 published whole and draft genome sequences. For comparison, own genomes can be uploaded and references can be selected from the JSpeciesWS reference database. The service indicates whether two genomes share genomic identities above or below the species embracing thresholds, and serves as a fast way to allocate unknown genomes in the frame of the hitherto sequenced species. AVAILABILITY AND IMPLEMENTATION: JSpeciesWS is available at http://jspecies.ribohost.com/jspeciesws SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. CONTACT: mrichter@ribocon.com.

LOFAR: The LOw-Frequency ARray
M. P. van Haarlem, M. W. Wise, A. W. Gunst, G. Heald +4 more
2013· Astronomy and Astrophysics2.7Kdoi:10.1051/0004-6361/201220873

LOFAR, the LOw-Frequency ARray, is a new-generation radio interferometer constructed in the north of the Netherlands and across europe. Utilizing a novel phased-array design, LOFAR covers the largely unexplored low-frequency range from 10–240 MHz and provides a number of unique observing capabilities. Spreading out from a core located near the village of Exloo in the northeast of the Netherlands, a total of 40 LOFAR stations are nearing completion. A further five stations have been deployed throughout Germany, and one station has been built in each of France, Sweden, and the UK. Digital beam-forming techniques make the LOFAR system agile and allow for rapid repointing of the telescope as well as the potential for multiple simultaneous observations. With its dense core array and long interferometric baselines, LOFAR achieves unparalleled sensitivity and angular resolution in the low-frequency radio regime. The LOFAR facilities are jointly operated by the International LOFAR Telescope (ILT) foundation, as an observatory open to the global astronomical community. LOFAR is one of the first radio observatories to feature automated processing pipelines to deliver fully calibrated science products to its user community. LOFAR’s new capabilities, techniques and modus operandi make it an important pathfinder for the Square Kilometre Array (SKA). We give an overview of the LOFAR instrument, its major hardware and software components, and the core science objectives that have driven its design. In addition, we present a selection of new results from the commissioning phase of this new radio observatory.

Modernization, Cultural Change, and Democracy
Ronald Inglehart, Christian Welzel
2001· Cambridge University Press eBooks2.6Kdoi:10.1017/cbo9780511790881

This book demonstrates that people's basic values and beliefs are changing, in ways that affect their political, sexual, economic, and religious behaviour. These changes are roughly predictable: to a large extent, they can be interpreted on the basis of a revised version of modernisation theory presented here. Drawing on a massive body of evidence from societies containing 85 percent of the world's population, the authors demonstrate that modernisation is a process of human development, in which economic development gives rise to cultural changes that make individual autonomy, gender equality, and democracy increasingly likely. The authors present a model of social change that predicts how the value systems play a crucial role in the emergence and flourishing of democratic institutions - and that modernisation brings coherent cultural changes that are conducive to democratisation.

Spatial Modulation
Raed Mesleh, Harald Haas, Sinan Sinanović, Chang Wook Ahn +1 more
2008· IEEE Transactions on Vehicular Technology2.2Kdoi:10.1109/tvt.2007.912136

<para xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> Spatial modulation (SM) is a recently developed transmission technique that uses multiple antennas. The basic idea is to map a block of information bits to two information carrying units: 1) a symbol that was chosen from a constellation diagram and 2) a unique transmit antenna number that was chosen from a set of transmit antennas. The use of the transmit antenna number as an information-bearing unit increases the overall spectral efficiency by the base-two logarithm of the number of transmit antennas. At the receiver, a maximum receive ratio combining algorithm is used to retrieve the transmitted block of information bits. Here, we apply SM to orthogonal frequency division multiplexing (OFDM) transmission. We develop an analytical approach for symbol error ratio (SER) analysis of the SM algorithm in independent identically distributed (i.i.d.) Rayleigh channels. The analytical and simulation results closely match. The performance and the receiver complexity of the SM–OFDM technique are compared to those of the vertical Bell Labs layered space–time (V-BLAST–OFDM) and Alamouti–OFDM algorithms. V-BLAST uses minimum mean square error (MMSE) detection with ordered successive interference cancellation. The combined effect of spatial correlation, mutual antenna coupling, and Rician fading on both coded and uncoded systems are presented. It is shown that, for the same spectral efficiency, SM results in a reduction of around 90% in receiver complexity as compared to V-BLAST and nearly the same receiver complexity as Alamouti. In addition, we show that SM achieves better performance in all studied channel conditions, as compared with other techniques. It is also shown to efficiently work for any configuration of transmit and receive antennas, even for the case of fewer receive antennas than transmit antennas. </para>

Construction of Crystalline 2D Covalent Organic Frameworks with Remarkable Chemical (Acid/Base) Stability via a Combined Reversible and Irreversible Route
Sharath Kandambeth, Arijit Mallick, Binit Lukose, Manoj V. Mane +2 more
2012· Journal of the American Chemical Society2.1Kdoi:10.1021/ja308278w

Two new chemically stable [acid and base] 2D crystalline covalent organic frameworks (COFs) (TpPa-1 and TpPa-2) were synthesized using combined reversible and irreversible organic reactions. Syntheses of these COFs were done by the Schiff base reactions of 1,3,5-triformylphloroglucinol (Tp) with p-phenylenediamine (Pa-1) and 2,5-dimethyl-p-phenylenediamine (Pa-2), respectively, in 1:1 mesitylene/dioxane. The expected enol-imine (OH) form underwent irreversible proton tautomerism, and only the keto-enamine form was observed. Because of the irreversible nature of the total reaction and the absence of an imine bond in the system, TpPa-1 and TpPa-2 showed strong resistance toward acid (9 N HCl) and boiling water. Moreover, TpPa-2 showed exceptional stability in base (9 N NaOH) as well.

Attention-Based Models for Speech Recognition
Jan Chorowski, Dzmitry Bahdanau, Dmitriy Serdyuk, Kyunghyun Cho +1 more
2015· arXiv (Cornell University)1.8Kdoi:10.48550/arxiv.1506.07503

Recurrent sequence generators conditioned on input data through an attention mechanism have recently shown very good performance on a range of tasks in- cluding machine translation, handwriting synthesis and image caption gen- eration. We extend the attention-mechanism with features needed for speech recognition. We show that while an adaptation of the model used for machine translation in reaches a competitive 18.7% phoneme error rate (PER) on the TIMIT phoneme recognition task, it can only be applied to utterances which are roughly as long as the ones it was trained on. We offer a qualitative explanation of this failure and propose a novel and generic method of adding location-awareness to the attention mechanism to alleviate this issue. The new method yields a model that is robust to long inputs and achieves 18% PER in single utterances and 20% in 10-times longer (repeated) utterances. Finally, we propose a change to the at- tention mechanism that prevents it from concentrating too much on single frames, which further reduces PER to 17.6% level.

Influence of quantum confinement on the electronic structure of the transition metal sulfide<mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" display="inline"><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:math>S<mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" display="inline"><mml:mrow><mml:msub><mml:mrow/><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math>
Agnieszka Kuc, Nourdine Zibouche, Thomas Heine
2011· Physical Review B1.7Kdoi:10.1103/physrevb.83.245213

Bulk MoS${}_{2}$, a prototypical layered transition-metal dichalcogenide, is an indirect band gap semiconductor. Reducing its slab thickness to a monolayer, MoS${}_{2}$ undergoes a transition to the direct band semiconductor. We support this experimental observation by first-principle calculations and show that quantum confinement in layered $d$-electron dichalcogenides results in tuning the electronic structure. We further studied the properties of related $T$S${}_{2}$ nanolayers ($T=$ W, Nb, Re) and show that the isotopological WS${}_{2}$ exhibits similar electronic properties, while NbS${}_{2}$ and ReS${}_{2}$ remain metallic independent of the slab thickness.

OpenFace: An open source facial behavior analysis toolkit
Tadas Baltrušaitis, Peter Robinson, Louis‐Philippe Morency
20161.7Kdoi:10.1109/wacv.2016.7477553

Over the past few years, there has been an increased interest in automatic facial behavior analysis and understanding. We present OpenFace - an open source tool intended for computer vision and machine learning researchers, affective computing community and people interested in building interactive applications based on facial behavior analysis. OpenFace is the first open source tool capable of facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation. The computer vision algorithms which represent the core of OpenFace demonstrate state-of-the-art results in all of the above mentioned tasks. Furthermore, our tool is capable of real-time performance and is able to run from a simple webcam without any specialist hardware. Finally, OpenFace allows for easy integration with other applications and devices through a lightweight messaging system.

Substrate-Controlled Succession of Marine Bacterioplankton Populations Induced by a Phytoplankton Bloom
Hanno Teeling, Bernhard M. Fuchs, Dörte Becher, Christine Klockow +4 more
2012· Science1.5Kdoi:10.1126/science.1218344

Phytoplankton blooms characterize temperate ocean margin zones in spring. We investigated the bacterioplankton response to a diatom bloom in the North Sea and observed a dynamic succession of populations at genus-level resolution. Taxonomically distinct expressions of carbohydrate-active enzymes (transporters; in particular, TonB-dependent transporters) and phosphate acquisition strategies were found, indicating that distinct populations of Bacteroidetes, Gammaproteobacteria, and Alphaproteobacteria are specialized for successive decomposition of algal-derived organic matter. Our results suggest that algal substrate availability provided a series of ecological niches in which specialized populations could bloom. This reveals how planktonic species, despite their seemingly homogeneous habitat, can evade extinction by direct competition.

An atlas of two-dimensional materials
Pere Miró, Martha Audiffred, Thomas Heine
2014· Chemical Society Reviews1.5Kdoi:10.1039/c4cs00102h

The discovery of graphene and other two-dimensional (2D) materials together with recent advances in exfoliation techniques have set the foundations for the manufacturing of single layered sheets from any layered 3D material. The family of 2D materials encompasses a wide selection of compositions including almost all the elements of the periodic table. This derives into a rich variety of electronic properties including metals, semimetals, insulators and semiconductors with direct and indirect band gaps ranging from ultraviolet to infrared throughout the visible range. Thus, they have the potential to play a fundamental role in the future of nanoelectronics, optoelectronics and the assembly of novel ultrathin and flexible devices. We categorize the 2D materials according to their structure, composition and electronic properties. In this review we distinguish atomically thin materials (graphene, silicene, germanene, and their saturated forms; hexagonal boron nitride; silicon carbide), rare earth, semimetals, transition metal chalcogenides and halides, and finally synthetic organic 2D materials, exemplified by 2D covalent organic frameworks. Our exhaustive data collection presented in this Atlas demonstrates the large diversity of electronic properties, including band gaps and electron mobilities. The key points of modern computational approaches applied to 2D materials are presented with special emphasis to cover their range of application, peculiarities and pitfalls.