NobleBlocks

Singapore University of Technology and Design

UniversitySingapore, SG.01, Singapore

Research output, citation impact, and the most-cited recent papers from Singapore University of Technology and Design (Singapore). Aggregated across the NobleBlocks index of 300M+ scholarly works.

Total works
14.0K
Citations
828.5K
h-index
309
i10-index
13.2K
Also known as
Singapore University of Technology and Design新加坡科技设计大学

Top-cited papers from Singapore University of Technology and Design

On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation
Sebastian Bach, Alexander Binder, Grégoire Montavon, Frederick Klauschen +2 more
2015· PLoS ONE4.6Kdoi:10.1371/journal.pone.0130140

Understanding and interpreting classification decisions of automated image classification systems is of high value in many applications, as it allows to verify the reasoning of the system and provides additional information to the human expert. Although machine learning methods are solving very successfully a plethora of tasks, they have in most cases the disadvantage of acting as a black box, not providing any information about what made them arrive at a particular decision. This work proposes a general solution to the problem of understanding classification decisions by pixel-wise decomposition of nonlinear classifiers. We introduce a methodology that allows to visualize the contributions of single pixels to predictions for kernel-based classifiers over Bag of Words features and for multilayered neural networks. These pixel contributions can be visualized as heatmaps and are provided to a human expert who can intuitively not only verify the validity of the classification decision, but also focus further analysis on regions of potential interest. We evaluate our method for classifiers trained on PASCAL VOC 2009 images, synthetic image data containing geometric shapes, the MNIST handwritten digits data set and for the pre-trained ImageNet model available as part of the Caffe open source package.

Reconfigurable Intelligent Surfaces for Energy Efficiency in Wireless Communication
Chongwen Huang, Alessio Zappone, George C. Alexandropoulos, Mérouane Debbah +1 more
2018· arXiv (Cornell University)3.8Kdoi:10.1109/twc.2019.2922609

The adoption of a Reconfigurable Intelligent Surface (RIS) for downlink multi-user communication from a multi-antenna base station is investigated in this paper. We develop energy-efficient designs for both the transmit power allocation and the phase shifts of the surface reflecting elements, subject to individual link budget guarantees for the mobile users. This leads to non-convex design optimization problems for which to tackle we propose two computationally affordable approaches, capitalizing on alternating maximization, gradient descent search, and sequential fractional programming. Specifically, one algorithm employs gradient descent for obtaining the RIS phase coefficients, and fractional programming for optimal transmit power allocation. Instead, the second algorithm employs sequential fractional programming for the optimization of the RIS phase shifts. In addition, a realistic power consumption model for RIS-based systems is presented, and the performance of the proposed methods is analyzed in a realistic outdoor environment. In particular, our results show that the proposed RIS-based resource allocation methods are able to provide up to $300\%$ higher energy efficiency, in comparison with the use of regular multi-antenna amplify-and-forward relaying.

Smart Radio Environments Empowered by Reconfigurable Intelligent Surfaces: How it Works, State of Research, and Road Ahead
Marco Di Renzo, Alessio Zappone, Mérouane Debbah, Mohamed‐Slim Alouini +3 more
2020· arXiv (Cornell University)3.1Kdoi:10.1109/jsac.2020.3007211

Reconfigurable intelligent surfaces (RISs) are an emerging transmission technology for application to wireless communications. RISs can be realized in different ways, which include (i) large arrays of inexpensive antennas that are usually spaced half of the wavelength apart; and (ii) metamaterial-based planar or conformal large surfaces whose scattering elements have sizes and inter-distances much smaller than the wavelength. Compared with other transmission technologies, e.g., phased arrays, multi-antenna transmitters, and relays, RISs require the largest number of scattering elements, but each of them needs to be backed by the fewest and least costly components. Also, no power amplifiers are usually needed. For these reasons, RISs constitute a promising software-defined architecture that can be realized at reduced cost, size, weight, and power (C-SWaP design), and are regarded as an enabling technology for realizing the emerging concept of smart radio environments (SREs). In this paper, we (i) introduce the emerging research field of RIS-empowered SREs; (ii) overview the most suitable applications of RISs in wireless networks; (iii) present an electromagnetic-based communication-theoretic framework for analyzing and optimizing metamaterial-based RISs; (iv) provide a comprehensive overview of the current state of research; and (v) discuss the most important research issues to tackle. Owing to the interdisciplinary essence of RIS-empowered SREs, finally, we put forth the need of reconciling and reuniting C. E. Shannon’s mathematical theory of communication with G. Green’s and J. C. Maxwell’s mathematical theories of electromagnetism for appropriately modeling, analyzing, optimizing, and deploying future wireless networks empowered by RISs.

Federated Learning With Differential Privacy: Algorithms and Performance Analysis
Kang Wei, Jun Li, Ming Ding, Chuan Ma +4 more
2020· IEEE Transactions on Information Forensics and Security2.2Kdoi:10.1109/tifs.2020.2988575

Federated learning (FL), as a type of distributed machine learning, is capable of significantly preserving clients’ private data from being exposed to adversaries. Nevertheless, private information can still be divulged by analyzing uploaded parameters from clients, e.g., weights trained in deep neural networks. In this paper, to effectively prevent information leakage, we propose a novel framework based on the concept of differential privacy (DP), in which artificial noise is added to parameters at the clients’ side before aggregating, namely, noising before model aggregation FL (NbAFL). First, we prove that the NbAFL can satisfy DP under distinct protection levels by properly adapting different variances of artificial noise. Then we develop a theoretical convergence bound on the loss function of the trained FL model in the NbAFL. Specifically, the theoretical bound reveals the following three key properties: 1) there is a tradeoff between convergence performance and privacy protection levels, i.e., better convergence performance leads to a lower protection level; 2) given a fixed privacy protection level, increasing the number <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$N$ </tex-math></inline-formula> of overall clients participating in FL can improve the convergence performance; and 3) there is an optimal number aggregation times (communication rounds) in terms of convergence performance for a given protection level. Furthermore, we propose a <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$K$ </tex-math></inline-formula> -client random scheduling strategy, where <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$K$ </tex-math></inline-formula> ( <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$1\leq K&lt; N$ </tex-math></inline-formula> ) clients are randomly selected from the <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$N$ </tex-math></inline-formula> overall clients to participate in each aggregation. We also develop a corresponding convergence bound for the loss function in this case and the <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$K$ </tex-math></inline-formula> -client random scheduling strategy also retains the above three properties. Moreover, we find that there is an optimal <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$K$ </tex-math></inline-formula> that achieves the best convergence performance at a fixed privacy level. Evaluations demonstrate that our theoretical results are consistent with simulations, thereby facilitating the design of various privacy-preserving FL algorithms with different tradeoff requirements on convergence performance and privacy levels.

Smart radio environments empowered by reconfigurable AI meta-surfaces: an idea whose time has come
Marco Di Renzo, Mérouane Debbah, Dinh-Thuy Phan-Huy, Alessio Zappone +4 more
2019· EURASIP Journal on Wireless Communications and Networking1.8Kdoi:10.1186/s13638-019-1438-9

Future wireless networks are expected to constitute a distributed intelligent wireless communications, sensing, and computing platform, which will have the challenging requirement of interconnecting the physical and digital worlds in a seamless and sustainable manner. Currently, two main factors prevent wireless network operators from building such networks: (1) the lack of control of the wireless environment, whose impact on the radio waves cannot be customized, and (2) the current operation of wireless radios, which consume a lot of power because new signals are generated whenever data has to be transmitted. In this paper, we challenge the usual “more data needs more power and emission of radio waves” status quo, and motivate that future wireless networks necessitate a smart radio environment: a transformative wireless concept, where the environmental objects are coated with artificial thin films of electromagnetic and reconfigurable material (that are referred to as reconfigurable intelligent meta-surfaces), which are capable of sensing the environment and of applying customized transformations to the radio waves. Smart radio environments have the potential to provide future wireless networks with uninterrupted wireless connectivity, and with the capability of transmitting data without generating new signals but recycling existing radio waves. We will discuss, in particular, two major types of reconfigurable intelligent meta-surfaces applied to wireless networks. The first type of meta-surfaces will be embedded into, e.g., walls, and will be directly controlled by the wireless network operators via a software controller in order to shape the radio waves for, e.g., improving the network coverage. The second type of meta-surfaces will be embedded into objects, e.g., smart t-shirts with sensors for health monitoring, and will backscatter the radio waves generated by cellular base stations in order to report their sensed data to mobile phones. These functionalities will enable wireless network operators to offer new services without the emission of additional radio waves, but by recycling those already existing for other purposes. This paper overviews the current research efforts on smart radio environments, the enabling technologies to realize them in practice, the need of new communication-theoretic models for their analysis and design, and the long-term and open research issues to be solved towards their massive deployment. In a nutshell, this paper is focused on discussing how the availability of reconfigurable intelligent meta-surfaces will allow wireless network operators to redesign common and well-known network communication paradigms.

GraRep
Shaosheng Cao, Wei Lu, Qiongkai Xu
20151.6Kdoi:10.1145/2806416.2806512

In this paper, we present {GraRep}, a novel model for learning vertex representations of weighted graphs. This model learns low dimensional vectors to represent vertices appearing in a graph and, unlike existing work, integrates global structural information of the graph into the learning process. We also formally analyze the connections between our work and several previous research efforts, including the DeepWalk model of Perozzi et al. as well as the skip-gram model with negative sampling of Mikolov et al.

Explaining nonlinear classification decisions with deep Taylor decomposition
Grégoire Montavon, Sebastian Lapuschkin, Alexander Binder, Wojciech Samek +1 more
2016· Pattern Recognition1.4Kdoi:10.1016/j.patcog.2016.11.008

Nonlinear methods such as Deep Neural Networks (DNNs) are the gold standard for various challenging machine learning problems such as image recognition. Although these methods perform impressively well, they have a significant disadvantage, the lack of transparency, limiting the interpretability of the solution and thus the scope of application in practice. Especially DNNs act as black boxes due to their multilayer nonlinear structure. In this paper we introduce a novel methodology for interpreting generic multilayer neural networks by decomposing the network classification decision into contributions of its input elements. Although our focus is on image classification, the method is applicable to a broad set of input data, learning tasks and network architectures. Our method called deep Taylor decomposition efficiently utilizes the structure of the network by backpropagating the explanations from the output to the input layer. We evaluate the proposed method empirically on the MNIST and ILSVRC data sets.

N-BaIoT—Network-Based Detection of IoT Botnet Attacks Using Deep Autoencoders
Yair Meidan, Michael Bohadana, Yael Mathov, Yisroel Mirsky +3 more
2018· IEEE Pervasive Computing1.3Kdoi:10.1109/mprv.2018.03367731

The proliferation of IoT devices that can be more easily compromised than desktop computers has led to an increase in IoT-based botnet attacks. To mitigate this threat, there is a need for new methods that detect attacks launched from compromised IoT devices and that differentiate between hours- and milliseconds-long IoT-based attacks. In this article, we propose a novel network-based anomaly detection method for the IoT called N-BaIoT that extracts behavior snapshots of the network and uses deep autoencoders to detect anomalous network traffic from compromised IoT devices. To evaluate our method, we infected nine commercial IoT devices in our lab with two widely known IoT-based botnets, Mirai and BASHLITE. The evaluation results demonstrated our proposed methods ability to accurately and instantly detect the attacks as they were being launched from the compromised IoT devices that were part of a botnet.

Carbon Footprint and the Management of Supply Chains: Insights From Simple Models
Saif Benjaafar, Yanzhi Li, Mark S. Daskin
2012· IEEE Transactions on Automation Science and Engineering1.3Kdoi:10.1109/tase.2012.2203304

Using relatively simple and widely used models, we illustrate how carbon emission concerns could be integrated into operational decision-making with regard to procurement, production, and inventory management. We show how, by associating carbon emission parameters with various decision variables, traditional models can be modified to support decision-making that accounts for both cost and carbon footprint. We examine how the values of these parameters as well as the parameters of regulatory emission control policies affect cost and emissions. We use the models to study the extent to which carbon reduction requirements can be addressed by operational adjustments, as an alternative (or a supplement) to costly investments in carbon-reducing technologies. We also use the models to investigate the impact of collaboration among firms within the same supply chain on their costs and carbon emissions and study the incentives firms might have in seeking such cooperation. We provide a series of insights that highlight the impact of operational decisions on carbon emissions and the importance of operational models in evaluating the impact of different regulatory policies and in assessing the benefits of investments in more carbon efficient technologies. Note to Practitioners-Firms worldwide, responding to the threat of government legislation or to concerns raised by their own consumers or shareholders, are undertaking initiatives to reduce their carbon footprint. It is the conventional thinking that such initiatives will require either capital investments or a switch to more expensive sources of energy or input material. In this paper, we show that firms could effectively reduce their carbon emissions without significantly increasing their costs by making only operational adjustments and by collaborating with other members of their supply chain. We describe optimization models that can be used by firms to support operational decision making and supply chain collaboration, while taking into account carbon emissions. We analyze the effect of different emission regulations, including strict emission caps, taxes on emissions, cap-and-offset, and cap-and-trade, on supply chain management decisions. In particular, we show that the presence of emission regulation can significantly increase the value of supply chain collaboration.

Evaluating the Visualization of What a Deep Neural Network Has Learned
Wojciech Samek, Alexander Binder, Grégoire Montavon, Sebastian Lapuschkin +1 more
2016· IEEE Transactions on Neural Networks and Learning Systems1.1Kdoi:10.1109/tnnls.2016.2599820

Deep neural networks (DNNs) have demonstrated impressive performance in complex machine learning tasks such as image classification or speech recognition. However, due to their multilayer nonlinear structure, they are not transparent, i.e., it is hard to grasp what makes them arrive at a particular classification or recognition decision, given a new unseen data sample. Recently, several approaches have been proposed enabling one to understand and interpret the reasoning embodied in a DNN for a single test image. These methods quantify the “importance” of individual pixels with respect to the classification decision and allow a visualization in terms of a heatmap in pixel/input space. While the usefulness of heatmaps can be judged subjectively by a human, an objective quality measure is missing. In this paper, we present a general methodology based on region perturbation for evaluating ordered collections of pixels such as heatmaps. We compare heatmaps computed by three different methods on the SUN397, ILSVRC2012, and MIT Places data sets. Our main result is that the recently proposed layer-wise relevance propagation algorithm qualitatively and quantitatively provides a better explanation of what made a DNN arrive at a particular classification decision than the sensitivity-based approach or the deconvolution method. We provide theoretical arguments to explain this result and discuss its practical implications. Finally, we investigate the use of heatmaps for unsupervised assessment of the neural network performance.

Multimaterial 4D Printing with Tailorable Shape Memory Polymers
Qi Ge, Amir Hosein Sakhaei, Howon Lee, Conner K. Dunn +2 more
2016· Scientific Reports1.1Kdoi:10.1038/srep31110

We present a new 4D printing approach that can create high resolution (up to a few microns), multimaterial shape memory polymer (SMP) architectures. The approach is based on high resolution projection microstereolithography (PμSL) and uses a family of photo-curable methacrylate based copolymer networks. We designed the constituents and compositions to exhibit desired thermomechanical behavior (including rubbery modulus, glass transition temperature and failure strain which is more than 300% and larger than any existing printable materials) to enable controlled shape memory behavior. We used a high resolution, high contrast digital micro display to ensure high resolution of photo-curing methacrylate based SMPs that requires higher exposure energy than more common acrylate based polymers. An automated material exchange process enables the manufacture of 3D composite architectures from multiple photo-curable SMPs. In order to understand the behavior of the 3D composite microarchitectures, we carry out high fidelity computational simulations of their complex nonlinear, time-dependent behavior and study important design considerations including local deformation, shape fixity and free recovery rate. Simulations are in good agreement with experiments for a series of single and multimaterial components and can be used to facilitate the design of SMP 3D structures.

Deep Neural Networks for Learning Graph Representations
Shaosheng Cao, Wei Lu, Qiongkai Xu
2016· Proceedings of the AAAI Conference on Artificial Intelligence1.1Kdoi:10.1609/aaai.v30i1.10179

In this paper, we propose a novel model for learning graph representations, which generates a low-dimensional vector representation for each vertex by capturing the graph structural information. Different from other previous research efforts, we adopt a random surfing model to capture graph structural information directly, instead of using the sampling-based method for generating linear sequences proposed by Perozzi et al. (2014). The advantages of our approach will be illustrated from both theorical and empirical perspectives. We also give a new perspective for the matrix factorization method proposed by Levy and Goldberg (2014), in which the pointwise mutual information (PMI) matrix is considered as an analytical solution to the objective function of the skip-gram model with negative sampling proposed by Mikolov et al. (2013). Unlike their approach which involves the use of the SVD for finding the low-dimensitonal projections from the PMI matrix, however, the stacked denoising autoencoder is introduced in our model to extract complex features and model non-linearities. To demonstrate the effectiveness of our model, we conduct experiments on clustering and visualization tasks, employing the learned vertex representations as features. Empirical results on datasets of varying sizes show that our model outperforms other stat-of-the-art models in such tasks.

Unmasking Clever Hans predictors and assessing what machines really learn
Sebastian Lapuschkin, Stephan Wäldchen, Alexander Binder, Grégoire Montavon +2 more
2019· Nature Communications975doi:10.1038/s41467-019-08987-4

Current learning machines have successfully solved hard application problems, reaching high accuracy and displaying seemingly intelligent behavior. Here we apply recent techniques for explaining decisions of state-of-the-art learning machines and analyze various tasks from computer vision and arcade games. This showcases a spectrum of problem-solving behaviors ranging from naive and short-sighted, to well-informed and strategic. We observe that standard performance evaluation metrics can be oblivious to distinguishing these diverse problem solving behaviors. Furthermore, we propose our semi-automated Spectral Relevance Analysis that provides a practically effective way of characterizing and validating the behavior of nonlinear learning machines. This helps to assess whether a learned model indeed delivers reliably for the problem that it was conceived for. Furthermore, our work intends to add a voice of caution to the ongoing excitement about machine intelligence and pledges to evaluate and judge some of these recent successes in a more nuanced manner.

van der Waals Epitaxy of MoS<sub>2</sub> Layers Using Graphene As Growth Templates
Yumeng Shi, Wu Zhou, Ang‐Yu Lu, Wenjing Fang +4 more
2012· Nano Letters967doi:10.1021/nl204562j

We present a method for synthesizing MoS(2)/Graphene hybrid heterostructures with a growth template of graphene-covered Cu foil. Compared to other recent reports, (1, 2) a much lower growth temperature of 400 °C is required for this procedure. The chemical vapor deposition of MoS(2) on the graphene surface gives rise to single crystalline hexagonal flakes with a typical lateral size ranging from several hundred nanometers to several micrometers. The precursor (ammonium thiomolybdate) together with solvent was transported to graphene surface by a carrier gas at room temperature, which was then followed by post annealing. At an elevated temperature, the precursor self-assembles to form MoS(2) flakes epitaxially on the graphene surface via thermal decomposition. With higher amount of precursor delivered onto the graphene surface, a continuous MoS(2) film on graphene can be obtained. This simple chemical vapor deposition method provides a unique approach for the synthesis of graphene heterostructures and surface functionalization of graphene. The synthesized two-dimensional MoS(2)/Graphene hybrids possess great potential toward the development of new optical and electronic devices as well as a wide variety of newly synthesizable compounds for catalysts.

Safeguarding 5G wireless communication networks using physical layer security
Nan Yang, Lifeng Wang, Giovanni Geraci, Maged Elkashlan +2 more
2015· IEEE Communications Magazine944doi:10.1109/mcom.2015.7081071

The fifth generation (5G) network will serve as a key enabler in meeting the continuously increasing demands for future wireless applications, including an ultra-high data rate, an ultrawide radio coverage, an ultra-large number of devices, and an ultra-low latency. This article examines security, a pivotal issue in the 5G network where wireless transmissions are inherently vulnerable to security breaches. Specifically, we focus on physical layer security, which safeguards data confidentiality by exploiting the intrinsic randomness of the communications medium and reaping the benefits offered by the disruptive technologies to 5G. Among various technologies, the three most promising ones are discussed: heterogenous networks, massive multiple-input multiple-output, and millimeter wave. On the basis of the key principles of each technology, we identify the rich opportunities and the outstanding challenges that security designers must tackle. Such an identification is expected to decisively advance the understanding of future physical layer security.

Recent Advances in Indoor Localization: A Survey on Theoretical Approaches and Applications
Ali Yassin, Youssef Nasser, Mariette Awad, Ahmed Al‐Dubai +4 more
2016· IEEE Communications Surveys & Tutorials902doi:10.1109/comst.2016.2632427

The availability of location information has become a key factor in today's communications systems allowing location based services. In outdoor scenarios, the mobile terminal position is obtained with high accuracy thanks to the global positioning system (GPS) or to the standalone cellular systems. However, the main problem of GPS and cellular systems resides in the indoor environment and in scenarios with deep shadowing effects where the satellite or cellular signals are broken. In this paper, we survey different technologies and methodologies for indoor and outdoor localization with an emphasis on indoor methodologies and concepts. Additionally, we discuss in this review different localization-based applications, where the location information is critical to estimate. Finally, a comprehensive discussion of the challenges in terms of accuracy, cost, complexity, security, scalability, etc. is given. The aim of this survey is to provide a comprehensive overview of existing efforts as well as auspicious and anticipated dimensions for future work in indoor localization techniques and applications.

Offloading in Mobile Edge Computing: Task Allocation and Computational Frequency Scaling
Thinh Quang Thinh, Jianhua Tang, Quang Duy Lã, Tony Q. S. Quek
2017· IEEE Transactions on Communications891doi:10.1109/tcomm.2017.2699660

In this paper, we propose an optimization framework of offloading from a single mobile device (MD) to multiple edge devices. We aim to minimize both total tasks' execution latency and the MD's energy consumption by jointly optimizing the task allocation decision and the MD's central process unit (CPU) frequency. This paper considers two cases for the MD, i.e., fixed CPU frequency and elastic CPU frequency. Since these problems are NP-hard, we propose a linear relaxation-based approach and a semidefinite relaxation (SDR)-based approach for the fixed CPU frequency case, and an exhaustive search-based approach and an SDR-based approach for the elastic CPU frequency case. Our simulation results show that the SDR-based algorithms achieve near optimal performance. Performance improvement can be obtained with the proposed scheme in terms of energy consumption and tasks' execution latency when multiple edge devices and elastic CPU frequency are considered. Finally, we show that the MD's flexible CPU range can have an impact on the task allocation.

MISA
Devamanyu Hazarika, Roger Zimmermann, Soujanya Poria
2020865doi:10.1145/3394171.3413678

Multimodal Sentiment Analysis is an active area of research that leverages multimodal signals for affective understanding of user-generated videos. The predominant approach, addressing this task, has been to develop sophisticated fusion techniques. However, the heterogeneous nature of the signals creates distributional modality gaps that pose significant challenges. In this paper, we aim to learn effective modality representations to aid the process of fusion. We propose a novel framework, MISA, which projects each modality to two distinct subspaces. The first subspace is modality-invariant, where the representations across modalities learn their commonalities and reduce the modality gap. The second subspace is modality-specific, which is private to each modality and captures their characteristic features. These representations provide a holistic view of the multimodal data, which is used for fusion that leads to task predictions. Our experiments on popular sentiment analysis benchmarks, MOSI and MOSEI, demonstrate significant gains over state-of-the-art models. We also consider the task of Multimodal Humor Detection and experiment on the recently proposed UR_FUNNY dataset. Here too, our model fares better than strong baselines, establishing MISA as a useful multimodal framework.

Chinese NER Using Lattice LSTM
Yue Zhang, Jie Yang
2018859doi:10.18653/v1/p18-1144

We investigate a lattice-structured LSTM model for Chinese NER, which encodes a sequence of input characters as well as all potential words that match a lexicon. Compared with character-based methods, our model explicitly leverages word and word sequence information. Compared with word-based methods, lattice LSTM does not suffer from segmentation errors. Gated recurrent cells allow our model to choose the most relevant characters and words from a sentence for better NER results. Experiments on various datasets show that lattice LSTM outperforms both word-based and character-based LSTM baselines, achieving the best results.

Breaking the Speed Limits of Phase-Change Memory
Desmond K. Loke, T. H. Lee, W. J. Wang, Lu Shi +4 more
2012· Science832doi:10.1126/science.1221561

Phase-change random-access memory (PCRAM) is one of the leading candidates for next-generation data-storage devices, but the trade-off between crystallization (writing) speed and amorphous-phase stability (data retention) presents a key challenge. We control the crystallization kinetics of a phase-change material by applying a constant low voltage via prestructural ordering (incubation) effects. A crystallization speed of 500 picoseconds was achieved, as well as high-speed reversible switching using 500-picosecond pulses. Ab initio molecular dynamics simulations reveal the phase-change kinetics in PCRAM devices and the structural origin of the incubation-assisted increase in crystallization speed. This paves the way for achieving a broadly applicable memory device, capable of nonvolatile operations beyond gigahertz data-transfer rates.