Author name cluster

Ron Weiss

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

4 papers

2 author rows

NeurIPS Conference 2020 Conference Paper

Unsupervised Sound Separation Using Mixture Invariant Training

Scott Wisdom
Efthymios Tzinis
Hakan Erdogan
Ron Weiss
Kevin Wilson
John Hershey

In recent years, rapid progress has been made on the problem of single-channel sound separation using supervised training of deep neural networks. In such supervised approaches, a model is trained to predict the component sources from synthetic mixtures created by adding up isolated ground-truth sources. Reliance on this synthetic training data is problematic because good performance depends upon the degree of match between the training data and real-world audio, especially in terms of the acoustic conditions and distribution of sources. The acoustic properties can be challenging to accurately simulate, and the distribution of sound types may be hard to replicate. In this paper, we propose a completely unsupervised method, mixture invariant training (MixIT), that requires only single-channel acoustic mixtures. In MixIT, training examples are constructed by mixing together existing mixtures, and the model separates them into a variable number of latent sources, such that the separated sources can be remixed to approximate the original mixtures. We show that MixIT can achieve competitive performance compared to supervised methods on speech separation. Using MixIT in a semi-supervised learning setting enables unsupervised domain adaptation and learning from large amounts of real-world data without ground-truth source waveforms. In particular, we significantly improve reverberant speech separation performance by incorporating reverberant mixtures, train a speech enhancement system from noisy mixtures, and improve universal sound separation by incorporating a large amount of in-the-wild data.

PDF Details

NeurIPS Conference 2018 Conference Paper

Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis

Ye Jia
Yu Zhang
Ron Weiss
Quan Wang
Jonathan Shen
Fei Ren
Zhifeng Chen
Patrick Nguyen

We describe a neural network-based system for text-to-speech (TTS) synthesis that is able to generate speech audio in the voice of many different speakers, including those unseen during training. Our system consists of three independently trained components: (1) a speaker encoder network, trained on a speaker verification task using an independent dataset of noisy speech from thousands of speakers without transcripts, to generate a fixed-dimensional embedding vector from seconds of reference speech from a target speaker; (2) a sequence-to-sequence synthesis network based on Tacotron 2, which generates a mel spectrogram from text, conditioned on the speaker embedding; (3) an auto-regressive WaveNet-based vocoder that converts the mel spectrogram into a sequence of time domain waveform samples. We demonstrate that the proposed model is able to transfer the knowledge of speaker variability learned by the discriminatively-trained speaker encoder to the new task, and is able to synthesize natural speech from speakers that were not seen during training. We quantify the importance of training the speaker encoder on a large and diverse speaker set in order to obtain the best generalization performance. Finally, we show that randomly sampled speaker embeddings can be used to synthesize speech in the voice of novel speakers dissimilar from those used in training, indicating that the model has learned a high quality speaker representation.

PDF Details

ICRA Conference 2014 Conference Paper

Sensors for micro bio robots via synthetic biology

Edward B. Steager
Denise Wong
Deepak Mishra
Ron Weiss
Vijay Kumar 0001

Microscale robots offer an unprecedented opportunity to perform tasks at resolutions approaching 1 μm, but the great majority of research to this point focuses on actuation and control. Potential applications for microrobots can be considerably expanded by integrating sensing, signal processing and feedback into the system. In this work, we demonstrate that technologies from the field of synthetic biology may be directly integrated into microrobotic systems to create cell-based programmable mobile sensors, with signal processors and memory units. Specifically, we integrate genetically engineered, ultraviolet light-sensing bacteria with magnetic microrobots, creating the first controllable biological microrobot that is capable of exploring, recording and reporting on the state of the microscale environment. We demonstrate two proof-of-concept prototypes: (a) an integrated microrobot platform that is able to sense biochemical signals, and (b) a microrobot platform that is able to deploy biosensor payloads to monitor biochemical signals, both in a biological environment. These results have important implications for integrated micro-bio-robotic systems for applications in biological engineering and research.

Details

JMLR Journal 2011 Journal Article

Scikit-learn: Machine Learning in Python

Fabian Pedregosa
Gaël Varoquaux
Alexandre Gramfort
Vincent Michel
Bertrand Thirion
Olivier Grisel
Mathieu Blondel
Peter Prettenhofer

Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems. This package focuses on bringing machine learning to non-specialists using a general-purpose high-level language. Emphasis is put on ease of use, performance, documentation, and API consistency. It has minimal dependencies and is distributed under the simplified BSD license, encouraging its use in both academic and commercial settings. Source code, binaries, and documentation can be downloaded from http://scikit-learn.sourceforge.net. [abs] [ pdf ][ bib ] [ code ] &copy JMLR 2011. ( edit, beta )

PDF Details