Author name cluster

Peter Szolovits

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

12 papers

2 author rows

NeurIPS Conference 2020 Conference Paper

Expert-Supervised Reinforcement Learning for Offline Policy Learning and Evaluation

Aaron Sonabend
Junwei Lu
Leo Anthony Celi
Tianxi Cai
Peter Szolovits

Offline Reinforcement Learning (RL) is a promising approach for learning optimal policies in environments where direct exploration is expensive or unfeasible. However, the adoption of such policies in practice is often challenging, as they are hard to interpret within the application context, and lack measures of uncertainty for the learned policy value and its decisions. To overcome these issues, we propose an Expert-Supervised RL (ESRL) framework which uses uncertainty quantification for offline policy learning. In particular, we have three contributions: 1) the method can learn safe and optimal policies through hypothesis testing, 2) ESRL allows for different levels of risk averse implementations tailored to the application context, and finally, 3) we propose a way to interpret ESRL’s policy at every state through posterior distributions, and use this framework to compute off-policy value function posteriors. We provide theoretical guarantees for our estimators and regret bounds consistent with Posterior Sampling for RL (PSRL). Sample efficiency of ESRL is independent of the chosen risk aversion threshold and quality of the behavior policy.

PDF Details

AAAI Conference 2020 Conference Paper

Is BERT Really Robust? A Strong Baseline for Natural Language Attack on Text Classification and Entailment

Di Jin
Zhijing Jin
Joey Tianyi Zhou
Peter Szolovits

Machine learning algorithms are often vulnerable to adversarial examples that have imperceptible alterations from the original counterparts but can fool the state-of-the-art models. It is helpful to evaluate or even improve the robustness of these models by exposing the maliciously crafted adversarial examples. In this paper, we present TEXTFOOLER, a simple but strong baseline to generate adversarial text. By applying it to two fundamental natural language tasks, text classiﬁcation and textual entailment, we successfully attacked three target models, including the powerful pre-trained BERT, and the widely used convolutional and recurrent neural networks. We demonstrate three advantages of this framework: (1) effective—it outperforms previous attacks by success rate and perturbation rate, (2) utility-preserving—it preserves semantic content, grammaticality, and correct types classiﬁed by humans, and (3) efﬁcient—it generates adversarial text with computational complexity linear to the text length. 1

PDF Details

AAAI Conference 2018 Conference Paper

Semi-Supervised Biomedical Translation With Cycle Wasserstein Regression GANs

Matthew McDermott
Tom Yan
Tristan Naumann
Nathan Hunt
Harini Suresh
Peter Szolovits
Marzyeh Ghassemi

The biomedical ﬁeld offers many learning tasks that share unique challenges: large amounts of unpaired data, and a high cost to generate labels. In this work, we develop a method to address these issues with semi-supervised learning in regression tasks (e. g. , translation from source to target). Our model uses adversarial signals to learn from unpaired datapoints, and imposes a cycle-loss reconstruction error penalty to regularize mappings in either direction against one another. We ﬁrst evaluate our method on synthetic experiments, demonstrating two primary advantages of the system: 1) distribution matching via the adversarial loss and 2) regularization towards invertible mappings via the cycle loss. We then show a regularization effect and improved performance when paired data is supplemented by additional unpaired data on two real biomedical regression tasks: estimating the physiological effect of medical treatments, and extrapolating gene expression (transcriptomics) signals. Our proposed technique is a promising initial step towards more robust use of adversarial signals in semi-supervised regression, and could be useful for other tasks (e. g. , causal inference or modality translation) in the biomedical ﬁeld.

PDF Details

JBHI Journal 2017 Journal Article

Predicting Social Anxiety Treatment Outcome Based on Therapeutic Email Conversations

Mark Hoogendoorn
Thomas Berger
Ava Schulz
Timo Stolz
Peter Szolovits

Predicting therapeutic outcome in the mental health domain is of utmost importance to enable therapists to provide the most effective treatment to a patient. Using information from the writings of a patient can potentially be a valuable source of information, especially now that more and more treatments involve computer-based exercises or electronic conversations between patient and therapist. In this paper, we study predictive modeling using writings of patients under treatment for a social anxiety disorder. We extract a wealth of information from the text written by patients including their usage of words, the topics they talk about, the sentiment of the messages, and the style of writing. In addition, we study trends over time with respect to those measures. We then apply machine learning algorithms to generate the predictive models. Based on a dataset of 69 patients, we are able to show that we can predict therapy outcome with an area under the curve of 0. 83 halfway through the therapy and with a precision of 0. 78 when using the full data (i. e. , the entire treatment period). Due to the limited number of participants, it is hard to generalize the results, but they do show great potential in this type of information.

Details DOI

AAAI Conference 2016 Conference Paper

Predicting ICU Mortality Risk by Grouping Temporal Trends from a Multivariate Panel of Physiologic Measurements

Yuan Luo
Yu Xin
Rohit Joshi
Leo Celi
Peter Szolovits

ICU mortality risk prediction may help clinicians take effective interventions to improve patient outcome. Existing machine learning approaches often face challenges in integrating a comprehensive panel of physiologic variables and presenting to clinicians interpretable models. We aim to improve both accuracy and interpretability of prediction models by introducing Subgraph Augmented Non-negative Matrix Factorization (SANMF) on ICU physiologic time series. SANMF converts time series into a graph representation and applies frequent subgraph mining to automatically extract temporal trends. We then apply non-negative matrix factorization to group trends in a way that approximates patient pathophysiologic states. Trend groups are then used as features in training a logistic regression model for mortality risk prediction, and are also ranked according to their contribution to mortality risk. We evaluated SANMF against four empirical models on the task of predicting mortality or survival 30 days after discharge from ICU using the observed physiologic measurements between 12 and 24 hours after admission. SANMF outperforms all comparison models, and in particular, demonstrates an improvement in AUC (0.848 vs. 0.827, p<0.002) compared to a state-of-the-art machine learning method that uses manual feature engineering. Feature analysis was performed to illuminate insights and benefits of subgraph groups in mortality risk prediction.

PDF Details

AIIM Journal 2016 Journal Article

Utilizing uncoded consultation notes from electronic medical records for predictive modeling of colorectal cancer

Mark Hoogendoorn
Peter Szolovits
Leon M.G. Moons
Mattijs E. Numans

Details DOI

AAAI Conference 2015 Conference Paper

A Multivariate Timeseries Modeling Approach to Severity of Illness Assessment and Forecasting in ICU with Sparse, Heterogeneous Clinical Data

Marzyeh Ghassemi
Marco Pimentel
Tristan Naumann
Thomas Brennan
David Clifton
Peter Szolovits
Mengling Feng

The ability to determine patient acuity (or severity of illness) has immediate practical use for clinicians. We evaluate the use of multivariate timeseries modeling with the multi-task Gaussian process (GP) models using noisy, incomplete, sparse, heterogeneous and unevenlysampled clinical data, including both physiological signals and clinical notes. The learned multi-task GP (MTGP) hyperparameters are then used to assess and forecast patient acuity. Experiments were conducted with two real clinical data sets acquired from ICU patients: firstly, estimating cerebrovascular pressure reactivity, an important indicator of secondary damage for traumatic brain injury patients, by learning the interactions between intracranial pressure and mean arterial blood pressure signals, and secondly, mortality prediction using clinical progress notes. In both cases, MTGPs provided improved results: an MTGP model provided better results than single-task GP models for signal interpolation and forecasting (0. 91 vs 0. 69 RMSE), and the use of MTGP hyperparameters obtained improved results when used as additional classification features (0. 812 vs 0. 788 AUC).

PDF Details

AIIM Journal 2009 Journal Article

The coming of age of artificial intelligence in medicine

Vimla L. Patel
Edward H. Shortliffe
Mario Stefanelli
Peter Szolovits
Michael R. Berthold
Riccardo Bellazzi
Ameen Abu-Hanna

Details DOI

AIIM Journal 2008 Journal Article

A de-identifier for medical discharge summaries

Özlem Uzuner
Tawanda C. Sibanda
Yuan Luo
Peter Szolovits

Details DOI

UAI Conference 1994 Conference Paper

Global Conditioning for Probabilistic Inference in Belief Networks

Ross D. Shachter
Stig K. Andersen
Peter Szolovits

Details

AIJ Journal 1993 Journal Article

Categorical and probabilistic reasoning in medicine revisited

Peter Szolovits
Stephen G. Pauker

Details DOI

AIJ Journal 1978 Journal Article

Categorical and probabilistic reasoning in medical diagnosis

Peter Szolovits
Stephen G. Pauker

Details DOI