MENU

Zayed Shahjahan

Title: Bayesian Variable Selection in Regression with Genetics Application
Date: April 12, 2022
Time: 9:30 AM (PDT)
Location: Remote delivery

Abstract

In this project, we consider a simple new approach to variable selection in linear regression based on the Sum-of-Single-Effects model. The approach is particularly well-suited to big-data settings where variables are highly correlated and effects are sparse. The approach shares the computational simplicity and speed of traditional stepwise methods of variable selection in regression, but instead of selecting a single variable at each step, computes a distribution on variables that captures uncertainty in which variable to select. This uncertainty in variable selection is summarized conveniently by Credible Sets of variables with an attached probability for the entire set. To illustrate the approach, we apply it to a big-data problem in genetics.

Keywords: variable selection; Bayesian regression; uncertainty quantification; genomic data science