Research Interests and Details

Submitted by Amy S. Wagaman on Monday, 2/26/2024, at 11:41 AM

I have several areas of research, with an array of applications from chemistry and biology to anthropology and some work in areas of statistics and data science education.
 

Network Science:

Networks are an exciting area of research. With mathematical roots in graphs, networks have many connections to statistics and recently there has been an explosion of research into modeling networks as well as using related ideas like graphical models to estimate covariance and understand relationships between variables. A large portion of my research work has been devoted to applications of graphs/networks in statistics with some applications to chemistry, particularly with protein conformational graphs and simulating protein folding kinetics. Related work with chemistry collaborators has led to several projects for students participating in the 4CBC Biomath program (now a 5College certificate). 
 

Statistics and Data Science Education:

I've been active in statistics education with a TAS article on our multivariate data analysis course for undergraduates. More recently, I've been focused on writing in statistics - examining how our students practice their communication skills and looking at the different types of writing we ask them to engage in. Several papers are in the pipeline in regards to this, including one looking at writing in a theoretical (mathematical) statistics course and one examining appropriate scaffolding for majors in the capstone course working on a research paper.

I took over for the second edition of a Probability textbook (out in 2021), and am excited to find out how students learn/engage with the R materials for the course the next time I teach it. 

Nonparametric Statistics:

Nonparametric statistics are useful when the typical parametric conditions for common tests are not satisfied. A collaborator and I are currently working on a nonparametric test for interaction in the two-way ANOVA layout. We've also re-vamped class materials for a Nonparametrics elective course. 

Dimension Reduction and Estimation:

Many modern data sets are characterized by high dimensionality - that is, the number of variables observed may be very large, especially when compared to the number of observations . For high-dimensional data, dimension estimation and reduction can be a valuable data analysis tool. Estimating the true ``intrinsic'' dimension of the data in p-dimensions can enable the use of dimension reduction methods and thus remove the curse of high-dimensionality. I have worked on projects that developed new dimension estimation techniques for various applications.

Covariance Estimation:

Data analysis techniques such as principal components analysis (PCA), classification by linear and quadratic discriminant analysis, and many more require an estimate of the covariance matrix or its inverse. However, it is known that the sample covariance performs poorly when p is large relative to n. Thus, alternative ways of estimating covariance are needed in high-dimensional situations. Alternative methods for estimating covariance can be divided into two classes depending on whether or not they require ordered variables. My work in this area has been developing a method to bridge the gap between the two classes of methods as well as discovering ways to uncover structure using covariance estimates.

Other Areas of Interest:

I have or have had projects in several other areas including:

Multivariate Data Analysis - Clustering and Classification - specifically cluster pruning, where a clustering solution is modified to deal with unusual points or possibly merged clusters

Longitudinal Data Analysis (applications to Anthropology) - analysis of survey data
 

Studying Probability in Classic Games (like Risk) - studying conquer odds with collaborators