12:00 - 12:50 PM, Wednesday, April 21, via Gather.Town
Poster #1 |
Name: Neil Callahan |
|
Title: Predicting Success of College Running Backs in the
NFL |
Abstract: This poster will
look at relationship between National Football League (NFL) draft picks from National
Collegiate Athletic Association (NCAA) football programs and the success of
these players in the NFL. For this
project data was collected on running backs who were drafted from 2005 to
2020. The goal was to build a model to predict whether players would be
successful in the NFL. I used four variables to predict the outcome. Various
predictive models were built but ultimately the best model was a Naïve Bayes
model that was around 80% accurate at correctly classifying busts and
successes. Career yards turned out to be the most important factor in making
predictions while BMI was the least important. The four distribution graphs
compare the variables against the outcome and helped in making decisions
about cutoffs when classifying the players as busts or successes. |
Poster #2 |
Name: Joe
Kulas |
|
Title: Will
Minor League Baseball Players Make it to the Major Leagues? |
Abstract: Many minor
league baseball players never make it to the majors, especially given that
there are many more players in the minor leagues than there are spots
available on major league rosters. The goal of this project was to use
predictive modeling to investigate which factors predict whether current
minor leaguers will make it to the majors in the future. I collected data on
minor league baseball statistics for current and former professional baseball
players. Using this data, I implemented multiple different prediction methods
and used the misclassification rates to determine which model performed the
best. The random forest model was found to be superior to the other methods.
A few of the most important factors for predicting whether pitchers make it
to the majors are strikeouts, games played and hits allowed, and batter’s
games played, at-bats, and hits. Finally, this best model predicted that only
about 120 of the thousands of current minor leaguers would make it to the
majors in the future. |
Poster #3 |
Name: Evan
Rondeau |
|
Title: Impacts
of Data on Direct Marketing |
Abstract: Many
businesses employ an analytics team to help them gain insight into industry
trends and make decisions regarding workflow and revenue. What benefits can this offer to a business
that does not employ such a team? My
project will show the effect of a short-term internship and the effect this
work had on a marketing campaign surrounding a webinar series. |
Poster #4 |
Name: Thomas
Veenker |
|
Title: Analyzing
and Predicting the Success of Reddit User Submissions |
Abstract: For this
project, I examined user submissions to Reddit, a popular social news
aggregation website, to determine what factors generated community approval
and lead to higher visibility. To
obtain the data, I created a unique Reddit API, learned basic programming in
Python, and taught myself how to web scrape Reddit in Python via the use of
API wrappers. After scraping 25,000
user submissions from Reddit, I analyzed the data to ascertain the effects of
certain parameters (e.g., keywords, sentiment, length, submission time/date)
on the “success” of a Reddit submission, created a regression model to
predict said “success” of any user submission, and developed a general
strategy to maximize the potential visibility of a user submission. My research has promise for both
advertisers and individual users who want to broadcast to a larger audience
on Reddit. |
Poster #5 |
Name: Benjamin
Winters |
|
Title: eSports
Predictive Analysis - A Study of Hearthstone Tournaments |
Abstract: This
poster will analyze and discuss how certain factors influence game outcomes
in a tournament setting for the digital collectible card game Hearthstone.
The main forms of analysis that will be used are logistic regression and
decision trees in order to determine significant factors and to make
predictive analysis. Features under consideration of analysis will be mainly
in-game factors specifically geared towards players going first, concepts
around mana, mana being the medium with which players can interact with the
game, and different ways in which cards can influence the state of play.
Finally, the outcome of interest with which the scope of this study will be
viewed is the end result of games, that being winning or losing. |
Poster #6 |
Name: Rebecca
Barter |
|
Title: Survival Analysis |
Abstract: For
my study, I was interested in looking into biostatistics and more
specifically survival analysis. My main goal was to learn about the
statistical methods that can be applied to survival data. I obtained data that contained information
on the heart failure of patients along with several other covariates that
affected the length of survival for these patients. I learned about and applied methods such as
Kaplan-Meier and Cox Proportional Hazards. |