| School Type | Average Starting Salary ($) |
|---|---|
| Ivy League | 60475.00 |
| Engineering | 59057.89 |
| Liberal Arts | 45746.81 |
| Party | 45715.00 |
| State | 44126.29 |
Final Report
Data Science 1 with R (STAT 301-1)
Introduction
Overview of the Analysis
The primary goal of this analysis is to investigate college salaries and earning potential to better understand how different factors such as degree type, field of study, and institution type affect financial outcomes after graduation. This exploration is motivated by the growing need for transparency in higher education costs and returns, allowing prospective students to make informed decisions about their educational investments.
Key research questions guiding this analysis include:
- How do earnings evolve in the early years post-graduation?
- What are the top-performing fields of study and institutions in terms of financial outcomes?
- How does the earning potential of bachelor’s degrees compare to other credentials?
- What regional differences exist in starting and mid-career salaries?
To answer these questions, I used two datasets: one from Kaggle and another from College Scorecard. These datasets compliment each other to give a full picture of post-graduation outcomes, covering starting salaries, mid-career growth, and overall return on investment.
Datasets and Sources
- Kaggle: College Salaries Dataset
This dataset, provided by the Wall Street Journal (WSJ), includes salary information for various undergraduate majors and regions. It was downloaded on November 3, 2024 and is publicly available at Kaggle. Key components include:
Degrees That Pay You Back: Data on starting and mid-career salaries for undergraduate majors.
Salaries by College Type: Salary data segmented by institution type, including starting and mid-career median salaries.
Salaries by Region: Regional salary data for both starting and mid-career salaries.
- College Scorecard: U.S. Department of Education: Most Recent Cohorts Field of Study
This dataset, provided by the U.S. Department of Education, focuses on detailed earnings and debt information by field of study. It was downloaded on November 24, 2024 from the College Scorecard website. Notable features include:
- Detailed breakdowns of student debt and earnings.
- Analysis of financial outcomes across various fields of study.
Motivation for Choosing These Datasets
I chose these datasets because they provide unique but related insights into the financial outcomes of higher education. The Kaggle dataset focuses on salary trends for different majors and types of schools, giving a clear picture of mid-career earnings growth. The College Scorecard dataset looks more closely at earnings and debt for specific programs, offering detailed information about financial returns by field of study.
By using both datasets together, this analysis gives a well-rounded view of the value of higher education, exploring key questions about earning potential, salary growth, and the impact of different degrees.
Data Overiew & Quality
The datasets included in this analysis are well-structured, each presenting different levels of complexity and addressing various aspects of the research questions:
Degrees That Pay You Back
This dataset consists of 50 rows and 8 columns. It primarily contains categorical variables, with 7 out of 8 columns being categorical (e.g., Undergraduate Major, Percent Change from Starting to Mid-Career Salary). The sole numerical variable is the “Starting Median Salary,” which provides critical insight into initial earnings. This dataset is complete, with no missing values (0% missingness).
Salaries by College Type
This dataset contains 269 rows and 8 columns. All columns are categorical, including variables like “School Name” and “School Type.” An important variable for analysis is “Starting Median Salary,” which offers a comparison across institution types. Like the first dataset, this one is complete, with no missing values (0% missingness).
Salaries by Region
This dataset includes 320 rows and 8 columns. Similar to the college type dataset, all columns are categorical, with key variables including “Region” and “Starting Median Salary.” There are no missing values (0% missingness), making it a reliable source for regional comparisons.
Most Recent Cohorts Field of Study
This dataset is significantly larger than the Kaggle datasets, with thousands of rows and over 1,400 columns. To streamline the analysis, I focused on 14 key columns related to earnings, debt, and fields of study. It includes both categorical and numerical variables, such as “EARN_NE_MDN_3YR” (Earnings 3 Years Post-Graduation) and “DEBT_ALL_STGP_ANY_MDN” (Median Debt for All Students). Missing values, marked as “PS” (Privacy Suppressed), were converted to NA and excluded from calculations where necessary. This dataset’s size and complexity required additional cleaning steps, detailed in the appendix.
Data Summary
Overall, the four datasets are clean and ready for analysis. The Degrees That Pay You Back dataset is straightforward and helpful for comparing salary growth. The Most Recent Cohorts Field of Study dataset is larger and more complex but provides detailed information on earnings, debt, and fields of study. Together, these datasets give a solid base to explore earnings, return on investment, and the value of higher education.
Explorations
Institution Type and Financial Outcomes
Average Starting Median Salary by Institution Type
Table 1 shows the starting salaries by school type. Ivy League schools have the highest starting salaries, followed by Engineering schools. Liberal Arts, Party, and State schools have lower starting salaries on average.
Earnings vs. Debt by Institution Type
This scatterplot in Figure 1 compares median debt to median earnings three years after graduation, based on institution type. Ivy League and Engineering schools show higher earnings and lower debt, while State, Party, and Liberal Arts schools have more varied results, often with lower earnings.
Earnings Growth Over Time by Institution Type
This graph in Figure 2 shows that graduates from Ivy League and engineering institutions experience the highest earnings growth over time, with Ivy League graduates surpassing 75,000 dollars in three years. Graduates from state, party, and liberal arts institutions see slower earnings growth, with median salaries remaining below 60,000 dollars after three years.
Fields of Study and Salaries
Median Salary by Field of Study
This bar chart in Figure 3 shows the differences in median salaries across fields of study three years after graduation, highlighting trends by discipline. This graph shows that fields in health sciences, such as dental and medical residency programs, have the highest median earnings three years after graduation. Engineering and technology-related fields also offer competitive salaries but are slightly lower.
Average Salary Growth by Undergraduate Major
This bar chart in Figure 4 compares salary growth for different majors. Philosophy, International Relations, and Math majors see the biggest salary increases from starting to mid-career. Nursing and Physician Assistant majors have smaller increases, likely because they start with higher salaries that don’t grow as much.
Median Salary by Gender and Field of Study
This bar chart in Figure 5 shows the median salaries for the top 10 fields of study three years after graduation, comparing males and females. In five fields, males earn higher median salaries, while in the other five, females earn more. This highlights how the pay gap varies across different fields of study.
Financial Aid and Earnings
Impact of Pell Grants on Earnings
This box plot in Figure 6 compares median earnings three years after graduation for students who received Pell Grants versus those who did not. Students who did not receive Pell Grants tend to have slightly higher earnings compared to those who did, highlighting potential disparities in post-graduation income.
Random Field of Study Impact of Pell Grants on Earnings
This graph in Figure 7 shows the impact of Pell Grant status on median earnings across a random selection of fields of study. Generally, students who did not receive Pell Grants tend to have slightly higher earnings compared to those who did, but the difference varies by field. The data highlights potential disparities in post-graduation income influenced by financial aid status.
Regional Comparisons
Average Starting Salary by Region
| Region | Average Starting Salary by Region ($) |
|---|---|
| California | 48450 |
| Northeastern | 47000 |
| Western | 44850 |
| Southern | 43400 |
| Midwestern | 43100 |
Table 2 shows the variation in average starting salaries across different regions. California and the Northeastern region offer higher starting salaries compared to other areas, while the Midwestern and Southern regions have lower averages. These regional differences highlight how location can impact earning potential for recent graduates.
Average Mid-Career Salary by Region
This bar chart in Figure 8 shows that people who graduate from schools in the Northeastern region and California tend to have the highest mid-career salaries. The Midwestern and Western regions have lower mid-career averages.
Credential Levels and Earnings
Top Institutions by Median Salary With a Bachelor’s Degree
| Institution | Median Earnings (1st Year) | Median Earnings (2nd Year) | Median Earnings (3rd Year) |
|---|---|---|---|
| Harvey Mudd College | 118973.0 | 122038.0 | 159657.0 |
| Princeton University | 80320.0 | 75169.0 | 123910.5 |
| Samuel Merritt University | 101572.0 | 110846.0 | 121792.0 |
| Bismarck State College | 94930.0 | 104749.0 | 113181.0 |
| Northwestern Michigan College | 90576.0 | 105306.0 | 112111.0 |
| Holy Names University | 90368.0 | 86867.0 | 109835.0 |
| Pacific Union College | 92597.0 | 92410.0 | 107830.0 |
| Pacific College | 114248.0 | 101406.0 | 105513.0 |
| Franklin W Olin College of Engineering | 91352.0 | 75706.0 | 104429.0 |
| Washington University in St Louis | 75098.5 | 79869.5 | 102693.5 |
Table 3 presents the top 10 institutions ranked by median salaries for bachelor’s degree holders. It includes a yearly breakdown of earnings from the first to the third year after graduation. According to the data, Harvey Mudd College is the best institution to pursue a bachelors degree to earn a high salary followed by Princeton University.
Top Schools for Return of Investment on Bachelor’s Degrees
| Institution | Median Earnings (3rd Year) | Median Debt | ROI ($) |
|---|---|---|---|
| Harvey Mudd College | 159657 | 22949 | 136708 |
| Washington University in St Louis | 123261 | 19000 | 104261 |
| Bismarck State College | 113181 | 15452 | 97729 |
| Samuel Merritt University | 121792 | 25000 | 96792 |
| Franklin W Olin College of Engineering | 104429 | 14512 | 89917 |
| Northwestern Michigan College | 112111 | 26542 | 85569 |
| Massachusetts Institute of Technology | 95539 | 12500 | 83039 |
| Carnegie Mellon University | 102811 | 23125 | 79686 |
| Claremont McKenna College | 91488 | 12000 | 79488 |
| University of Providence | 103079 | 23847 | 79232 |
Table 4 shows that Harvey Mudd College offers the best return on investment (ROI) with high earnings and low debt. Schools like MIT and Carnegie Mellon University also rank high because their graduates have low debt. Even smaller schools, like Bismarck State College and Northwestern Michigan College, provide good value, proving you don’t need to attend a big-name university for a strong financial return.
Median Earnings by Credential Level
This bar chart in Figure 9 compares median earnings across credential levels, highlighting the earning potential of higher degrees such as bachelor’s, master’s, and doctoral programs. From this bar chart, it’s evident that Doctorate degrees have the highest median earnings.
Earnings Growth by Credential Level
| Credential Level | Median Earnings (1st Year) | Median Earnings (3rd Year) | Growth (%) |
|---|---|---|---|
| Bachelor's Degree | 36425.0 | 43291.5 | 18.85 |
| Master's Degree | 53614.5 | 57861.0 | 7.92 |
| Doctoral Degree | 71463.0 | 77441.0 | 8.37 |
Table 5 shows how earnings grow from the first to the third year after graduation for different credential levels. Bachelor’s degree holders see the highest percentage growth in earnings, around 19%, even though they start with the lowest salaries. Master’s and Doctoral degrees have smaller percentage increases, likely because their starting salaries are already higher. This highlights the different earning patterns for each level of education.
Conclusion
This analysis highlights the impact of education choices on financial outcomes after graduation. By examining starting salaries, mid-career growth, and earnings by credential levels, it’s evident that factors such as field of study, institution type, and location play an important role in shaping earning potential.
The findings demonstrate that higher degrees generally lead to higher salaries, but bachelor’s degree holders experience the largest percentage growth in early-career earnings. Regional differences and the financial outcomes of specific majors also demonstrate the importance of careful planning when making educational investments. Together, these insights provide a useful framework for understanding the value of higher education and its long-term financial implications.
Comment on AI
ChatGPT assisted in editing my executive summary to make it clear and concise while also helping generate graph ideas for my exploratory data analysis. Furthermore, ChatGPT provided guidance on cleaning the College Scorecard data since it was rather complex and aided in creating functions to highlight key findings.