Document Type
Thesis - University Access Only
Award Date
2010
Degree Name
Master of Science (MS)
Department / School
Mathematics and Statistics
Abstract
A business must know who its customers are in order to meet its goals and operate efficiently. The products and services available from a business, along with marketing strategy, should reflect a business' current or goal customers in order to maximize profits and efficiency. This problem can be solved with a data mining technique called cluster analysis. Cluster analysis is a knowledge discovery tool capable of summarizing hundreds of thousands of observations on several variables by finding groups within data. The main objective of this thesis was to determine how many different types of customers applied for student loans at a particular student loan company. The original dataset consisted of more than 470,000 student-loan applications received by the company between April 2008 and March 2009. Different clustering methods were employed in order to answer these questions, including hierarchical agglomerative and k-means clustering. The range of possible cluster solutions was found to be two to six, and each of these solutions was run using k-means clustering on the entire original dataset. After comparing the results, the most appropriate solution was found to be three clusters. The three cluster solution was validated using several different methods which included running the three cluster solution using k-means clustering on random samples of the original data and also on one year prior's data (April '07 to March '08 student applicants) and the most recent year's data (April '09 to March '10). After each method, the results were analyzed to determine if three clusters similar to the original solution were found. The researcher found that the 3 clusters, representing three typical types of student loan applicants, were consistently detected and validated in different years' data as well as different subsets of the original data. The analysis also showed some evidence of a new type of customer emerging from the most recent year's data which might possibly be explained by the Great Recession of 2008 and 2009. In general, it was found that cluster analysis can successfully be used to detect customer segments of student loan applicants, which could be potentially useful for targeted marketing or corporate decision making.
Library of Congress Subject Headings
Student loans
Cluster analysis
Market segmentation
Format
application/pdf
Number of Pages
55
Publisher
South Dakota State University
Recommended Citation
Wallin, Josie, "Customer Segmentation using Cluster Analysis on Student Loan Applications" (2010). Electronic Theses and Dissertations. 1702.
https://openprairie.sdstate.edu/etd2/1702