Friday 21 December 2018

Hypothetical Model to Predict PTPTN Defaulter – An Illustration


In regression analysis, logistic regression (“logit”) uses a logistic function to model a binary dependent variable.  It is used in various fields, including machine learning, most medical fields, and social sciences (Read more here).  In a business application, it can be used to predict the likelihood of a homeowner defaulting on a mortgage, by inputting homeowner’s data such as age, income, credit scores and other relevant information (Read more here).

Data such as school exams results, courses offered by University, type of schools, urban or rural, number of siblings, attendance records and other relevant personal data could be used as input for the logit model.

We do not have the information so this article will use “artificial” data to illustrate the capability of the logit model.  For simplicity, only five (5) independent variables are considered in the logit model.

·        Number of A’s in Sijil Pelajaran Malaysia (SPM) = X1
·        Course offered by University – Science (When X2 = X3 = X4 = 0)
·        Course offered by University – Technology = X2
·        Course offered by University – Art = X3
·        Courser offered by University – Business = X4

The following picture is the artificial data for the logit model.  A total of 500 rows of data were created.  The first column shows whether the student will default the loan, 1 means default while 0 means no default.  Second column is the number of A’s that the student scored in SPM.  Column 3 to 5 are the courses offered by University to that student (triple zero means the course offered is Science).  While the last 4 columns are the equations for logit model.



Once the regression coefficients are determined by using Maximum Likelihood Estimation (MLE), the probability equation is


For example, let’s say a student scored 9 A’s in SPM and was offered a Science course in University, the chances for this student to default PTPTN loan is 0.24.  On the other hand, a student who scored 4 A’s in SPM and was offered an Arts course in University, the default probability is higher at 0.88.

So how could we use this model?  For those students who have higher default rate, a guarantor might be required.  If a complete credit rating system is established, government could potentially offload their financial burden to the private sector.  The above framework is possibly useful for PTPTN in evaluating applicants for its loans. 

No comments:

Post a Comment