Fraud Detection using Decision Tree and Random Forest Classifier
HyperParameter Tuning : GridCv & RandomCV Methods
Abstract: A large number of problems in data mining are related to fraud detection. Fraud is a common problem in auto insurance claims, health insurance claims, credit card transactions, financial transaction and so on. The data in this particular case comes from an actual auto insurance company. Each record represents an insurance claim. The last column in the table tells you whether the claim was fraudulent or not.
Learnings :
- Dealing with categorical data : OneHotEncoder, getdummies, labelencoder
- HyperParameter Tuning using GridSearch and RandomSearch Methods
- Building a Random Forest Classifier
- Model Evaluation : More focus on False Negatives
Main Testcode : Auto Insurance Fraud Detection.ipynb
Datasets available in Data folder

