This repository contains the code and documentation for the Quote Extraction Automation API developed during my Software Development internship at HDFC ERGO from April 2022 to June 2022. The project aimed to automate the validation, extraction, and storage of data from insurance quote Excel templates, improving the efficiency and accuracy of the policy issuance process.
- Project Overview
- Challenges with Current Process
- Modified Process
- Advantages of Modified Process
- Products
- API Code Flow
- Demo
- Software Specifications
The primary objective of this project was to create an API that automates the process of validating, extracting, and storing data from insurance quote Excel files into a database, thereby eliminating manual data entry and speeding up the policy issuance process.
- Inconsistent Excel quote templates due to:
- Changes in formulas by salespersons.
- Use of obsolete templates.
- Time-consuming & error-prone manual data entry.
- Redundancy in lead ID:
- Each customer is assigned a unique lead ID.
- A customer might upload the same Excel file multiple times, causing redundancy.
-
4 Factor Validation:
- File format check (.xlsx).
- Unique hidden encrypted value check in a random cell.
- Template format check using 5 constant labels.
- Sheet properties check (creator and title).
-
Automated Data Entry:
- Saves time (from 2 hours to 2 minutes per proposal).
- Eliminates data entry errors.
-
Managing Redundancy in Lead ID:
- Introduced an "isactive" column in each database table.
- Ensures the most recent activity is marked as active.
The API handles the following insurance products:
- Standard Fire & Special Perils Policy (SFSP)
- Burglary Insurance
- Plate Glass Insurance
For each product, the API processes sections such as:
- Customer Details
- Main Policies
- Add-On Covers
- Terms & Conditions
- Deductibles
- Warranties
- Clauses & Conditions
- Exclusions & Subjectivities
- Supplementary Clauses & Conditions
- Python: 3.8.8
- Database: Oracle 19c
- Libraries:
- Flask: 2.1.2
- Pandas: 1.4.2
- NumPy: 1.22.4
- os: 0.6.3
- sqlalchemy: 1.3.24
- openpyxl: 3.0.9
- Werkzeug: 2.1.2
- cx_Oracle: 8.3.0



