National Health Insurance Service (NHIS) – Machine Learning-Based Fraud Detection and Prediction System
Overview
The National Health Insurance Service (NHIS) of the Republic of Korea operates a machine learning-based fraud detection and prediction system designed to identify irregularities and forecast potential misuse in healthcare provider billing across the country's universal health insurance programme. South Korea's National Health Insurance system covers virtually all residents, with healthcare services delivered predominantly by private-sector providers who submit claims to NHIS for reimbursement. The fraud detection system targets healthcare facilities that are not established in accordance with current regulations, for example those operated by unqualified persons seeking to maximise profit through fraudulent insurance claims, with a particular focus on long-term care benefits and health insurance eligibility fraud.
NHIS began applying artificial intelligence to fraud detection from 2020, building upon earlier rule-based detection approaches. The system operates as a hybrid detection architecture that combines traditional rule-based screening with AI predictive models to identify information with a high probability of fraud. The machine learning component employs a range of supervised learning techniques, including deep learning, random forest, gradient boosting, logistic regression, and support vector machine (SVM) algorithms. These models are trained on NHIS healthcare big data, which includes socio-demographic variables, disease and treatment history records, and provider billing patterns. The system generates ranked alerts for investigator review, flagging cases with the highest probability of fraudulent activity for priority examination by human investigators.
The transition from experiential, manual screening approaches to data-driven management represented a significant operational shift for NHIS. Previously, fraud detection relied heavily on the experience and intuition of individual investigators reviewing claims data. The machine learning system enables proactive data analysis to identify and manage fraudulent activities systematically rather than reactively. The system processes administrative health insurance claims, provider billing records, and beneficiary identifiers, with beneficiary data pseudonymised in accordance with the Personal Information Protection Act (PIPA). PIPA, one of the strictest data protection frameworks globally, classifies health data as sensitive information under Article 23 and imposes stringent requirements on data controllers, including mandatory designation of a Chief Privacy Officer, data protection impact assessments, and penalties for non-compliance including corrective orders, penalty surcharges, and potential criminal prosecution.
In terms of documented outcomes, the fraud detection system has demonstrated substantial results. Between 2014 and 2021, NHIS detected 567,820 cases of fraud, equivalent to approximately 174 million United States dollars in claim value. In 2023, NHIS enhanced its fraud detection system to combat evolving forms of fraud, uncovering 18.534 billion Korean won (approximately 14 million USD) in unjust enrichment through the machine learning-based detection and prediction system. These results reflect a significant curbing of financial leakage in long-term care financing and healthcare claims processing.
The human oversight model follows a human-in-the-loop paradigm. Investigators manually review and adjudicate all cases flagged by the system before any enforcement action or sanction is imposed. The AI system outputs inform compliance investigations but do not autonomously impose sanctions or alter entitlements. This approach ensures that algorithmic outputs serve as decision support tools rather than automated decision-makers, reflecting the high-criticality nature of the decisions involved, which can affect healthcare provider licensing, financial penalties, and access to the national health insurance system.
The system operates within a governance framework defined by several key pieces of legislation. The Personal Information Protection Act (PIPA) governs the processing of personal and sensitive health data, requiring pseudonymisation for analytical purposes and imposing strict data handling obligations. The Digital Government Act and the National Data Innovation Strategy provide the broader policy framework for data-driven governance and digital transformation across Korean government agencies, including NHIS. NHIS is also subject to internal audit controls and Ministry of Health and Welfare data-ethics oversight.
The technical infrastructure is reported to operate on NHIS on-premise systems within national government cloud architecture, with data maintained under domestic jurisdiction. No public disclosure has been made of specific third-party vendors or commercial software partners involved in developing or maintaining the machine learning models, and the technical architecture of the AI pipeline has not been publicly documented in detail. The International Social Security Association (ISSA) has recognised the NHIS fraud detection system as a good practice example for data-driven innovation in social security across the Asia and Pacific region, documenting the system in multiple publications and good practice awards.
Classification
AI Capabilities
Use Cases
Social Protection Functions
| SP Pillar (Primary) | Social insurance |
Programme Details
| Programme Name | National Health Insurance Service (NHIS) – Machine Learning-Based Fraud Detection and Prediction System |
| Programme Type | Health Insurance |
| System Level | Implementation/delivery chain |
South Korea's universal National Health Insurance programme, administered by NHIS under the Ministry of Health and Welfare, covering virtually all residents. The ML-based fraud detection system targets fraudulent billing by healthcare providers, with particular focus on long-term care benefits and health insurance eligibility fraud.
Implementation Details
| Implementation Type | Classical ML |
| Lifecycle Stage | Monitoring, Maintenance and Decommissioning |
| Model Provenance | Not documented |
| Compute Environment | National/government cloud |
| Compute Provider | NHIS on-premise infrastructure |
| Sovereignty Quadrant | I — Sovereign AI Zone |
| Data Residency | Domestic |
| Cross-Border Transfer | None |
Risk & Oversight
| Decision Criticality | High |
| Human Oversight | HITL |
| Development Process | Fully in-house |
| Highest Risk Category | Model-related risks |
| Risk Assessment Status | Not assessed |
Risk Dimensions
Data-related risks
Governance and institutional oversight risks
Model-related risks
Operational and system integration risks
Impact Dimensions
Autonomy, human dignity and due process
Equality, non-discrimination, fairness and inclusion
Privacy and data security
Safeguards
Deployment & Outcomes
| Deployment Status | Full Production Deployment |
| Year Initiated | 2020 |
| Scale / Coverage | Nationwide — covers virtually all residents of South Korea through the universal NHI programme |
| Funding Source | NHIS operational budget (statutory national health insurance financing) |
| Technical Partners | No public disclosure of vendor or software architecture. ISSA documentation does not identify third-party technology partners. |
Outcomes / Results
Between 2014 and 2021, detected 567,820 cases of fraud equivalent to approximately USD 174 million in claim value. In 2023, uncovered KRW 18.534 billion (approximately USD 14 million) in unjust enrichment. Significantly curbed financial leakage in long-term care financing.
Sources
- SRC-001-KOR-001 Government of Korea (2024). Digital Government Act and National Data-Innovation Strategy. Seoul: Ministry of the Interior and Safety. Available at: https://www.mois.go.kr/eng/ (Accessed 31 Oct 2025).
https://www.mois.go.kr/eng/ - SRC-002-KOR-001 International Social Security Association (ISSA) (2024). Operation of Machine Learning-Based Fraud Detection and Prediction System – National Health Insurance Service (Republic of Korea). Geneva: ISSA. Available at: https://www.issa.int/gp/259276 (Accessed 24 Mar 2026).
https://www.issa.int/gp/259276 - SRC-004-KOR-001 ISSA (2024) 'Detecting fraud in health care through emerging technologies', issa.int. Available at: https://www.issa.int/analysis/detecting-fraud-health-care-through-emerging-technologies (Accessed: 27 March 2026).
https://www.issa.int/analysis/detecting-fraud-health-care-through-emerging-technologies - SRC-005-KOR-001 ISSA (2024) 'Data-driven innovation in social security: Good practices from Asia and the Pacific', issa.int. Available at: https://www.issa.int/analysis/data-driven-innovation-social-security-good-practices-asia-and-pacific (Accessed: 27 March 2026).
https://www.issa.int/analysis/data-driven-innovation-social-security-good-practices-asia-and-pacific - SRC-003-KOR-001 National Health Insurance Service (NHIS) (2025). Illegality and Fraud Information System in NHIS – Fraud Detection. Wonju: NHIS Global Hub. Available at: https://nhis-globalhub.com/images/qr/202502/pdf/1105_14.pdf (Accessed 24 Mar 2026).
https://nhis-globalhub.com/images/qr/202502/pdf/1105_14.pdf
How to Cite
DCI AI Hub (2026). 'National Health Insurance Service (NHIS) – Machine Learning-Based Fraud Detection and Prediction System', AI Hub AI Tracker, case KOR-001. Digital Convergence Initiative. Available at: https://socialprotectionai.org/use-case/KOR-001