US20250278669
2025-09-04
Physics
G06N20/00
The patent application introduces a system for generating counterfactual explanations of artificial intelligence (AI) models. It focuses on making complex machine learning models, such as deep neural networks (DNNs), more explainable by providing actionable insights. A counterfactual explanation identifies the minimal changes to the input data that would alter the model's output toward a more desirable result.
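As an illustration of the general idea (not the patent's claimed algorithm), a counterfactual for a differentiable classifier can be found by minimizing a loss that trades the desired change in prediction against the size of the input perturbation. The logistic model, feature values, and hyperparameters below are assumptions made for the sketch only.

# Minimal counterfactual-search sketch; model and constants are illustrative.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def predict(x, w, b):
    """Probability of the positive class for a simple logistic model."""
    return sigmoid(np.dot(w, x) + b)

def find_counterfactual(x, w, b, target=1.0, lam=0.1, lr=0.05, steps=500):
    """Search for x' close to x whose prediction moves toward `target`.

    Minimizes (f(x') - target)^2 + lam * ||x' - x||_1 by gradient descent,
    i.e. a small input change that shifts the model's output.
    """
    x_cf = x.copy()
    for _ in range(steps):
        p = predict(x_cf, w, b)
        # Gradient of the squared prediction loss with respect to x'.
        grad_pred = 2.0 * (p - target) * p * (1.0 - p) * w
        # Subgradient of the L1 proximity penalty.
        grad_prox = lam * np.sign(x_cf - x)
        x_cf -= lr * (grad_pred + grad_prox)
    return x_cf

# Usage: a toy 3-feature instance; the search nudges its prediction toward class 1.
w = np.array([1.5, -2.0, 0.5])
b = -0.2
x = np.array([0.1, 0.8, 0.3])
x_cf = find_counterfactual(x, w, b)
print("original prediction:", predict(x, w, b))
print("counterfactual prediction:", predict(x_cf, w, b))
print("feature changes:", x_cf - x)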
A key component of the system is the Counterfactual Engine, which detects features of a machine learning model and computes an objective over weighted perturbations of those features. The engine transforms candidate counterfactuals so that the resulting explanations are both consistent and diverse, supporting a clearer understanding of how the model's outcomes can be changed.
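One plausible form of such an objective, shown below as an assumption rather than the patent's claimed equations, combines a validity term (how close each candidate's prediction is to the desired target), a perturbation penalty weighted per feature, and a diversity bonus across the candidate set. The function names, arguments, and coefficients are illustrative.

# Illustrative objective over a set of candidate counterfactuals.
import numpy as np

def objective(counterfactuals, x, predict_fn, target, weights, lam=1.0, gamma=0.5):
    """Score a set of candidate counterfactuals for instance x.

    counterfactuals : (k, d) array of candidates
    weights         : (d,) per-feature weights penalizing perturbations
    """
    cfs = np.asarray(counterfactuals)
    # Validity: distance of each candidate's prediction from the desired target.
    validity = np.mean([(predict_fn(cf) - target) ** 2 for cf in cfs])
    # Weighted perturbation penalty: heavily weighted features are changed less.
    proximity = np.mean([np.sum(weights * np.abs(cf - x)) for cf in cfs])
    # Diversity: mean pairwise distance among candidates (larger is better).
    k = len(cfs)
    pair_dists = [np.linalg.norm(cfs[i] - cfs[j])
                  for i in range(k) for j in range(i + 1, k)]
    diversity = np.mean(pair_dists) if pair_dists else 0.0
    return validity + lam * proximity - gamma * diversity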
The system can be implemented in various forms, including as a computer program product or as a computer system with specific hardware components. The Counterfactual Engine applies weights that penalize feature perturbations, so that the proposed changes remain minimal while still altering the model's output. The engine iteratively updates the counterfactuals and the weighting vectors to refine the explanations.
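One way this alternating refinement could look is sketched below; it is an assumed scheme, not the claimed procedure. Here grad_fn is assumed to return the gradient of the prediction loss with respect to the counterfactual, and the weighting update follows an iteratively reweighted L1 style: features with small observed perturbations receive larger penalties, driving them toward zero and concentrating the modification on a few features.

# Sketch of alternating refinement of counterfactuals and weighting vectors.
import numpy as np

def refine(x, grad_fn, target=1.0, lr=0.05, lam=0.1,
           outer_iters=10, inner_steps=50):
    d = x.shape[0]
    weights = np.ones(d)          # initial weighting vector
    x_cf = x.copy()
    for _ in range(outer_iters):
        # Update the counterfactual under the current per-feature weights.
        for _ in range(inner_steps):
            g = grad_fn(x_cf, target) + lam * weights * np.sign(x_cf - x)
            x_cf -= lr * g
        # Update the weighting vector: small perturbations are penalized more,
        # which pushes later iterations toward sparser, smaller edits.
        weights = 1.0 / (np.abs(x_cf - x) + 1e-6)
        weights /= weights.mean()
    return x_cf, weights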
Illustrative embodiments highlight the use of invariant maximum values for features and of distance algorithms that keep modifications small. The system supports multiple configurations, allowing alterations and adaptations suited to different data processing environments, and it is designed to work with diverse data types and storage devices, offering flexibility in deployment.
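A hedged reading of the invariant maximum values and distance algorithm is a range-normalized distance: each feature's change is scaled by a fixed per-feature maximum (or range), so no single large-valued feature dominates the measure of modification being minimized. The function name and the example ranges below are illustrative assumptions.

# Sketch of a range-normalized distance over feature perturbations.
import numpy as np

def normalized_distance(x, x_cf, feature_max, feature_min=None):
    """L1 distance where each feature change is scaled by its invariant range."""
    feature_min = np.zeros_like(feature_max) if feature_min is None else feature_min
    span = np.maximum(feature_max - feature_min, 1e-12)
    return np.sum(np.abs(x_cf - x) / span)

# Usage with illustrative feature ranges (e.g., age 18-90, income 0-200000).
x = np.array([35.0, 42000.0])
x_cf = np.array([35.0, 51000.0])
print(normalized_distance(x, x_cf,
                          feature_max=np.array([90.0, 200000.0]),
                          feature_min=np.array([18.0, 0.0])))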
The patent emphasizes that the described embodiments are examples and not limitations, suggesting that many variations are possible within its scope. The system can be integrated with various existing technologies, architectures, and applications, enhancing its applicability across different AI-driven fields. This adaptability ensures broad utility in improving AI model transparency.