The content on this page was provided by an independent third party and syndicated by XPR Media. Members of the editorial and news staff of the USA TODAY Network were not involved in the creation of this content.

AIM Intelligence and BMW Group Examine Gaps in Evaluating Enterprise AI Policy Compliance

Research reveals LLMs follow allowlist policies but systematically fail to enforce organizational prohibitions, exposing a critical gap in enterprise AI safety

SF, CA, UNITED STATES, February 12, 2026 /EINPresswire.com/ — Seoul, South Korea / Munich, Germany – January 2026 – BMW Group and AIM Intelligence, a leading AI safety startup, today announced the publication of COMPASS (Company/Organization Policy Alignment Assessment), the first systematic framework for evaluating whether large language models (LLMs) comply with organization-specific policies. The research, now available on arXiv, reveals a critical gap that remains under-measured in current evaluation practices: models that pass standard safety benchmarks often fail dramatically when enforcing the nuanced, context-dependent rules that govern real-world business operations.

Why Enterprise AI Policies Break Down in Practice

As organizations across healthcare, finance, automotive, and government sectors rapidly adopt LLMs for customer-facing applications, the research team discovered a fundamental asymmetry that poses significant risks for policy-critical deployments.
Key Findings:
Strong Allowlist Compliance: Models reliably handle legitimate requests with over 95% accuracy
Critical Denylist Failures: Models fail to correctly refuse prohibited requests in up to 97% of cases
Catastrophic Adversarial Vulnerability: Under adversarial conditions, some models refuse fewer than 5% of policy-violating requests
“Most AI safety tests focus on whether a model behaves safely in general,” said Dasol Choi, AI Safety Researcher at AIM Intelligence. “COMPASS looks at a more practical question: can an AI system reliably follow the specific rules of an organization? Our findings show that, in many real-world deployments today, the answer is often no.”

Why Generic AI Safety Isn’t Enough

The research addresses a critical disconnect between how AI systems are evaluated and how they are deployed. While existing safety benchmarks focus on universal harms such as toxicity and violence, real enterprises operate under complex internal policies—compliance manuals, operational playbooks, legal edge cases, and brand-specific constraints.
COMPASS evaluates models across four dimensions that typical benchmarks ignore:
1. Policy Selection: Can the model identify which policy applies to a given situation?
2. Policy Interpretation: Can it reason through conditionals, exceptions, and vague clauses?
3. Conflict Resolution: When rules collide, does the model resolve conflicts as the organization intends?
4. Justification: Can the model ground its decisions in actual policy text?

“Our evaluation revealed a striking asymmetry,” noted DongGeon Lee, AI Safety Researcher at AIM Intelligence. “While models achieve near-perfect accuracy on what they can do, they remain structurally vulnerable in enforcing what they must not do. This gap persists across model scales and architectures, indicating that scaling alone cannot solve the problem.”

Industry-Scale Validation

The research team applied COMPASS across eight diverse industry scenarios—Automotive, Government, Financial, Healthcare, Travel, Telecom, Education, and Recruiting—generating and validating 5,920 queries that test both routine compliance and adversarial robustness. Fifteen state-of-the-art models were evaluated, including leading proprietary and open-source systems.

Making Misalignment Measurable

Perhaps the most significant contribution of COMPASS is transforming alignment from a philosophical concern into an engineering problem. The framework and benchmark datasets are publicly available on GitHub and Hugging Face, enabling organizations to evaluate their AI systems against their own policies.

About the Research Collaboration

This research represents a collaboration between AIM Intelligence, BMW Group, Yonsei University, Pohang University of Science and Technology, and Seoul National University. The full paper, “COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs,” is available at https://arxiv.org/abs/2601.01836.

About AIM Intelligence

AIM Intelligence is a Seoul-based AI safety company specializing in automated red-teaming, real-time guardrails, and AI monitoring solutions. Founded in 2024, AIM Intelligence serves major enterprises and conducts research across large language models, multimodal systems, autonomous agents, and emerging physical AI. The company has published over 15 research papers at top-tier conferences including ICML, ACL, NeurIPS, and IEEE.

Team Cookie Official
Team Cookie
email us here
Visit us on social media:
LinkedIn
Facebook

Legal Disclaimer:

EIN Presswire provides this news content “as is” without warranty of any kind. We do not accept any responsibility or liability
for the accuracy, content, images, videos, licenses, completeness, legality, or reliability of the information contained in this
article. If you have any complaints or copyright issues related to this article, kindly contact the author above.

Information contained on this page is provided by an independent third-party content provider. XPRMedia and this Site make no warranties or representations in connection therewith. If you are affiliated with this page and would like it removed please contact pressreleases@xpr.media

E.C.O. Builders Inc. Highlights Strategic Bathroom Renovation Priorities That Enhance Long-Term Property Value

E.C.O. Builders Inc. Highlights Strategic Bathroom Renovation Priorities That Enhance Long-Term Property Value

Sound construction, practical layouts, and materials selected for durability tend to matter far more than decorative

February 21, 2026

Entrotech and Novara Materials Partner to Accelerate U.S. Thin-Film Heater Manufacturing Leadership

Entrotech and Novara Materials Partner to Accelerate U.S. Thin-Film Heater Manufacturing Leadership

Entrotech Inc., an advanced materials manuf., announced a strategic partnership with Novara Materials, Inc., an

February 21, 2026

Filmmaker Todd J. Stein Attaches Director Steven Feder and Casting Director Adrienne Stern The Final Fight Short Film

Filmmaker Todd J. Stein Attaches Director Steven Feder and Casting Director Adrienne Stern The Final Fight Short Film

The Final Fight, a powerful proof-of-concept inspired by true events, officially attached Steven Feder as Director and

February 21, 2026

Allergy Research Group Announces Peer-Reviewed Publication Advancing Thyroid and Endocrine Integration Science

Allergy Research Group Announces Peer-Reviewed Publication Advancing Thyroid and Endocrine Integration Science

Collaborative research led by ARG’s Medical Affairs and Scientific Advisory Board reinforces the company’s commitment

February 21, 2026

Hydrographic Equipment Rentals Explained: Tools Used to Map Waterways and Terrain

Hydrographic Equipment Rentals Explained: Tools Used to Map Waterways and Terrain

Reliable mapping comes from systems working together”— Joel Chaky BATON ROUGE, LA, UNITED STATES, February 13, 2026

February 21, 2026

Aaron’s Garage Doors Introduces Energy-Efficient Garage Door Solutions

Aaron’s Garage Doors Introduces Energy-Efficient Garage Door Solutions

The new energy-efficient garage doors are engineered to minimize heat transfer between the interior of the home and the

February 21, 2026

Mid City Cleaning of Baton Rouge Shares Structured Approach to Post-Event Cleaning and Facility Recovery

Mid City Cleaning of Baton Rouge Shares Structured Approach to Post-Event Cleaning and Facility Recovery

Post-event cleaning works best when approached as a process rather than a reaction”— Falesity Mecca BATON ROUGE, LA,

February 21, 2026

Arizona’s First Genio Hypoglossal Nerve Stimulation Procedure Performed in Scottsdale

Arizona’s First Genio Hypoglossal Nerve Stimulation Procedure Performed in Scottsdale

Leading Sleep Surgeon Dr. Jordan Weiner Brings Cutting-Edge CPAP Alternative to Arizona Patients Not every state has a

February 21, 2026

NVBDC Announces the Keith King Golf Classic Supporting JROTC Scholarships

NVBDC Announces the Keith King Golf Classic Supporting JROTC Scholarships

National corporate partners and veteran-owned businesses unite to fund scholarships for America’s future leaders.

February 21, 2026

Stache Pictures Launches Worldwide Sales for SACRED EVIL with Archstone Entertainment on Board as Agent

Stache Pictures Launches Worldwide Sales for SACRED EVIL with Archstone Entertainment on Board as Agent

LOS ANGELES, CA, UNITED STATES, February 13, 2026 /EINPresswire.com/ — Stache Pictures has announced the launch of

February 21, 2026

Varcomm Holdings, Inc. and Sierra Telephone Announce Definitive Agreement

Varcomm Holdings, Inc. and Sierra Telephone Announce Definitive Agreement

Varcomm Holdings, Inc. entered into a definitive agreement to acquire Sierra Telephone and its operating affiliates

February 21, 2026

Cross-Docking & Transloading Efficiency: Data-Driven Strategies to Reduce Port Congestion Delays

Cross-Docking & Transloading Efficiency: Data-Driven Strategies to Reduce Port Congestion Delays

Operational planning and equipment control help streamline container flow near major East Coast ports. ELIZABETH, NJ,

February 21, 2026

Dynamic HVAC Celebrates More Than 20 Years of Delivering Solutions for Heating and Cooling in Clarkston, MI

Dynamic HVAC Celebrates More Than 20 Years of Delivering Solutions for Heating and Cooling in Clarkston, MI

Waterford Township, MI – Dynamic HVAC, a leading furnace and AC repair company, is thrilled to announce the celebration

February 21, 2026

Willow Ash Roofing Announces Expanded Metal Roofing Services in Mount Pleasant, SC

Willow Ash Roofing Announces Expanded Metal Roofing Services in Mount Pleasant, SC

Mt Pleasant, SC – Willow Ash Roofing, a leading roofing contractor in the Charleston and Mt Pleasant area, is excited

February 21, 2026

Exercise & Cognitive Performance: Why Physical Activity Helps the ADD Brain

Exercise & Cognitive Performance: Why Physical Activity Helps the ADD Brain

Physical activity supports the same brain systems targeted in clinical treatment”— Dr. Stanford Owen GULFPORT, LA,

February 21, 2026

Sandy Rowley Expands Nationally, Offering AI-Enhanced SEO Services at Deep Discounts for Small Local Service Businesses

Sandy Rowley Expands Nationally, Offering AI-Enhanced SEO Services at Deep Discounts for Small Local Service Businesses

Sandy Rowley expands AI-powered SEO nationwide, offering deeply discounted marketing services for local contractors,

February 21, 2026

Treemendous Tree Care LLC Announces Expanding Stump Grinding Services in Mount Clemens, MI

Treemendous Tree Care LLC Announces Expanding Stump Grinding Services in Mount Clemens, MI

Clinton Township, MI – Treemendous Tree Care LLC, a local arborist serving Southeast Michigan with excellent tree

February 21, 2026

Frogtown Roofing Plus Announces Expanded Roof Repair Services in Toledo, OH

Frogtown Roofing Plus Announces Expanded Roof Repair Services in Toledo, OH

Maumee, Ohio – Frogtown Roofing Plus, a professional and licensed roofing company, is excited to announce it’s

February 21, 2026

Longtree Tree Service Announces Expanded Stump Grinding Services in Farmington, MI

Longtree Tree Service Announces Expanded Stump Grinding Services in Farmington, MI

Southfield, MI – Longtree Tree Service, a leading tree care and arborist company, is happy to announce it’s expanding

February 21, 2026

Islands In The Sun BBQ Announces ‘Top Deals of the Year Blowout’

Islands In The Sun BBQ Announces ‘Top Deals of the Year Blowout’

Canyon Lake, CA – Islands In The Sun BBQ, a leading online store specialising in premium outdoor kitchens and grills,

February 21, 2026

Skills-Based Hiring Takes Center Stage: Whitman Associates Expands Placement Strategies for 2026

Skills-Based Hiring Takes Center Stage: Whitman Associates Expands Placement Strategies for 2026

Whitman Associates shifts to skills-based hiring, prioritizing experience over credentials as the staffing industry

February 21, 2026

Adams Pool Solutions Expands Commercial Pool Construction Division to Meet Growing Regional Demand

Adams Pool Solutions Expands Commercial Pool Construction Division to Meet Growing Regional Demand

PLEASANTON, CA – February 13, 2026 – PRESSADVANTAGE – Adams Pool Solutions has announced the expansion of its

February 21, 2026

Composite Bonding East Dulwich Cosmetic Dentist Dr Mori Shahid Recommends Treatments at The Gardens Dental Centre (Smile 4 U)

Composite Bonding East Dulwich Cosmetic Dentist Dr Mori Shahid Recommends Treatments at The Gardens Dental Centre (Smile 4 U)

London, England – February 13, 2026 – PRESSADVANTAGE – The Gardens Dental Centre (Smile 4 U) has announced the

February 21, 2026

Digital Data for Resilience: Dmitry Erokhin Shows How Online Data Strengthen Crisis Communication and Climate Adaptation

Digital Data for Resilience: Dmitry Erokhin Shows How Online Data Strengthen Crisis Communication and Climate Adaptation

Dmitry Erokhin at IIASA Laxenburg Austria shows how search and social media signals can strengthen crisis

February 21, 2026

FullNet Communications Declares Quarterly Cash Dividend

FullNet Communications Declares Quarterly Cash Dividend

FullNet Communications’ Board of Directors Approves 12.2% Increase in Quarterly Cash Dividend Under Its Quarterly Cash

February 21, 2026

Jacaruso Launches Lead Shark, Hotel-Specific AI Sales Intelligence

Jacaruso Launches Lead Shark, Hotel-Specific AI Sales Intelligence

AUSTIN, TEXAS, TX, UNITED STATES, February 13, 2026 /EINPresswire.com/ — Jacaruso Enterprises announces the launch of

February 21, 2026

Law Office of Jason M. Hatfield Attorney Lauri Thomas Secures 50 Percent Wage-Loss Award in Arkansas Workers’ Compensation Case

Law Office of Jason M. Hatfield Attorney Lauri Thomas Secures 50 Percent Wage-Loss Award in Arkansas Workers’ Compensation Case

Springdale, Arkansas – The Law Office of Jason M. Hatfield announced that attorney Lauri Thomas has obtained a

February 21, 2026

McCready Law Welcomes Trial Lawyer Donald R. McGarrah as Partner

McCready Law Welcomes Trial Lawyer Donald R. McGarrah as Partner

Chicago, Illinois – McCready Law today announced that trial lawyer Donald R. “Don” McGarrah has joined the firm as a

February 21, 2026

RestoPros of Southern New Hampshire Expands Emergency Restoration Services Across Region

RestoPros of Southern New Hampshire Expands Emergency Restoration Services Across Region

CANTERBURY, NH – February 13, 2026 – PRESSADVANTAGE – RestoPros of Southern New Hampshire has expanded its emergency

February 21, 2026

Mindmachines.com Advances Mind Technology with Enhanced pROSHI Protocols for Meditation Device

Mindmachines.com Advances Mind Technology with Enhanced pROSHI Protocols for Meditation Device

Dallas, Texas – February 13, 2026 – PRESSADVANTAGE – Mindmachines.com has expanded the capabilities of its ROSHIwave

February 21, 2026

TaxFree RV Highlights Montana Registration Strategy as California Vehicle Owners Face Rising Tax Burdens

TaxFree RV Highlights Montana Registration Strategy as California Vehicle Owners Face Rising Tax Burdens

RED LODGE, MT – February 13, 2026 – PRESSADVANTAGE – TaxFree RV, a vehicle registration specialist operating since

February 21, 2026

Amana Care Clinic Announces Enhanced Walk-In Medical Services for Muscatine Residents

Amana Care Clinic Announces Enhanced Walk-In Medical Services for Muscatine Residents

MUSCATINE, Iowa – February 13, 2026 – PRESSADVANTAGE – Amana Care Clinic – Muscatine has announced expanded walk-in

February 21, 2026

SASGOG Selects Momentum Association Management as New AMC Partner

SASGOG Selects Momentum Association Management as New AMC Partner

The Society for Academic Specialists in General Obstetrics and Gynecology (SASGOG) has selected Momentum Association

February 21, 2026

Radiant Autism Center Co-Founder Bobby Whitney Joins Board of The Owen Foundation

Radiant Autism Center Co-Founder Bobby Whitney Joins Board of The Owen Foundation

Radiant Autism Center co-founder Bobby Whitney joins The Owen Foundation board to expand autism advocacy and family

February 21, 2026

Bonsai Marketing Expands AI-Powered Hyper-Local Growth Platform to Help Sonoma County Businesses Dominate Search in 2026

Bonsai Marketing Expands AI-Powered Hyper-Local Growth Platform to Help Sonoma County Businesses Dominate Search in 2026

Bonsai Marketing expands its AI-powered hyper-local platform to help Sonoma County businesses dominate local search and

February 21, 2026

Backyard Banger to Showcase World’s First Garden Hose Kitchen & Wet Bar on Wheels at The Colorado Garden & Home Show

Backyard Banger to Showcase World’s First Garden Hose Kitchen & Wet Bar on Wheels at The Colorado Garden & Home Show

"Whether it's deployments, birthdays, graduations, football games, you name it, America grills in the backyard," Ty

February 21, 2026

Immersive Leadership Model Accelerates Growth Through 40 Short, Powerful ‘Moments’

Immersive Leadership Model Accelerates Growth Through 40 Short, Powerful ‘Moments’

Executive coach and bestselling author Scott Abbott's new interactive resource is designed to inspire and recharge

February 21, 2026

FLEET DATA CENTERS ANNOUNCES PRICING OF $3.8 BILLION OF SENIOR SECURED NOTES FOR HYPERSCALE FACILITY IN GROWING RENO HUB

FLEET DATA CENTERS ANNOUNCES PRICING OF $3.8 BILLION OF SENIOR SECURED NOTES FOR HYPERSCALE FACILITY IN GROWING RENO HUB

Fleet Data Centers announces pricing of $3.8 Billion of senior secured notes for Hyperscale facility in rapidly growing

February 21, 2026

Grief, Resilience, and Redemption: ’Losing Michele’ Earns Best Seller Distinction Following Widespread Praise

Grief, Resilience, and Redemption: ’Losing Michele’ Earns Best Seller Distinction Following Widespread Praise

In a powerful testament to the healing power of storytelling, author Alicia Trew’s deeply personal memoir, has

February 21, 2026

Bentley Rancho Mirage Celebrates Global Launch Of New Bentley Continental GT S And GTC S

Bentley Rancho Mirage Celebrates Global Launch Of New Bentley Continental GT S And GTC S

We’re thrilled to bring these remarkable vehicles to the Coachella Valley and to our clients who expect the very best

February 21, 2026