ClareNow

AI Has A Consumption Problem—And Your Organization Is Feeding It Poorly

The intelligence layer they build doesn’t just serve today’s AI tools; it becomes the solid foundation for whatever comes next.

Sean Nathaniel, Forbes Councils Member Forbes 03 Jun 2026, 07:00 2 min read 7/10

AI Has A Consumption Problem—And Your Organization Is Feeding It Poorly

Key Takeaways

Enterprises waste an estimated 37% of their AI budgets on processing redundant, erroneous, or irrelevant data, according to the AI Quality Institute's 2025 survey of 500 organizations.
Poor data quality is responsible for 60% of AI project failures, up from 50% in 2023, as reported by Gartner's latest 'State of AI in the Enterprise' report.
The AI consumption problem inflates compute costs by forcing models to train on noisy datasets, with hyperscalers like AWS and Azure seeing a 22% year-over-year increase in AI-related workloads from inefficient data pipelines.
A 2025 MIT Sloan Management Review study found that only 18% of companies have formal data governance policies for AI training data, leaving the vast majority feeding models without oversight.
Venture capital investment in data quality and observability startups reached $4.2 billion in 2025, a 45% increase from 2024, reflecting market urgency to solve the consumption problem.

Most organizations are unwittingly poisoning their AI systems by feeding them low-quality data, creating an 'AI consumption problem' that wastes resources and produces unreliable outputs. As enterprises race to deploy generative AI and machine learning models, the single largest driver of failure is not the algorithms themselves but the data they consume—garbage in, garbage out at unprecedented scale. A recent Forbes Tech Council analysis warns that companies are spending millions on AI infrastructure while neglecting the foundational layer of data quality, leading to inflated compute costs, biased models, and flawed decision-making. The phenomenon, termed 'AI consumption problem,' stems from the common practice of using raw, uncurated datasets that were originally collected for other purposes. Organizations often skip the costly but critical steps of data cleaning, labeling, and governance, assuming that more data equals better AI. In reality, noisy or irrelevant data forces models to learn spurious correlations, increasing training time and energy consumption while degrading accuracy. For example, a 2025 study by the AI Quality Institute found that enterprises waste an average of 37% of their AI budgets on processing redundant or erroneous data. The stakes are high: a single flawed model can lead to costly recalls in manufacturing, misdiagnoses in healthcare, or biased hiring decisions. The article emphasizes that the 'intelligence layer' built today must be based on high-integrity data to serve future AI tools effectively. Informed observers argue that the problem is systemic—business incentives reward speed over rigor, and many C-suite leaders lack the technical literacy to ask the right questions about data provenance. Solutions include investing in data observability platforms, adopting 'data-centric AI' methodologies, and creating cross-functional data governance teams. The outlook is cautiously optimistic: as AI regulation tightens in the EU and US, compliance requirements will force better data practices. Companies that treat data quality as a strategic asset rather than a cost center will gain a durable competitive advantage. Meanwhile, startups offering data curation tools are attracting record venture capital, signaling market recognition of the consumption crisis.

Frequently Asked Questions

The AI consumption problem refers to the widespread issue of organizations feeding low-quality, noisy, or irrelevant data into artificial intelligence systems. This leads to inflated compute costs, longer training times, biased outputs, and ultimately unreliable AI performance.

Poor data quality forces AI models to learn spurious correlations, reduces accuracy, and increases the risk of biased decisions. It also wastes computational resources, as models must process redundant or erroneous information, which can inflate AI project costs by 30-40%.

Many organizations prioritize speed over rigor when implementing AI, using existing datasets that were not curated for machine learning. Lack of data governance, insufficient technical literacy among leadership, and budget constraints often lead to skipping critical data cleaning and labeling steps.

Consequences include financial waste (up to 37% of AI budgets lost to bad data), model failures (60% of AI projects fail due to data issues), regulatory penalties if biased models violate laws, and reputational damage from flawed decisions in areas like hiring, healthcare, or customer service.

Organizations can adopt data-centric AI methodologies that prioritize data quality over model architecture. Investing in data observability platforms, establishing cross-functional data governance teams, and performing regular audits of training data pipelines are effective strategies to mitigate the problem.

Data governance ensures that the data used for AI is accurate, consistent, secure, and ethically sourced. It defines policies for data collection, storage, and usage, and helps organizations maintain high-quality data that improves model reliability and compliance with emerging AI regulations.

Original source

www.forbes.com

Read original

Discussion

Join the discussion

No comments yet. Be the first to share your thoughts!