The Mirror Is Already Distorted

We built AI from human data. The problem is that human data was never neutral to begin with.

There is a framing that has taken hold in mainstream conversation about artificial intelligence, and it goes something like this: AI is dangerous because it might develop its own values, its own agenda, its own autonomous moral reasoning. The concern is understandable, but it is also a distraction. The more immediate problem is not that AI will become something alien to us. It is that AI is already a very accurate reflection of us, and we are not comfortable with what we see.

AI language models are, at their core, statistical compression systems. They are trained on an enormous archive of human-produced artefacts: literature, journalism, scientific papers, social media commentary, legal judgements, policy documents, advertising copy, and billions of everyday exchanges. From this corpus, they learn patterns of language, meaning and association. They do not derive truth. They derive probability. What word, what framing, what claim most commonly follows from this context, given everything humans have written down?

That distinction matters enormously, because the archive is not neutral.

A Curated Archive, Not a Complete One

Every training corpus is shaped by a series of editorial decisions that happened long before any AI engineer made a choice. What gets digitised? What gets published in the first place? What is preserved, indexed, amplified, and linked to? The archive AI draws from skews heavily toward English. It skews toward the written over the spoken, toward the published over the marginalised, toward recent decades over earlier centuries. Entire cultural traditions, oral knowledge systems and non-Western epistemologies are structurally underrepresented, not through malice, but through the accumulated weight of who has historically controlled the means of documentation.

So the mirror is already distorted before it reflects anything back. The question is not whether bias enters the system. It is which biases, whose biases, and how deep the distortion runs.

Compression Does Not Mean Dilution

There is a tempting assumption that bias gets diluted in a large enough dataset. The opposite is often true. Training processes do not treat all patterns equally. A prejudice that appears once in a corpus carries little statistical weight. A prejudice that is structurally encoded across millions of job postings, judicial decisions, news articles, medical diagnoses and lending records becomes load-bearing. The model does not just inherit that prejudice. It reconstructs it with mathematical confidence, stripping away the hesitation, dissent and ambiguity that might otherwise signal to a human observer that something contested is happening.

That stripping of ambiguity is one of the most underappreciated risks in AI governance. Human bias is often legible. It comes with tells: a pause, a qualification, a pattern that a trained eye can interrogate. AI-generated outputs can encode the same bias with a fluency and apparent objectivity that makes it far harder to detect.

The WEIRD Expectation Problem

This brings us to perhaps the deepest tension in how we relate to AI: we expect it to be fairer, more rational and more objective than we are ourselves, even though it is built from data in which our irrationality and our unfairness are already embedded.

That expectation is not universal. It reflects a distinctly Western, Educated, Industrialised, Rich, Democratic worldview, what researchers call the WEIRD framework. The assumption that reason and objectivity are separable from cultural context, that a sufficiently sophisticated system can transcend the conditions of its own production, is itself a cultural artefact. It belongs to a particular intellectual tradition that also produced a disproportionate share of AI’s training data.

We are, in effect, asking a statistical reconstruction of our own cultural self-image to transcend the framework that produced it. That is not a technical problem. It does not yield to a technical solution. The demand that AI be objective is itself a value-laden position, embedded in the same corpus we are trying to audit.

The Feedback Loop Nobody Is Talking About Enough

There is a further problem that compounds all of the above, and it is one that most AI governance conversation has not yet adequately confronted. AI outputs are now entering the corpus. Generated text is being published, indexed, cited and shared at scale. Future models will train on artefacts that include the outputs of prior models. The statistical reconstruction is no longer even anchored purely to original human production. The mirror begins to reflect its own reflection, and the distortions compound in ways that become progressively harder to trace to source.

Researchers refer to this as model collapse, or synthetic data contamination. The concern is not simply that AI-generated content is proliferating, though it is, at a volume that already makes reliable detection difficult. The deeper concern is what happens to the statistical properties of future training data when a growing share of it was itself produced by a model with particular patterns, gaps and amplified biases. Those characteristics do not average out across successive generations of training. They tend to narrow. The diversity of expression, framing and perspective that gives a large corpus its breadth begins to contract, because the synthetic content that fills it is drawn from a much smaller distribution of probable outputs than the full range of human production it ostensibly represents.

Consider what this means practically. A model trained today on internet text will have absorbed millions of distinct voices, with all their contradictions, idiosyncrasies and disagreements. A model trained five years from now on a corpus that is twenty or thirty percent AI-generated will have absorbed a version of those voices that has already been filtered, smoothed and statistically regularised by its predecessors. The heterogeneity that researchers and ethicists point to as a partial counterweight to bias, the fact that humans disagree, that the corpus contains multitudes, begins to erode.

There is also a provenance problem with significant governance implications. When a human expert writes a flawed analysis, that analysis can be traced, attributed, contested and corrected. When AI-generated content enters the training pipeline, often without labelling, often aggregated across thousands of sources, the ability to identify where a particular pattern came from, and therefore whether it should be trusted, becomes extremely limited. Bias that was once traceable to a specific historical context or institutional practice becomes, over successive model generations, simply the way things are. It loses its history. It loses its contestability.

This is not a hypothetical risk on a distant horizon. It is already happening, and the governance frameworks being built right now were largely designed for a static corpus, not a self-replenishing one. Auditing a model for bias assumes you can identify the source of that bias. That assumption becomes harder to sustain with every generation of training that incorporates the outputs of the last.

The Harder Question

If AI is a mirror, then improving what it reflects back at us is not primarily a technical project. It requires changing what human society produces, values, documents and preserves. It requires confronting who controls the archive, who decides what counts as knowledge, and whose experience of the world gets encoded as signal rather than noise.

The governance frameworks being developed right now, the policies, audits, bias checklists and transparency requirements, are necessary. But they are operating at the surface of a much deeper structural problem.

The most uncomfortable question for any board, executive team or government considering AI adoption is this: if this system is reflecting us back at ourselves, are we prepared to sit with what we see?

Kylie Dalton is an AI ethics and governance consultant working at board and senior leadership level. Her work focuses on AI risk in complex public systems, with particular attention to vulnerable populations. She is Chair of the AI Expert Panel at the Governance Institute of Australia.

Have Any Questions?

The Mirror Is Already Distorted

We built AI from human data. The problem is that human data was never neutral to begin with.

A Curated Archive, Not a Complete One

Compression Does Not Mean Dilution

The WEIRD Expectation Problem

The Feedback Loop Nobody Is Talking About Enough

The Harder Question

The Mirror Is Already Distorted

What Two Days at Australia’s AI Agents Summit Taught Me About Governance

Your AI Has a Bias Problem. The Real Issue Is Where It Learned.

News Categories

account

cart

Where does your AI strategy start?

How Kylie Helps

Quick Links

Contact