Written by Daniel John Dunevant on May 18, 2025, 4:45 am
As artificial intelligence becomes more deeply embedded in everyday life, it’s worth examining how large language models (LLMs) like ChatGPT are trained—and who gets to decide what they learn. One growing concern is the potential for these models to be intentionally influenced through the data they are trained on, raising serious ethical and societal questions.
Unlike traditional search engines, which return results from a broad range of web sources, LLMs generate answers based on patterns in the data they were trained on. This data is chosen by the model's developers, meaning there is significant control over what information is included—or excluded. While most major AI companies aim to use diverse and representative datasets, the selection process is inherently curated and subject to human bias.
In theory, yes. A powerful government, political group, or wealthy individual could seek to influence a language model by pressuring or partnering with its creators. If they succeed in shaping the training data—or influencing the system’s guardrails—they could subtly steer the model’s outputs to reflect specific narratives, ideologies, or misinformation. This is especially concerning as users increasingly turn to LLMs for answers, often without questioning the source.
While not immune to manipulation through SEO tactics or algorithm tweaks, search engines still draw from the open web. This decentralization makes them more resistant to centralized control. Users can see multiple viewpoints, verify sources, and compare information. With language models, the synthesized answer may feel more authoritative—even if it lacks transparency or source citations.
As LLMs become more influential, there’s a growing need for transparency about how they're trained and governed. Independent audits, diverse input sources, and public disclosure of training practices are crucial to building trust. Without these safeguards, there's a real danger that these tools could become vehicles for silent influence, shaping public understanding without users even realizing it.
The concern isn’t just hypothetical—it’s about who gets to shape the narratives of the future. AI language models are powerful tools, but like any tool, their impact depends on who controls them and how they’re used. As we embrace these technologies, we must also demand accountability, transparency, and protections against undue influence.
