All Text Analysis is Subjective

How to address inconsistencies in text analytics.

by Pascal De

Editor’s Note: This post is part of our Big Ideas series, a column highlighting the innovative thinking and thought leadership at IIeX events around the world.

Let’s face it: no captured text, whether from a survey form or from social media, can be analyzed with 100% objectivity. Still, analyzing text quantitatively is clearly useful, and market researchers have long used text as input because of its versatility and breadth. But we cannot pretend that any text analysis is free of ambiguity.

There are two reasons for this uncertainty:

The text itself doesn’t contain the full information or context, or
The person or AI tool analyzing the text is biased or inconsistent.

The Source of Issues

Often, these issues are interconnected and occur together: the lack of context in short texts makes biases in the analysis more apparent. For example, in a telecommunications context the statement “Good service” could mean either “Good customer service” or “Good network service”. A system or a person that always assigns “Good customer service” is consistent but highly biased. This shifts the analysis results in a specific direction and leads the research buyer to think that customer service is more important than network service. Recently, AI-based automated systems have emerged that are, at least in principle, able to analyze text more consistently because they don’t get tired or distracted.

When evaluating the accuracy of such automated systems, market researchers often compare them against manual coding, the current gold standard in text analysis. However, they tend to forget that manual coding is also biased and inconsistent, especially when coders need to keep track of hundreds of codes, some of which are notoriously difficult or even impossible to distinguish. We compared the results from different professional coders using the exact same codebook on the exact same data and found surprisingly low agreement across a variety of studies.
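One common way to quantify agreement between two coders is Cohen's kappa, which corrects the raw share of matching codes for the agreement expected by chance. The sketch below is only an illustration and not necessarily the measure used in our comparison. It assumes each response receives exactly one code, and the function name `cohen_kappa` and the sample codings are hypothetical.

```python
from collections import Counter


def cohen_kappa(codes_a, codes_b):
    """Cohen's kappa for two coders who each assigned one code per response."""
    if len(codes_a) != len(codes_b) or not codes_a:
        raise ValueError("both coders must code the same non-empty set of responses")
    n = len(codes_a)

    # Observed agreement: share of responses where both coders chose the same code.
    observed = sum(a == b for a, b in zip(codes_a, codes_b)) / n

    # Chance agreement: probability of a match if each coder assigned codes
    # independently, following only their own overall code frequencies.
    freq_a = Counter(codes_a)
    freq_b = Counter(codes_b)
    expected = sum(freq_a[code] * freq_b[code] for code in freq_a) / (n * n)

    if expected == 1.0:
        # Both coders used one identical code throughout; kappa is undefined,
        # so we report perfect agreement by convention (an assumption).
        return 1.0
    return (observed - expected) / (1 - expected)


# Hypothetical codings of six responses by two coders (illustrative data only).
coder_1 = ["customer service", "network", "price", "network", "customer service", "price"]
coder_2 = ["customer service", "customer service", "price", "network", "customer service", "network"]

kappa = cohen_kappa(coder_1, coder_2)
print(f"kappa = {kappa:.2f}")
```

In this made-up example the coders match on four of six responses, yet kappa is only about 0.5 once chance agreement is removed, which shows why raw match rates can overstate consistency.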

Keeping it Up to Code

In our experience, which is admittedly anecdotal, a good, concise codebook greatly improves consistency. Bias, on the other hand, can be reduced by letting many different coders work through the same data and then averaging their results. However, this is tedious and prohibitively expensive. I would argue that a better, much faster, and cheaper option is to use an AI system that has learned from as many different manual coders as possible. AI systems are well known to be biased, especially when trained on a single data source [1], but by learning from a diverse set of coders with different biases, the AI can learn to act as an “average coder”, resulting in an analysis with less bias than a full analysis by a single coder.
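To make the averaging idea concrete, here is a minimal sketch that computes each coder's share per code and then averages those shares across coders. It illustrates the manual averaging step only, not how an AI system is trained. The helper names `code_shares` and `average_shares` and the sample codings are hypothetical, and one code per response is assumed.

```python
from collections import Counter


def code_shares(codes):
    """Share of responses that one coder assigned to each code."""
    n = len(codes)
    return {code: count / n for code, count in Counter(codes).items()}


def average_shares(codings):
    """Average each code's share over several coders of the same data."""
    all_codes = sorted({code for codes in codings for code in codes})
    per_coder = [code_shares(codes) for codes in codings]
    return {
        # A coder who never used a code contributes a share of 0 for it.
        code: sum(shares.get(code, 0.0) for shares in per_coder) / len(per_coder)
        for code in all_codes
    }


# Hypothetical: three coders read the same four ambiguous "Good service" responses.
codings = [
    ["customer service"] * 4,  # always reads it as customer service
    ["network service"] * 4,   # always reads it as network service
    ["customer service", "customer service", "network service", "network service"],
]

averaged = average_shares(codings)
print(averaged)
```

Each of the first two coders alone would report a 100% share for one reading. Averaged over all three, the two readings come out balanced, which is the effect that a system trained on many coders is meant to approximate.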

Join our talk at IIeX North America to find out how we compared human coders and different AI-based systems for a large-scale study in Latin America and discuss novel ways to improve quantitative text analysis.

References

1. https://hbr.org/2019/10/what-do-we-do-about-the-biases-in-ai


Disclaimer

The views, opinions, data, and methodologies expressed above are those of the contributor(s) and do not necessarily reflect or represent the official policies, positions, or beliefs of Greenbook.
