When Did Flawed Data Become OK?

“Data-Driven Thinking” is written by members of the media community and contains fresh ideas on the digital revolution in media.

Today’s column is written by Michael Mallazzo, director of marketing at Narrativ.

The data that powers the bulk of programmatic ad spend can only identify if a user is male or female about 50% of the time, according to an impeccably thorough report by Nico Neumann at Melbourne Business School.

In the eternal quest to figure out which “half of my ad budget is wasted,” we may want to start here. Neumann’s team estimates erroneous data costs advertisers $7 billion annually.

In 2014, Oracle paid roughly $400 million for BlueKai, a platform that pegs me as a married homeowner with two children who is interested in subcompact cars, rap and hip-hop, hunting and golf. I’m single, rent a Brooklyn apartment, proudly blast Springsteen and have never owned a car. And I hate golf.

After a cruelly ironic registration process that forced me to fork over my personal data to access my personal data, Acxiom’s abouthedata.com fared slightly better. It correctly identified me as male and provided some correct generalities, such as the killer insights that I’ve purchased apparel and food.

Abouthedata.com says that Acxiom helps companies “use data in responsible, ethical ways to create personalized experiences,” a refreshing tagline for these times. But personalization can only be as effective as the data that powers it.

So how did we come to accept the validity of flawed data?

Venture capitalists poured money into third-party data startups, which seemed to suggest the data was legit. Then large companies allocated massive budgets to programmatic advertising based on this data, which would also seem to suggest it was effective. And then big marketing clouds went shopping, further suggesting that the obscure periphery of our internet history data is worth hundreds of millions of dollars.

But what if large sectors of the data industry grew without their theses being fundamentally validated?

Of course, there is one company that understands the superfluousness of all this data: Amazon. To Amazon, you are what you buy. Nothing more, nothing less. Cambridge Analytica-type data is fundamentally meaningless to Amazon because it is less powerful than the first-party data it can provide to advertisers about shoppers. As Amazon prepares to eclipse $10 billion in ad revenue this year, the correlation between Amazon’s data and what we buy is why Sorrell lost the most sleep over its market entry.

In a macro sense, the totality of data mining fundamentally can’t match the simple power of contextual targeting that powers Google and Amazon. And Amazon’s targeting is probably more powerful in the long term. At the end of the day, people telling an advertiser what they are searching for is still a lot more accurate than the best artificially intelligent guess.

The irony here is that the current apparent ineptitude of data providers actually protects the privacy of internet users. Digitally savvy consumers overwhelm ad tech algorithms with so many data points that they become difficult, if not impossible, for data brokers to distill in any meaningful way. This “unknown” audience becomes less valuable to advertisers and enjoys a superior internet experience with their privacy intact.

This is the hallmark of a broken market. Advertising is effectively supposed to be a tax that we all pay to enjoy free services that have no business being free. But the externality of digital advertising is being disproportionately picked up by the subset of individuals whom firms can merely caricature. This should inspire deeper soul searching in the industry.

Follow Narrativ (@hellonarrativ) and AdExchanger (@adexchanger) on Twitter.
