Thursday, October 10, 2013

The difficulty of finding out what we don't know

A snippet from a long Wired article, "Scientific Data Has Become So Complex, We Have to Invent New Math to Deal With It":
Topological methods are a lot like casting a two-dimensional shadow of a three-dimensional object on the wall: they enable us to visualize a large, high-dimensional data set by projecting it down into a lower dimension. The danger is that, as with the illusions created by shadow puppets, one might be seeing patterns and images that aren’t really there. (italics mine)
“There’s a terrible math joke that topologists can’t tell the difference between their rear end and a coffee cup because the two are topologically equivalent,” said Benjamin Recht, a computer scientist at the University of California, Berkeley who says it is so far unclear when TDA works and when it might not. The technique rests on the assumption that a high-dimensional big data set has an intrinsic low-dimensional structure, and that it is possible to discover that structure mathematically. Recht believes that some data sets are intrinsically high in dimension and cannot be reduced by topological analysis. “If it turns out there is a spherical cow lurking underneath all your data, then TDA would be the way to go,” he said. “But if it’s not there, what can you do?” And if your dataset is corrupted or incomplete, topological methods will yield similarly flawed results. 
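To make Recht's point concrete, here is a minimal sketch of what "intrinsic low-dimensional structure" looks like in practice. It does not use TDA at all: it swaps in plain PCA (a linear projection, computed with NumPy's SVD) as a stand-in for "casting a shadow". The two synthetic data sets, the helper name `variance_captured`, and every size and noise level are assumptions made up for illustration, not anything taken from the article.

```python
import numpy as np


def variance_captured(data, k):
    """Fraction of total variance captured by the top-k principal directions."""
    centered = data - data.mean(axis=0)
    # Squared singular values of the centered data are proportional to the
    # variance along each principal direction.
    singular_values = np.linalg.svd(centered, compute_uv=False)
    variances = singular_values ** 2
    return float(variances[:k].sum() / variances.sum())


rng = np.random.default_rng(0)
n_points, n_dims = 500, 20  # illustrative sizes (assumption)

# Case 1: 20-dimensional points that really live near a 2-dimensional plane.
latent = rng.normal(size=(n_points, 2))
embedding = rng.normal(size=(2, n_dims))
low_dim = latent @ embedding + 0.01 * rng.normal(size=(n_points, n_dims))

# Case 2: independent noise in all 20 dimensions, so no hidden low-dimensional shape.
high_dim = rng.normal(size=(n_points, n_dims))

low_frac = variance_captured(low_dim, 2)
high_frac = variance_captured(high_dim, 2)

print(f"hidden 2-D structure: {low_frac:.1%} of variance kept in a 2-D shadow")
print(f"no hidden structure:  {high_frac:.1%} of variance kept in a 2-D shadow")
```

In the first case the two-dimensional shadow keeps nearly everything. In the second it keeps only a small slice, yet a scatter plot of that shadow would still show a cloud of points to read shapes into, which is the shadow-puppet danger the article describes.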


7 comments:

Eric the Fruit Bat said...

There's a reason why Star Trek TNG came to rely more and more on the holodeck and soap opera.

Icepick said...

There’s a terrible math joke that topologists can’t tell the difference between their rear end and a coffee cup because the two are topologically equivalent.

This joke only holds up if the coffee cup has a handle that makes a full loop.

/pedantry

Methadras said...

Icepick said...

This joke only holds up if the coffee cup has a handle that makes a full loop.

/pedantry


What about a Möbius loop? :D

Trooper York said...

I thought we agreed that now that Freeman isn't around we will not have any math posts?

Trooper York said...

You need some Red Sox posts Lem!

For example why do all the Red Sox want to grow beards so they look like Rosie O'Donnell's vagina.

Ignorance is Bliss said...

... topologists can’t tell the difference between their rear end and a coffee cup ...

It's actually quite easy to tell the difference just by the taste of the coffee.

Mitch H. said...

IiB, that would be cheating. Taste is an additional dimension.

From my experience, the thing to watch out for is scientists with crap datasets trying to torture meaning out of them by piling abstraction after abstraction on top of the dataset, and ignoring or obscuring the outrageous confidence intervals.