Day by Day Cartoon by Chris Muir

Wednesday, February 26, 2014

120 Scientific Papers Withdrawn After Being Proven to be Gibberish. No, Actual Computer-Generated Gibberish.

From Ace of Spades HQ:
Multiple layers of painstaking fact-checking editorial oversight.

So, some scientists at MIT had invented a program called "SCIgen" to generate, by computer, random scientific-sounding papers. They did this for amusement.
But people (especially in China, apparently) have been using the program to generate papers and then submit them to actual scientific publishers' subscription services. “The papers are quite easy to spot,” says Labbé, who has built a website where users can test whether papers have been created using SCIgen. His detection technique, described in a study published in Scientometrics in 2012, involves searching for characteristic vocabulary generated by SCIgen. Shortly before that paper was published, Labbé informed the IEEE of 85 fake papers he had found. Monika Stickel, director of corporate communications at IEEE, says that the publisher “took immediate action to remove the papers” and “refined our processes to prevent papers not meeting our standards from being published in the future”. In December 2013, Labbé informed the IEEE of another batch of apparent SCIgen articles he had found. Last week, those were also taken down, but the web pages for the removed articles give no explanation for their absence.

Ruth Francis, UK head of communications at Springer, says that the company has contacted editors, and is trying to contact authors, about the issues surrounding the articles that are coming down. The relevant conference proceedings were peer reviewed, she confirms — making it more mystifying that the papers were accepted.
It's possible the reviewers chalked up the computerese nonsense to a language barrier, figuring the "scientist" who wrote them spoke Chinese as a first language and was struggling with the English language. But this only goes so far, because, ultimately, these papers didn't make sense in any language. Because they were gibbrerish.

Labbé (the guy who built the tool for finding these fakes) wanted to prove how easy it was to spoof the system so he created a fake scientist named "Antkare."
Why? Why would 120 fake, gibberish, nonsense papers be submitted to these publishers? And how did they make it onto the system?

Well possibly this is a prank, or an attempt to prove how easy it is to get nonsense published, as Labbé already proved.

Or, possibly:

Apparently, in science, one gross method of ranking your authority is by counting up the number of times you're cited in other scientific papers.

So, what if you could just spam a lot of fictitious, gibberish papers and get them into "the system" (the subscription services) citing you a whole bunch of times? Then your crude bean-counting ranking goes up.
I'm guessing, it's the old "Publish or Perish" axiom preached at institutions of higher learning for those attempting to get tenure.

No comments:

Post a Comment