Spidering Word files for embarrassing metadata

Cory Doctorow 12:10 pm Thu Apr 1, 2004

A hacker spidered every English microsoft.com site and sucked down all the Word documents, then used a script to identify interesting erasures left behind by the revision-tracking feature. Some interesting stuff fell out of his investigation.

A pointless idea came to my mind that instant: why not run a gentle web spider against all Microsoft sites in English, specifically looking for other instances of tracking data not removed from documents? I coded a bunch of scripts and let them run through the night, fetching approximately 10,000 unique documents; over 10% was identified as containing change tracking records. I decided to collect only those with deleted text still present, yielding a crop of over 5% of all documents. Quite impressive. Below, you will find a brief (and rest assured, incomplete) list of the most entertaining samples I've run into, along with some speculation (and only speculation) as to the reasons we see them.
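The scripts themselves aren't reproduced here, so as a rough illustration of the "deleted text still present" check, here is a minimal Python sketch. It rests on an assumption that does not match the 2004 investigation: it reads the newer zip-based .docx format, where tracked deletions are stored as `w:delText` elements inside `word/document.xml`, rather than the binary .doc files that were actually spidered. The names `deleted_text` and `sample_docx` are made up for this example, and the sketch only inspects the main document body, not headers, footers, or comments.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used inside .docx files.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def deleted_text(docx_bytes):
    """Return the text of tracked deletions (w:delText) in a .docx body."""
    with zipfile.ZipFile(io.BytesIO(docx_bytes)) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))
    return [node.text for node in root.iter("{%s}delText" % W_NS) if node.text]


def sample_docx(body_xml):
    """Hypothetical helper: wrap a body fragment in a minimal .docx-like zip.

    Only word/document.xml is written, which is all deleted_text() reads;
    Word itself would not open this file.
    """
    document = (
        '<w:document xmlns:w="%s"><w:body>%s</w:body></w:document>'
        % (W_NS, body_xml)
    )
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("word/document.xml", document)
    return buffer.getvalue()


# One kept run and one tracked deletion that was never accepted or rejected.
fragment = (
    '<w:p><w:r><w:t>Kept text. </w:t></w:r>'
    '<w:del w:id="1" w:author="someone">'
    '<w:r><w:delText>Removed text.</w:delText></w:r>'
    '</w:del></w:p>'
)
print(deleted_text(sample_docx(fragment)))
```

A document with no surviving deletions yields an empty list, so a spider could keep only the files for which `deleted_text` returns something.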

Link

(Thanks, Eli the Bearded!)
