Rogue archivist Carl Malamud sez,
If you want access to all the tax filings of US nonprofit corporations, the IRS will sell you sets of DVDs for $2580 per year of data. We acquired all of these filings from 2002 to the present, a set of DVDs weighing 98.7 pounds. I'm pleased to report that all 6,461,326 of those returns are now successfully extracted and available on our new bulk data feed.
This data really should be available directly from the IRS at no charge. Accordingly, we've drafted a deed of gift offering the system back to the government.
Until the .gov people do take it over, we're offering access to all 5 TBytes of data using the http, ftp, and rsync protocols. Our hope is that developers will come up with lots of new uses for this information. In order to make the database even more useful, we've started working with Captricity to extract data from the forms and make it available as computable data (e.g., CVS files instead of TIFF images!).
Once search engines such as Google finish indexing the data, the tax filings of nonprofits will show up in the search results. When you search for a nonprofit, the first thing you see ought to be their home page. But, the next thing you ought to see are things like how much they pay their CEO, how much revenue goes for fundraising, and if they spend money to lobby public officials.
Nonprofits in the US had $1.87 trillion in 2009 revenues and it is these periodic filings that make the nonprofit marketplace work properly, just like SEC EDGAR filings help make the corporate markets work properly.
Reports of Exempt Organizations
The Stormtrooper Decanter is on back-order, but you can pre-order one from the next batch for £22 — it’s based on Andrew Ainsworth’s original movie helmet moulds from 1976, and will provide endless opportunities to point to lowball glasses and say things like “aren’t you a little short for a Stormtrooper drink?” (via Bonnie Burton)
Yahoo has released a machine-learning model called open_nsfw that is designed to distinguish not-safe-for-work images from worksafe ones. By tweaking the model and combining it with places-CNN, MIT’s scene-recognition model, Gabriel Goh created a bunch of machine-generated scenes that score high for both models — things that aren’t porn, but look porny.
I dote on fidget gadgets — soothing gizmos intended to give your hands something to keep busy with, like modern worry-beads — and while you can’t buy Chris Bathgate’s amazing machined sliders, and the Fidget Cube Kickstarter just closed, there’s still Thinkgeek’s new Jumbo Noah Fidget Toy, which looks like a lot of fun and […]
Nothing is more frustrating than needing to edit or sign a PDF and not having access to the original document. That’s why PDFpenPRO is a must-have app in our books.With this extremely useful app, you can merge, markup, and create PDF documents without ever having to convert your PDFs into word processor file formats. Type directly onto […]
From self-driving cars to stock market predicting software to the recommendations you get on Amazon and Netflix, machine learning is at the core of modern technology. You could find yourself building technology that is literally changing the world with the skills you’ll learn in The Complete Machine Learning Bundle. This bundle of 10 courses includes 406 lessons that will teach […]
This Python Mega Course will help you learn to code by teaching you to build 10 real-world apps that each highlight a unique use of Python.Job prospects for coders are still growing steadily—and with Python being one of the most popular coding languages out there today, it’s important for job seekers to demonstrate a widespread understanding of the […]