HOWTO make a $300 high-speed book scanner

Discuss

20 Responses to “HOWTO make a $300 high-speed book scanner”

  1. drivenbyboredom says:

    I scanned books for archive.org with a scanner not much different… You know, except for the fact that the cameras cost 5 times as much as this whole set up does…

    Then again maybe if they used cheaper cameras I wouldnt have been laid off…

  2. Anonymous says:

    #1 Outside of copyright expired works and government documents, I don’t think that there’s any real way for this to happen under copyright law. I don’t even think that libraries would be faxing and scanning journal articles to meet Inter-Library LOAN requests if the current market for buying individual articles from publishers had existed when the appropriate court cases were decided (and more importanly the 1977 copyright law was written)
    #7 And with severely brittle books, just about the only real option is to guilitine the bindings and scan from a stack of pages. Years ago I worked in the photoduplication division of the Library of Congress, although in those days we were microfilming books rather than scanning them. “We had to destroy the book to save it,” was an unpleasent part of our job, especially after a pretty impressive fire in the bulk de-acidification facility.

  3. jccalhoun says:

    This is a great project but man, 78 step and it doesn’t actually tell you anything. How about a parts list? How about some discussion of the camera used? A little less style and a little more substance please.

  4. jvilhuber says:

    Visions of the Librareome idea from V.Vinge…

  5. Anonymous says:

    The LEGO book scanner has an automatic page turner: http://www.engadget.com/2006/02/22/build-your-own-fullauto-bookscanner/

    They use a piece of gum to turn the pages.

    My own DIY scanning designs here, although I’ve never built one: http://www.pgdp.net/wiki/User:Branko#Planetary_scanner

  6. Anonymous says:

    JCCalhoun,

    Every step has a parts list, or pictures of the parts. I even describe the places and the types of dumpsters you might find them in, free.

    Step 74 specifically addresses which camera, and 76 describes exactly how to set them up. Step 37 covers how to hack the firmware on them.

    I even scanned a public domain book on hand tools and how to use them. If that’s not enough, I hope you can tell me what more to do. ;)

    You might want to give it a second looking-over. The PDF version is a bit easier to read.

    I think everyone would like to see an automatic page turner, but practically speaking it is not as useful as it seems. You can’t use them with brittle, old books (thanks Zaren) and what happens if you miss a page? It screws up all the software automation. The way this scanner is designed, you can actually relax and watch a DVD on its screen as you scan if it so pleases you.

    I am going to make a video comparing an automatic turner to me operating the scanner. I think you’ll see that my scanner compares favorably.

    Thank you all for your interest (and votes, if you voted!!)

  7. Anonymous says:

    You should also really check out the work at bkrpr.org The idea and the price are the same but it has many fewer parts and requires much less technical ability to put together or operate. Check out their youtube video of it in action.

  8. Anonymous says:

    Hi again,

    I didn’t expect that you’d read the whole thing (after all, it is written instructions for people who plan to build one), but I do think your criticism was unfounded, and I tried to give evidence by linking you to the steps that addressed your concerns — that way you wouldn’t have to read the whole thing.

    You have to be a member of Instructables (and logged in) to download the PDF. I’ll see if I can upload it somewhere else for you tonight.

    This scanner produces more than high enough resolution for OCR of textbooks and every other book I’ve thrown at it. The error rate isn’t too bad. It will be better with the next version of software, and Aaron is working on it now.

  9. Capissen says:

    @ #11 My thoughts exactly! Though that scanning method involved a little less, er, preservation of the original copy, if I recall.

  10. x99901 says:

    well hewwo to you too.

    Seriously though, to supplement interlibrary loans there should be a scan-exchange program in which you offer to scan a book in your library in exchange for a scan of a book in another library.

    Sure it would be more expensive than mailing books around, but eventually all the rare books that people actually read would be in the interlibrary scan database.

  11. urederra says:

    Sorry for the useless typo comment, but the title should read HOWTO, no HOWO, I think.

  12. jordan314 says:

    For me, the open-source beta software they released to turn the scanned pages into PDFs is the most interesting part. Does anyone know if other software exists like this exists? I’d prefer a mac version, but I’m going to try their PC beta.
    I’ve been wanting to scan all of my documents and throw them out for years now. I’m pretty sure I will use a camera and tripod rather than a scanner. I have several pieces of software that can create PDFs but they typically only work with the scanner, and I’m still determining which has the best OCR. I’ve been meaning to write an automator script that would grab a bunch of scans in iphoto, run them through the OCR program, combine the PDF, and maybe delete the original scans.
    Thanks!

  13. Takuan says:

    I love teh little Instructables, I love them lots!

  14. jackdavinci says:

    This is cool, but I’m still trying to figure out a way to batch scan my old photos.

  15. Anonymous says:

    ‘HOWO’ is actually an acronym for ‘How Our Wordcrunching Overlords’

  16. Anonymous says:

    Say Jordan314, our Matlab script will work on the Mac (it was developed on a Mac) but you’ll need a Matlab license, which is far from free. Sorry about that — it’s only been in development for a couple weeks.

    But our software is really only useful if you have the book scanner. You might want to look into other options, like Atiz Snapter, or ABBY Fotoreader, but I’m not sure if they’re available on the Mac (in fact, I kind of doubt it — regretfully).

    I know everyone wants an automatic page turner, but I honestly don’t think they’re much faster than my system. To prove this, I used Youtube Doubler to make a mashup of myself scanning against the machine. I wasn’t even rushing. ;)

    This is the link:
    http://youtubedoubler.com/?video1=http%3A%2F%2Fwww.youtube.com%2Fwatch%3Fv%3DY377v34RWwY&video2=http%3A%2F%2Fwww.youtube.com%2Fwatch%3Fv%3D-oOXXpxzETA

    If it doesn’t work, you can get it on the first page of my instructable.

  17. jccalhoun says:

    #12
    I really do think this is a neat project but I did find it too much effort to read through the description of the process. I guess I just wanted more of the kind of instructions that come with a flatpack bookshelf or something.
    I couldn’t get the pdf to load. It looked like the url was messed up but there might be a problem on my end.
    Any attempt to OCR the books? Whenever I scan books in I always convert them to plain text and I’m curious if this method would be high enough resolution for the OCR software to work well.

  18. zikzak says:

    The killer app for book scanning is an automatic page-turner.

    Having to turn each page of the book manually while scanning it is acceptable but slow. The ability to simply stick in a book and go play outside for a half hour while a robot flips through the pages would be game-changing. Better yet would be the ability to queue up a stack of volumes and simply let the thing run for days.

    I’m sure industrial grade machinery exists to do this, but after seeing this I have more faith that some enterprising makers could actually put such tech within reach of an average hobbyist…or free information guerrilla.

  19. Zaren says:

    I may still be under NDA as far as the technology I’m about to discuss, so I’ll be vague :)

    I spent a few months working for a company that had a book scanning project, with a setup very similar to this. The page turning was all manual, though. If you’re dealing with books that have any sort of aging to the pages, that last thing you want to do is just stick them in a machine and let it go. Brittle, inflexible pages can get ruined very easily if there isn’t a deft hand doing the turning.

Leave a Reply