How Google's book-scanner cleverly corrects for the curvature of an open book

Discuss

21 Responses to “How Google's book-scanner cleverly corrects for the curvature of an open book”

  1. eclectro says:

    it should be noted that the google book OCR solution pre-dates the scan robot that others have posted a link to here, which is probably more accurate as well. I find the scan robot solution more elegant than googles, Though I wonder how fast it is compared to google’s scanner. The cost for the scan robot is around $107K.

  2. schmod says:

    Book scanners have existed for a *long* time, and have generally tended to perform some sort of geometric correction.

    The hard part is flipping the pages.

  3. Enochrewt says:

    #4: I’m a Bibliophile, but I think sacrificing one book for the sake of having it accessible to millions of people is acceptable.

    Unless that book is super rare or one of a kind of course.

  4. Anonymous says:

    i can’t really say anything specific, because of scary nondisclosure agreements (i figure google is probably the entity to be afraid of if you’re going to be paranoid online). but i will say i worked as a book scanning grunt for google, and their method is much faster than any of these machines. this is also not a “new” method for them. just recently figured out by the public from these patents, i guess.

  5. Anonymous says:

    @#10 – You’re describing Google’s alternate method for scanning books. They do that when publishers send in books to be scanned. Or so I’ve heard. Haven’t actually seen it being done myself.

  6. treq says:

    I’m wondering if this is for special case books? Existing high-speed book scanners have a vacuum-powered wedge that inserts between two pages, applies vacuum, then scans while retracting the scan head upward, always normal to the page, so text curvature isn’t an issue typically and they’re able to get up to 2400 pages/hour (vid reference: http://www.youtube.com/watch?v=pb6E4Hrgi9Y )

    maybe google’s method will offer faster scan rates if they somehow get the page flipping to be quicker than it is?

  7. Anonymous says:

    Unfortunately Google’s book scanning can go wildly wrong. I would really like to know how this managed to pass muster and get posted in this horrific state:

    Hand-book of the Locomotive, Including the Construction, Running, and Management of Locomotive Engines and Boilers
    By Stephen Roper
    Edition: 14
    Published by E. Meeks, 1890
    324 pages

    Hand of archivist on pages:
    http://books.google.com/books?id=n3IVm-6PzdcC&printsec=frontcover#PPA201-IA1,M1

    http://books.google.com/books?id=n3IVm-6PzdcC&printsec=frontcover#PPP2,M1

    http://books.google.com/books?id=n3IVm-6PzdcC&printsec=frontcover#PPP3,M1

    Weird smeared text at bottom:
    http://books.google.com/books?id=n3IVm-6PzdcC&printsec=frontcover#PPA198,M1

    Pages in mid-flip during scan:
    http://books.google.com/books?id=n3IVm-6PzdcC&printsec=frontcover#PPA201-IA2,M1

    Sideways page:
    http://books.google.com/books?id=n3IVm-6PzdcC&printsec=frontcover#PPA109,M1

    Weird warped-edge page: (de-warp algorithm gone wrong?)
    http://books.google.com/books?id=n3IVm-6PzdcC&printsec=frontcover#PPA229,M1

    Total scanning disaster:
    http://books.google.com/books?id=n3IVm-6PzdcC&printsec=frontcover#PPA809,M1

    http://books.google.com/books?id=n3IVm-6PzdcC&printsec=frontcover#PPA819,M1

    This archivist was either intoxicated or on drugs, just did not care about doing a good job. DO-OVER!

  8. pruek says:

    Why using glass plates that flattened each page is not very efficient? Compare to what Google use, I believe placing book in a V-shaped is much better for OCR because you have 100% curvature-free for the first place. You can also check out this V-Shaped book scanner at http://atiz.com.

  9. JollyOrc says:

    Thankfully they’re not using the “shred first, assemble later” technique that was featured at Verner Vinges’ Rainbows End…

  10. Fang Xianfu says:

    Alternatively, you could just open the book at a 90 degree angle instead of putting it on a flat surface. A while back there was a home-made book scanner on here that used that method.

  11. Anonymous says:

    Why aren’t they using this? Seems like you’ll get a better result every time.

    http://www.youtube.com/watch?v=6vI-DZIVOQw

  12. Anonymous says:

    1) Flipping pages instead of moving the entire scanner assembly uses less energy and is probably less prone to break down. When you deal in the volume that Google deals in, a 10% energy savings could be worth many millions of dollars.

    2) With Google’s method, it is click, flip, click, flip, click, flip all within seconds. The speed advantage must be great.

  13. Darren Garrison says:

    Hm. Maybe that explains how they do their “dual layer” approach to PDFs, too– I’ve had many Google Books PDFs (and some Microsoft scanned ones, too) be in two layers– one with the text and all black lines barely visible, and one that is just text and black lines. When changing pages or opening the document, for just a moment you see the textless page before the text is overlayed on it.

  14. Cheqyr says:

    @2: Exactly what I was thinking … opening a book flat is basically asking to crack the spine.

  15. ansel says:

    When it comes to Google’s book scanning, I think this is the more important story – How Google is building a fee-based closed-source monopoly on the digital library of the future

  16. Anonymous says:

    Google should throw its $ and brains to use either CT xray or MRI in combination with a 3D OCR technique, that could be used to scan entire shelves, blocks of books or books rolling by on conveyor.

    CT xray and MRI are basically 3D OCR (organic ‘character’ recognition) tools.

    The biggest challenge will be developing the technique and calibration to resolve paper and ink and their variability. Also, I don’t believe current technology has fine enough resolution to distinguish pages.

    But conceptually, it seems reasonable.

    CanadianAlien

  17. Lex10 says:

    They’re Google! They could use monks fer chrissakes!

  18. Anonymous says:

    When the article title said that they “cleverly” corrected for page curvature, I assumed “cleverly” meant “wow! that was obvious now that you think of it.” Instead, I get “oh noes! science!”

    Very intriguing, i admit, but too much to swallow at 12 midnight.

  19. Anonymous says:

    This is more interesting than actually useful.

    Any halfway decent OCR software would be able to accurately scan text with the little bit of curvature.

  20. Anonymous says:

    Can’t they afford to sacrifice a book, get rid of the spline with a shear and use a much simpler scanner which would do a perfect job without all that fuss?

Leave a Reply