Here are my entirely incoherent live notes from the Brewster Kahle Q&A yesterday at Mann
Bit of description about IA.
- 26 terabytes of text in LOC. Can have it in back of your car:) Snapshot taken of web every 2 months since 1996-4 billion pgs/1 terabyte of data per snapshot now
- Working w/librarians to curate, building searchable collections. 1-2 petabytes of TV, 20 channels of DVD-quality international TV. Not available yet (not b/c of rights)
- Lot of moving images, news and audio files, books (1 million, freely available).
- 8 scanning centers internationally—1000 books per day.
Issue of in print works and copyright
- Google is current monopoly and we as a field are helping Google to achieve this
- Research-Different style of use for large corpora
- Not in favor of Google so what’s solution? IA interested in working with universities. IA good at out of coppyright (2 mil by end of 2009);
- In print: publishers making available—want to have another platform like Amazon, not iTunes model. Google “sucking the air out of the room, killing the people that feed them;” our content is the thing that’s important, not Google.
- How to deal with OP and in copyright? Loan model. You check them out and “return” them. How to return? “Melt in your hand PDF?” We’ll figure it out but important to work on it fast. “If you don’t stand up for your right to loan, you will lose it.”Pay per view may work for Google but not for libraries; we’re looking at using this in new and different ways
- Move quickly and try something new “get dirty” find out what works and doesn’t. IA scanned and made copies of web pages available w/no lawyers involved. Worked because nonprofit, respectful, and take down if objections. Move fast or we’re toast; we’ll be customer service for Google/Elsevier/outlets.
- Support other avenues (PLos) Need open platforms
How to provide access to developing countries with low bandwidth?
- Likes OLPC, 200 dpi screen good enough to read JPGs not fonts. Universal access to developing countries with no high speed Internet.
- Hasn’t been a day’s walk from an Internet café. How try to do that? POD, hopping networks, broader Internet infrastructure, caching servers (OLPC—one person gets slow response but others get the same thing faster). Computers cheaper though telecom monopolies 12 billion a yr industry in libraries. Spend your money in other ways (nice thought)