Archive:Opensearch

From FreeCulture.org

Revision as of 19:04, 22 July 2007

This is the page for the (as yet unnamed) project to create search tools for students assigned public domain texts for class, and promote the public domain and open educational resources (OERs).

Inspiration

[1]

Project Summary

  • Databases containing public domain or open-licensed texts that might be assigned in a (college) class (not run by us)
    • Why college students? Because they buy their own books
    • Why texts that might be assigned in class? Because students are already looking for these books, and we can save them money (built-in audience)
  • API to search those databases and return results with a high signal-to-noise ratio for our purposes
    • Social searching? ("x% of users who searched for 'moby dick' found this useful...")
    • Why high signal-to-noise? We want to make this very easy to use (low barrier to entry)
  • Scripts to format output for various formats and devices
    • Read online, download, print
    • Why various formats and devices? We want to make this very easy to use (low barrier to entry)
  • End-user interfaces
    • Web site, Facebook application, etc.
    • Why various end-user interfaces? We want to make this very easy to use (low barrier to entry)
  • Promotional campaign
    • People need to know about this in order to use it!
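
The layers listed above could fit together roughly as follows. This is only a minimal sketch under stated assumptions, not a design decision: `SearchResult`, `search_all`, and the two `fake_*` backends are hypothetical names, the backends are in-memory stand-ins rather than real calls to any database, and the URLs are placeholders. The vote fields illustrate the "x% of users found this useful" idea.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Optional


@dataclass
class SearchResult:
    title: str
    author: str
    source: str            # name of the database the text came from
    url: str               # link back to the original copy
    useful_votes: int = 0  # users who marked this result useful
    total_votes: int = 0   # users who voted at all

    def usefulness(self) -> Optional[float]:
        """Percentage of voters who found this useful, or None if no votes."""
        if self.total_votes == 0:
            return None
        return 100.0 * self.useful_votes / self.total_votes


# A backend is any function that takes a query and returns results.
Backend = Callable[[str], Iterable[SearchResult]]


def search_all(query: str, backends: List[Backend]) -> List[SearchResult]:
    """Query every backend and rank results, most useful first."""
    results = [r for backend in backends for r in backend(query)]
    # Results without any votes sort after results that have votes.
    return sorted(
        results,
        key=lambda r: (r.usefulness() is None, -(r.usefulness() or 0.0)),
    )


def fake_gutenberg(query: str) -> List[SearchResult]:
    # Hypothetical stand-in for a real lookup.
    catalog = [SearchResult("Moby Dick", "Herman Melville", "Project Gutenberg",
                            "https://example.org/gutenberg/moby-dick", 45, 50)]
    return [r for r in catalog if query.lower() in r.title.lower()]


def fake_archive(query: str) -> List[SearchResult]:
    # Hypothetical stand-in for a real lookup.
    catalog = [SearchResult("Moby Dick", "Herman Melville", "Internet Archive",
                            "https://example.org/archive/moby-dick", 10, 40)]
    return [r for r in catalog if query.lower() in r.title.lower()]


if __name__ == "__main__":
    for hit in search_all("moby dick", [fake_archive, fake_gutenberg]):
        print(f"{hit.title} ({hit.source}): {hit.usefulness():.0f}% found this useful")
```

Keeping each database behind the same function signature means a new archive can be added without touching the interfaces built on top of `search_all`.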

Name

Everything needs a catchy name. What's our idea?

Databases to search

What archives do we want to search?

  • Must contain public domain or open-licensed texts that might be assigned in a (college) class
  • The higher the signal-to-noise ratio, the better

Project Gutenberg

  • Site: Project Gutenberg
  • Size:
  • Signal-to-noise: High
  • License: [2]
    • Verbatim redistribution OK (changes to formatting allowed)
    • Or, remove Project Gutenberg trademark and license; then, treat as public domain (any changes allowed)
    • There are some further conditions if you charge money for copies, which we won't, so no need to worry.
    • There are also some copyrighted titles in the database. These titles are marked as such, and redistribution without permission is prohibited.
      • How many titles is this?
      • Can these be included for our purposes? (Are we "redistributing", or simply pointing users to the Project Gutenberg copy?)
    • Note: These titles are public domain in the United States; in other countries, YMMV
  • Search API:
    • (Brendan) I once contacted the site admin about this, for adding it to a book inventory system, but if I remember correctly I never received a response. Things could be different now. Scripting their own search forms should not be too difficult, but asking would probably be nicer...
    • Should we just store a copy of every book? We can figure that out later.
  • Bandwidth usage policies:
    • Any concerns here? (Will they get upset if we're hitting their database?)
  • Output formatting:
    • Existing code for making books look nice?

Internet Archive

  • Site: Internet Archive
  • Size:
  • Signal-to-noise:
  • License:
  • Search API:
  • Bandwidth usage policies:
    • Any concerns here? (Will they get upset if we're hitting their database?)
  • Output formatting:
    • Existing code for making books look nice?

I think the concern was that not all of their links were good. Additionally, a good part of their archive comes from Project Gutenberg, so there would be some (somewhat annoying) overlap. However, if they have books that Project Gutenberg has, but from another source, that's a good thing, and it makes it more likely that they will have the texts students are looking for.
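
One way the overlap could be handled is to group records that describe the same work and list every source that offers it. The sketch below is an assumption, not an agreed approach: `normalize` and `group_duplicates` are hypothetical helpers, and matching on a normalized title and author is a deliberately crude rule that would miss variant titles such as subtitles or translations.

```python
import re
from collections import defaultdict
from typing import Dict, List, Tuple


def normalize(text: str) -> str:
    """Lowercase and turn every run of punctuation or spaces into one space."""
    return re.sub(r"[^a-z0-9]+", " ", text.lower()).strip()


def group_duplicates(
    records: List[Tuple[str, str, str]]
) -> Dict[Tuple[str, str], List[str]]:
    """Group (title, author, source) records that describe the same work.

    Returns a mapping from a normalized (title, author) key to the sources
    offering that work, so an interface can show one entry per work and
    still list every available copy.
    """
    groups: Dict[Tuple[str, str], List[str]] = defaultdict(list)
    for title, author, source in records:
        key = (normalize(title), normalize(author))
        if source not in groups[key]:
            groups[key].append(source)
    return dict(groups)


if __name__ == "__main__":
    sample = [
        ("Moby Dick", "Herman Melville", "Project Gutenberg"),
        ("Moby-Dick", "Herman Melville", "Internet Archive"),
        ("Walden", "Henry David Thoreau", "Internet Archive"),
    ]
    for (title, author), sources in group_duplicates(sample).items():
        print(f"{title} / {author}: {', '.join(sources)}")
```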

A small political consideration: Linda Frueh at the IA was very excited about the program, and it might seem rude to design a whole program that they really like yet exclude their archive.

Open Library

  • Site: Open Library
  • How does this play into what we're doing?

WikiSource

  • Site: Wikisource
  • Size:
  • Signal-to-noise:
  • License:
  • Search API:
  • Bandwidth usage policies:
    • Any concerns here? (Will they get upset if we're hitting their database?)
  • Output formatting:
    • Existing code for making books look nice?

Audiobooks (human readings)

Librivox

  • Site: Librivox
  • Size:
  • Signal-to-noise:
  • License:
  • Search API:
  • Bandwidth usage policies:
    • Any concerns here? (Will they get upset if we're hitting their database?)
  • Output formatting:
    • Existing code for making books look nice?

Other databases

  •  ?

Questions to ask

  • Site:
  • Size:
  • Signal-to-noise:
  • License:
  • Search API:
  • Bandwidth usage policies:
    • Any concerns here? (Will they get upset if we're hitting their database?)
  • Output formatting:
    • Existing code for making books look nice?

Search API

Output formats

Read online

Download

  • Plain text
  • HTML
  • PDF
  • Other formats?

Print

  • PDF?
  • CSS media-type: print?
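
If the HTML and print options were combined, a single page could serve both reading online and printing by carrying a print stylesheet. The following is a minimal sketch, assuming the source text separates paragraphs with blank lines; `text_to_html` and `PRINT_CSS` are hypothetical names and the styling values are arbitrary.

```python
import html

# Screen rules first, then overrides applied only when printing.
PRINT_CSS = """
body { font-family: Georgia, serif; max-width: 40em; margin: auto; }
@media print {
  body { font-size: 11pt; max-width: none; }
  nav { display: none; }
}
"""


def text_to_html(title: str, plain_text: str) -> str:
    """Wrap a plain-text book in a simple HTML page with a print stylesheet."""
    # Assumption: paragraphs are separated by blank lines.
    paragraphs = [p.strip() for p in plain_text.split("\n\n") if p.strip()]
    body = "\n".join(
        # Re-join hard-wrapped lines and escape characters such as < and &.
        "<p>{}</p>".format(html.escape(" ".join(p.split())))
        for p in paragraphs
    )
    return (
        "<!DOCTYPE html>\n"
        '<html><head><meta charset="utf-8">'
        f"<title>{html.escape(title)}</title>"
        f"<style>{PRINT_CSS}</style></head>\n"
        f"<body><h1>{html.escape(title)}</h1>\n{body}\n</body></html>"
    )


if __name__ == "__main__":
    sample = "Call me Ishmael.\n\nSome years ago, never mind\nhow long precisely."
    print(text_to_html("Moby Dick", sample))
```

A PDF download could then be produced from the same HTML by whatever conversion tool the project settles on, which keeps one formatting path instead of several.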

Print-on-demand

Audio recording of computer reading (vocoder)

  • Would this be useful?
    • I don't think so. (Who wants to listen to a vocoder for multiple hours?) But we could include open-licensed audio recordings of human readings in the databases we search. --Gavin 02:56, 23 July 2007 (JST)

End-user interfaces

Web site

Brendan made a mockup. If it were online, we could link to it!

  • We should be sure to have a linkback to the original database
    • They'd probably appreciate it if we could even provide a direct link to their support / donations page
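
The linkback could be generated alongside each result. This is a small sketch only: `render_attribution` is a hypothetical helper, the `attribution` class name is arbitrary, and the URLs in the usage example are placeholders rather than real support pages.

```python
import html
from typing import Optional


def render_attribution(source_name: str, source_url: str,
                       donate_url: Optional[str] = None) -> str:
    """Build an HTML snippet crediting the database a text came from."""
    links = [
        'Source: <a href="{}">{}</a>'.format(
            html.escape(source_url, quote=True), html.escape(source_name)
        )
    ]
    # The support/donations link is optional; not every database has one.
    if donate_url:
        links.append(
            '<a href="{}">Support {}</a>'.format(
                html.escape(donate_url, quote=True), html.escape(source_name)
            )
        )
    return '<p class="attribution">' + " | ".join(links) + "</p>"


if __name__ == "__main__":
    print(render_attribution(
        "Project Gutenberg",
        "https://example.org/pg",          # placeholder URL
        "https://example.org/pg/donate",   # placeholder URL
    ))
```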

Facebook application

Firefox extension

  • (link)

Plug-in to online course management systems

  • Blackboard

Promoting it

  • Partners
  • Online outreach
  • Media outreach
  • Chapters & campus outreach

Partners

Student PIRGs

  • (link)
  • Student PIRGs could use this tool in their arsenal in their pro-OER activities

Public Knowledge

  • [4]
  • almost certainly would offer server space

Internet Archive

  • [5]
  • might be interested in some form of collaboration

Timeline

What needs to be done? Who will do it when?

The Future

What will happen to the project in one year's time? (If the answer is "No plans (yet)" that's fine, but we still should think about this.)