Date issued: | 2006-09-28 |
---|---|
Status of document: | Draft Implementation Note. This is only a preliminary draft that is still under development; it has not yet been presented to the whole community for review. |
This version: | http://www.language-archives.org/NOTE/toolbox-20060928.html |
Latest version: | http://www.language-archives.org/NOTE/toolbox.html |
Previous version: | None. |
Abstract: |
This Note presents a framework for creating OLAC static repositories from a collections of records created and maintained as a Toolbox database. |
Editors: |
Alexander Nakhimovsky, Colgate University (mailto:[email protected]) |
Copyright © 2006 Tom Myers and Alexander Nakhimovsky . This material may be distributed and repurposed subject to the terms and conditions set forth in the Creative Commons Attribution-ShareAlike 2.5 License.
This Note presents a framework for creating OLAC records and static repositories in Toolbox. The framework consists of the following components:
A Toolbox project (http://localhost:8080/tbox/OLAC_in_Toolbox_Generic.zip), created by Joan Spanne of SIL, that can serve as a template for creating OLAC records.
A program called BoxReader that converts Toolbox files into XML (XHTML, to be precise) without loss of information. BoxReader can operate on a wide range of Toolbox data files, as long as they are accompanied by the matching .typ files.
XSLT stylesheets that convert the output of BoxReader into an OLAC repository.
An HTML file, index.html, with forms for submitting Toolbox files and running the software (BoxReader and XSLT files). This file also serves as the User Guide. The User-Guide portion of the page is reproduced in the final section of this Note.
To create OLAC metadata in Toolbox, you only need to download Joan Spanne's project. The software can be run remotely from http://csproj2.colgate.edu:8080/tbox or you can run it on your own computer after installing the Tomcat web server. (Note that the computer does not to have to be connected to the Internet, but it does need Tomcat installed.) Both Tomcat and BoxReader are written in Java, so Java also needs to be installed. After installing Java and Tomcat, download and install the that contains all the software. The rest of this Note provides installation instructions and a user guide. If you would like to see the transformations of Joan Spanne's example, go directly to http://csproj2.colgate.edu:8080/tbox20060906/ and run it as a demo.
The platform consists of two big enabling components: Java and Tomcat.
Start by checking whether you have Java on your computer, and if so, what version.
On Windows. go Start - Settings - Control Panel. If there is no Java control panel, you need to install Java. If there is such a panel, open it, select the General tab, click About. If the version is earlier than 1.5.0, you do have to install Java.
Java installers can be downloaded from the Sun J2SE download page (http://java.sun.com/javase/downloads/index.jsp). The page has a help link at http://java.sun.com/downloads/faq.html. Since we use JSP (JavaServer Pages) in our code, you must download and install the JDK, not JRE. Install the latest update, but probably not the one that included Netbeans or JavaEE.
Once you have installed Java or established that you already have an adequate version, you need to create the JAVA_HOME environment variable, pointing to the directory where you have installed Java.
Tomcat's main web page is http://tomcat.apache.org/. It has links to both Download and Documentation pages. Within the current documentation (August 2006, version 5.5, http://tomcat.apache.org/tomcat-5.5-doc/setup.html) there is a setup section giving setup directions for Windows and several versions of Unix. Follow the directions so that Tomcat starts automatically every time your restart your computer.
Installing Tomcat on the OS X Macintosh is slightly more involved. Official but seriously dated instructions can be found at Java and Tomcat on Mac OS X, Part I (http://developer.apple.com/internet/java/tomcat1.html) and Java and Tomcat on Mac OS X, Part II (http://developer.apple.com/internet/java/tomcat2.html). One problem is that a version of Java adequate for Tomcat is not necessarily adequate for the OLAC webapp; so far as we know, you cannot upgrade your Java to 1.5.0 or better (confusingly known as J2SE 5.0) unless your OS X is upgraded to 10.4 or better. If you have OS 10.4 but not Java 1.5.0, you can download the Java upgrade from Apple's Developer Connection Java downloads (http://developer.apple.com/java/download/). We used Ben Slade's Instructions for Installing Java, Tomcat, and MySQL on Mac OS X 10.4.x (http://www.benslade.com/projects/java/tomcat/InstallingJavaTomcatMySQLOnMacOSX/) without great difficulty.
Once Tomcat is installed, place the tbox.war file into Tomcat's subdirectory called webapps. Tomcat will automatically recognize it and unpack it into a subdirectory of webapps called tbox. This completes the installation.
The process of converting metadata created in Toolbox into an OLAC static repository goes through two steps. In the first step, Toolbox-created metadata is converted into an XML document containing a collection of metadata records. The records are in the correct OLAC format, but the entire collection is not an OLAC-conformant repository. We make this a separate step in case you want to copy and paste the records into a repository that already exists. In the second step, the output of the first step is converted into a full-fledged static repository. An important part of the second step is the process of entering your OLAC headers, such as the repository name, its base URL, the name of the curator(s), and so on. (See below subsection on Customizations.)
At http://csproj2.colgate.edu:8080/tbox we provide a form for each step so you can specify the input files. They can be specified in two ways: by providing a URL or by pasting the contents of the input file into a text area. If the URL input box is not empty then the URL is used, otherwise the contents of the text area are used.
If you simply click the submit buttons of Forms 1 and 2, you will see the demo we have prepared. If you want to use the software to create and convert your own metadata files, there are three options:
Create your own Tomcat installation as explained above.
Place your data files on a web server and provide URLs.
If you only want to make changes in the Toolbox project, you can open your .db and/or .typ files in a text editor and copy-paste into the appropriate text areas of the forms.
For additional detail and a demo, go to the csproj2.colgate.edu site. As the site explains, an important part of the process is
The example we show in our demo identifies the static repository as belonging to N-topus Software, with our own email addresses. These values are defined within a listing of lookup:strings at the top of the repository.xsl file, and can easily be edited in a plain text editor. (Only the values, not markup, need to be changed.) Here are the lines to be edited:
<lookup:strings>
<lookup:string id='repositoryName'>Virtual Toolbox-Sample Archive</lookup:string>
<lookup:string id='baseURL'>http://www.n-topus.com/nsf/toolbox/sample/olac-records.xml</lookup:string>
<lookup:string id='protocolVersion'>2.0</lookup:string>
<lookup:string id='adminEmail'>mailto:[email protected]</lookup:string>
<lookup:string id='earliestDatestamp'>2006-05-03</lookup:string>
<lookup:string id='deletedRecord'>no</lookup:string>
<lookup:string id='granularity'>YYYY-MM-DD</lookup:string>
<lookup:string id='repositoryIdentifier'>sample.toolbox.nsf.n-topus.com</lookup:string>
<lookup:string id='sampleIdentifier'>oai:sample.toolbox.nsf.n-topus.com:01</lookup:string>
<lookup:string id='archiveURL'>http://www.n-topus.com/nsf/toolbox/sample/</lookup:string>
<lookup:string id="curator">Tom Myers, Alexander D. Nakhimovsky</lookup:string>
<lookup:string id="curatorTitle"></lookup:string>
<lookup:string id="curatorEmail">mailto:[email protected]</lookup:string>
<lookup:string id="institution">N-Topus Software</lookup:string>
<lookup:string id="institutionURL">http://www.n-topus.com/</lookup:string>
<lookup:string id="shortLocation">Hamilton, NY, USA</lookup:string>
<lookup:string id="location">56 Payne Street, Hamilton NY 13346, USA </lookup:string>
<lookup:string id="synopsis">This Virtual Toolbox-Sample Archive contains materials assembled from
Toolbox projects within subdirectories of a web application.
</lookup:string>
<lookup:string id="access">Access control is determined by server configuration (e.g., web.xml files). </lookup:string>
</lookup:strings>
One further customization concerns the value of the baseURL string. This value can be replaced with the dynamic URL that invokes the repository.xsl program. When you click the CreateOLAC button, it connects to this dynamic URL:
http://csproj2.colgate.edu:8080/tbox20060904/xslt.jsp?
xml=olac%2FOLAC_naskapi_db.xml&xsl=repository.xsl&
mimeType=text%2Fxml%3Bcharset%3DUTF-8
The hostname on that URL will naturally be your hostname, where you're running Tomcat. Consider the segment of the URL between the ? and the first & character. It specifies the subdirectory name and the name of your XML file that you saved in Step 1. If you replace this segment with your own values, and further replace the baseURL value with the revised dynamic URL, the static repository will change automatically any time the underlying XML file changes. (The underlying XML presumably will change when you edit your Toolbox .db file and go through Step 1, replacing the previous XML file.)
We are grateful to Joan Spanne's of SIL for sharing her work with us and to Gary Simons of SIA for comment and advice. Work on this paper was partially supported by the NSF grant #0553546 under the Documenting Endangered Languages program.