This document describes the steps needed to load data onto Bibliomania.com.
Loading data requires coordinated action by three teams; this document describes those actions.
The data is captured either manually or by OCR by the Text Preparation Team (TPT). The books are chosen, marked up, and checked by the Editorial Team (ET).
ET decide on the book and on which elements of the particular publication are to be excluded (e.g. notes from commentators).
The book is marked up on each page, crossing out page numbers and other unused items.
ET notify Bibliomania Data Board.
Note that the headword search relies upon bold text being marked up with the strong tag, not the B tag.
We need to put this in place.
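One way to put this in place is to rewrite B tags as strong tags before loading. The following is only a minimal sketch under stated assumptions: the function name `b_to_strong` is hypothetical, the markup is assumed to be simple enough for a regular expression, and it is not part of the existing Bibliomania tooling.

```python
import re

# Matches <b>, </b>, <B ...> etc., but not other tags that merely
# start with "b" such as <br> or <body>.
B_TAG_RE = re.compile(r"<(/?)b(\s[^>]*)?>", re.IGNORECASE)


def b_to_strong(html: str) -> str:
    """Return html with every B tag replaced by the equivalent strong tag.

    Hypothetical helper: attributes on the original tag are preserved.
    """
    def replace(match: "re.Match[str]") -> str:
        slash = match.group(1)             # "/" for a closing tag, else ""
        attributes = match.group(2) or ""  # any attributes, else ""
        return f"<{slash}strong{attributes}>"

    return B_TAG_RE.sub(replace, html)
```

Running such a pass over a chapter file before the import would leave line breaks and other tags untouched while making headwords visible to the search.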
The txt file should contain two or three columns separated by tabs.
filename | short chapter title | long chapter title (optional) |
0001.html | Chapter 1 | In the beginning |
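The file layout above can be checked before loading. The sketch below is illustrative only: `ChapterEntry` and `parse_chapter_list` are hypothetical names, and it assumes blank lines and a trailing tab are harmless, which this document does not state.

```python
from typing import List, NamedTuple, Optional


class ChapterEntry(NamedTuple):
    filename: str
    short_title: str
    long_title: Optional[str]  # the third column is optional


def parse_chapter_list(text: str) -> List[ChapterEntry]:
    """Parse tab-separated chapter lines into ChapterEntry records."""
    entries = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        if not line.strip():
            continue  # assumption: blank lines are ignored
        fields = [field.strip() for field in line.split("\t")]
        # Assumption: empty trailing fields (from a trailing tab) are dropped.
        while fields and fields[-1] == "":
            fields.pop()
        if len(fields) not in (2, 3):
            raise ValueError(
                f"line {line_no}: expected 2 or 3 columns, got {len(fields)}"
            )
        long_title = fields[2] if len(fields) == 3 else None
        entries.append(ChapterEntry(fields[0], fields[1], long_title))
    return entries
```

A line with only one column, or with more than three, raises a `ValueError` naming the offending line, which makes a malformed file easy to spot before the import is attempted.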
$bib.url($bib.book(1541))
$bib.url($bib.book(1541).chapter(130))
$bib.url($db.AuthorTable.Object(238))
$bib.anchorURL($bib.book(1541).chapter(130), "p282_4")
Note that when linking to a named anchor in a later chapter, the encache step needs to be run twice, as initially the named anchor cannot be found. (See 27046
You can insert ANY raw TeX in files between <SPAN CLASS=tex> and </SPAN> tags. So to keep a paragraph intact use:
<SPAN CLASS=tex>\nobreak</SPAN><span class="footnote" number="1">Footnote text</span>
Footnote numbers should run sequentially within a chapter and should not exceed 50 in a chapter.
The first thing in a chapter should not be a footnote. If you really need to place a footnote as the first thing then add an empty paragraph before the footnote.
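The footnote rules above lend themselves to an automated check. The sketch below is a rough illustration, not an existing tool: `check_footnotes` is a hypothetical name, it assumes numbering starts at 1, and it assumes the footnote span is written exactly as in the example (lower-case or upper-case `span`, `class` before `number`). The "first thing in a chapter" test only looks at the very start of the text, so it is an approximation.

```python
import re
from typing import List

# Matches the opening tag of a footnote span and captures its number.
FOOTNOTE_RE = re.compile(
    r'<span\s+class="footnote"\s+number="(\d+)"', re.IGNORECASE
)

MAX_FOOTNOTES = 50  # limit stated in this document


def check_footnotes(chapter_html: str) -> List[str]:
    """Return a list of problems found; an empty list means none found."""
    problems = []
    numbers = [int(m.group(1)) for m in FOOTNOTE_RE.finditer(chapter_html)]

    # Assumption: footnotes are numbered 1, 2, 3, ... in document order.
    for position, number in enumerate(numbers, start=1):
        if number != position:
            problems.append(f"footnote {position} is numbered {number}")
            break  # report only the first break in the sequence

    if numbers and max(numbers) > MAX_FOOTNOTES:
        problems.append(f"footnote number exceeds {MAX_FOOTNOTES}")

    # Approximation: the chapter text begins directly with a footnote span.
    if FOOTNOTE_RE.match(chapter_html.lstrip()):
        problems.append("chapter starts with a footnote")

    return problems
```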
bibimport <operation> <recurse> <table> <id | _> [flushafter|flushduring] >errors 2>wmerrors
where
operation = number
recurse = recurse | norecurse | skipChaps
table = section, author, book, or chapter
id = id number of sec/auth/book/chap in question (_ = all)
flush = flag to determine when to flush memory
The operation number is calculated by adding up a combination of the operation numbers below.
encache | 1 |
index | 2 |
paginate | 4 |
keydottxt | 8 |
For instance,
bibimport 1 recurse section 1
bibimport 15 recurse book 522 flushduring
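The operation number is a sum of distinct powers of two, so it behaves as a set of bit flags. The sketch below illustrates the arithmetic only; the `Operation` enum and the helper names are hypothetical and are not part of bibimport itself.

```python
from enum import IntFlag
from typing import List


class Operation(IntFlag):
    """Operation values as listed in the table in this document."""
    ENCACHE = 1
    INDEX = 2
    PAGINATE = 4
    KEYDOTTXT = 8


def operation_number(*operations: Operation) -> int:
    """Combine operations into the single number passed to bibimport."""
    total = Operation(0)
    for op in operations:
        total |= op  # same as adding, since each value is a distinct bit
    return int(total)


def describe(number: int) -> List[str]:
    """List the operation names that a given operation number selects."""
    return [op.name.lower() for op in Operation if number & op]
```

With these definitions, `operation_number(Operation.ENCACHE, Operation.INDEX, Operation.PAGINATE, Operation.KEYDOTTXT)` gives 15, the value used for book 522 above, and `describe(1)` gives `['encache']`, matching the section 1 example.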
Most recent CVS $Author: timp $ @paneris.org
The important points in the life of this document are listed below (for detailed change history consult its CVS log).
The CVS log for this document is:
$Log: DataLoadingProcedures.html,v $
Revision 1.5 2003/11/18 20:09:18 timp
Add note about all records in bibimport
Revision 1.4 2001/07/28 14:41:06 timp
Tidy up
Revision 1.3 2001/07/27 02:50:00 timp
Sort out messagebURLs
Revision 1.2 2001/07/27 02:11:24 timp
Add notes from messageboards
Revision 1.1 2001/06/11 23:41:34 timp
Data Loading procedures
Revision 1.0 2001/03/08 16:47:46 TimP
First version