<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE rss [<!ENTITY % HTMLlat1 PUBLIC "-//W3C//ENTITIES Latin 1 for XHTML//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml-lat1.ent">]>
<rss version="0.92" xml:base="http://www.hjournal.org">
<channel>
 <title>HyperJournal Web Site - Documentation</title>
 <link>http://www.hjournal.org/taxonomy/term/1/0</link>
 <description></description>
 <language>en</language>
<item>
 <title>Books</title>
 <link>http://www.hjournal.org/books</link>
 <description>&lt;h2&gt;Help us by buying books&lt;/h2&gt;
We receive a small percentage of what you spend on the books listed here, provided you buy them directly after clicking the link. If you are going to buy one of these books, please do so via these links. It costs you nothing extra: the price stays the same, and Amazon pays us a commission. Thank you!

&lt;h3&gt;Open Access, Scholarly Publishing, Digital Libraries&lt;/h3&gt;

&lt;iframe src="http://rcm.amazon.com/e/cm?t=hyperjournal-20&amp;o=1&amp;p=8&amp;l=as1&amp;asins=3540257918&amp;fc1=000000&amp;IS2=1&amp;lt1=_blank&amp;lc1=0000ff&amp;bc1=ffffff&amp;bg1=ffffff&amp;npa=1&amp;f=ifr" style="width:120px;height:240px;" scrolling="no" marginwidth="0" marginheight="0" frameborder="0"&gt;&lt;/iframe&gt;

&lt;iframe src="http://rcm.amazon.com/e/cm?t=hyperjournal-20&amp;o=1&amp;p=8&amp;l=as1&amp;asins=0918006813&amp;fc1=000000&amp;IS2=1&amp;lt1=_blank&amp;lc1=0000ff&amp;bc1=ffffff&amp;bg1=ffffff&amp;npa=1&amp;f=ifr" style="width:120px;height:240px;" scrolling="no" marginwidth="0" marginheight="0" frameborder="0"&gt;&lt;/iframe&gt;

&lt;iframe src="http://rcm.amazon.com/e/cm?t=hyperjournal-20&amp;o=1&amp;p=8&amp;l=as1&amp;asins=1594076707&amp;fc1=000000&amp;IS2=1&amp;lt1=_blank&amp;lc1=0000ff&amp;bc1=ffffff&amp;bg1=ffffff&amp;npa=1&amp;f=ifr" style="width:120px;height:240px;" scrolling="no" marginwidth="0" marginheight="0" frameborder="0"&gt;&lt;/iframe&gt;

&lt;iframe src="http://rcm.amazon.com/e/cm?t=hyperjournal-20&amp;o=1&amp;p=8&amp;l=as1&amp;asins=3540240306&amp;fc1=000000&amp;IS2=1&amp;lt1=_blank&amp;lc1=0000ff&amp;bc1=ffffff&amp;bg1=ffffff&amp;npa=1&amp;f=ifr" style="width:120px;height:240px;" scrolling="no" marginwidth="0" marginheight="0" frameborder="0"&gt;&lt;/iframe&gt;
</description>
 <pubDate>Thu, 23 Mar 2006 16:21:35 +0100</pubDate>
</item>
<item>
 <title>Training Courses</title>
 <link>http://www.hjournal.org/training</link>
 <description>&lt;p&gt;The HyperJournal Association offers hands-on training courses on HyperJournal. We currently offer two types of course:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Creating and managing a journal with HyperJournal (for a non-technical audience).&lt;/li&gt;
&lt;li&gt;Installing, configuring and customizing HyperJournal.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The following is a list of past and upcoming courses. If you would like us to organize a course at your site, please contact us at info@hjournal.org.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;17 Feb 2006, Pisa, Italy. Both courses in Italian.
&lt;ul&gt;
&lt;li&gt;&lt;a href="training1"&gt;Program&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;&lt;a href="training1_pictures"&gt;Pictures and training materials (slides and exercises)&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Upcoming: &lt;/strong&gt;Late April 2006, Italy, city to be decided (would you like to host a course? Write to us at info@hjournal.org)&lt;/li&gt;
&lt;/ul&gt;
</description>
 <pubDate>Tue, 28 Feb 2006 12:45:26 +0100</pubDate>
</item>
<item>
 <title>Italian translation of the user manual</title>
 <link>http://www.hjournal.org/node/75</link>
 <description>&lt;p&gt;An Italian translation of "HyperJournal 4 Dummies. A beginner's guide to HyperJournal" has been released today. The Italian version, "HyperJournal 4 Dummies. Una guida per principianti", can be downloaded from the documentation section of the web site.&lt;/p&gt;
</description>
 <pubDate>Tue, 21 Feb 2006 16:44:39 +0100</pubDate>
</item>
<item>
 <title>Translating HyperJournal</title>
 <link>http://www.hjournal.org/doc_dev/translating</link>
 <description>HyperJournal uses a &lt;a href="http://smarty.php.net/manual/en/config.files.php"&gt;Smarty configuration file&lt;/a&gt; to handle translations.
A config file is loaded at the beginning of each template: &lt;pre&gt;{config_load file="$lang.cfg"}&lt;/pre&gt; 
&lt;p&gt;
HyperJournal's Smarty templates contain
identifiers that are replaced with localized strings before the output is sent to the browser.
&lt;/p&gt;
&lt;p&gt;
There is one localization file per language, containing the translations of all HyperJournal identifiers. The name of the localization file is the lowercase &lt;a href="http://en.wikipedia.org/wiki/ISO_3166-1_alpha-2"&gt;ISO 3166-1 alpha-2 country code&lt;/a&gt; followed by &lt;i&gt;.cfg&lt;/i&gt; (e.g. &lt;i&gt;en.cfg&lt;/i&gt; for English, &lt;i&gt;it.cfg&lt;/i&gt; for Italian), and the file is placed in the i18n directory.
&lt;/p&gt;

&lt;h3&gt;Creating a localization file and adding a new language&lt;/h3&gt;
To localize HyperJournal, create a .cfg file and place it in the i18n/ directory. HyperJournal will then load your localization file, and the new language can be selected in &lt;i&gt;control panel--&gt;journal&lt;/i&gt;. We suggest starting from an existing localization file and changing the identifiers' values to match your language.
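&lt;p&gt;When starting from an existing file, it can help to check which identifiers a new localization file still lacks. The following is only a rough sketch in Python and is not part of HyperJournal: it assumes simple one-line &lt;i&gt;identifier = value&lt;/i&gt; entries, and the identifiers &lt;i&gt;title&lt;/i&gt; and &lt;i&gt;login&lt;/i&gt; are made up for the example.&lt;/p&gt;

```python
def parse_cfg(text):
    """Collect identifier/value pairs from one-line 'identifier = value' entries."""
    entries = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines, comments and section headers.
        if not line or line.startswith("#") or line.startswith("["):
            continue
        key, sep, value = line.partition("=")
        if sep:
            entries[key.strip()] = value.strip()
    return entries


def missing_identifiers(reference_text, translated_text):
    """Return identifiers defined in the reference file but not in the translation."""
    reference = parse_cfg(reference_text)
    translated = parse_cfg(translated_text)
    return sorted(set(reference) - set(translated))


# Hypothetical file contents; real files would be read from the i18n/ directory.
en_cfg = 'title = "Home"\nlogin = "Log in"\n'
it_cfg = 'title = "Pagina iniziale"\n'
print(missing_identifiers(en_cfg, it_cfg))
```

&lt;p&gt;Any identifier printed by the sketch would still need a translated value in the new file.&lt;/p&gt;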
&lt;h3&gt;Anatomy of a localization string&lt;/h3&gt;
A localization string must fit on a single line: it cannot contain carriage returns.


&lt;h3&gt;HTML character entities&lt;/h3&gt;
All accented and special characters must be escaped with HTML character entities.
Some characters must be escaped to avoid conflict with XHTML markup: angle brackets (&amp;lt; and &amp;gt;), and ampersands (&amp;amp;). In addition, there are a few other characters that must be escaped, such as non-breaking spaces (&amp;nbsp;), angle quotes (&amp;laquo; and &amp;raquo;), curly apostrophes (&amp;#8217;) and curly quotes.
For more information about the W3C's best practices involving character encodings and character entities, see the following references:
&lt;ul&gt;
&lt;li&gt;&lt;a href="http://www.w3.org/TR/2004/WD-i18n-html-tech-char-20040509/#IDAPNGO"&gt;http://www.w3.org/TR/2004/WD-i18n-html-tech-char-20040509/#IDAPNGO&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.w3.org/International/tutorials/tutorial-char-enc/#exceptional"&gt;http://www.w3.org/International/tutorials/tutorial-char-enc/#exceptional&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
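&lt;p&gt;As an illustration of the escaping described above, the short Python sketch below converts a plain string into an escaped form. It is an assumption on our part that numeric character references are acceptable wherever a named entity would be; the function name is hypothetical and the sketch is not part of HyperJournal.&lt;/p&gt;

```python
import html


def escape_for_cfg(text):
    """Escape markup characters, then turn non-ASCII characters into numeric references."""
    # html.escape handles the ampersand and the angle brackets.
    escaped = html.escape(text, quote=False)
    # xmlcharrefreplace rewrites every non-ASCII character as a numeric character reference.
    return escaped.encode("ascii", "xmlcharrefreplace").decode("ascii")


# The accented letter is written as a Unicode escape to keep this example pure ASCII.
print(escape_for_cfg("Perch\u00e9 no?"))
```

&lt;p&gt;The markup characters are escaped first, so that the ampersands introduced by the numeric references are not escaped a second time.&lt;/p&gt;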
&lt;p&gt;
&lt;b&gt;Caution:&lt;/b&gt; Be sure to save your localization file in UTF-8 encoding!
&lt;/p&gt;</description>
 <pubDate>Sat, 30 Apr 2005 01:43:48 +0200</pubDate>
</item>
<item>
 <title>Citation Extraction: extracting text from PDF/PS: result table</title>
 <link>http://www.hjournal.org/aci/citationextraction/pdfps2text/table</link>
 <description>&lt;table border="1"&gt;

&lt;tr&gt;
&lt;th&gt;Case&lt;/th&gt;
&lt;th&gt;Program&lt;/th&gt;
&lt;th&gt;Version&lt;/th&gt;
&lt;th&gt;License&lt;/th&gt;
&lt;th&gt;Input&lt;/th&gt;
&lt;th&gt;Output&lt;/th&gt;
&lt;th&gt;Accented letters&lt;/th&gt;
&lt;th&gt;Fidelity&lt;/th&gt;
&lt;th&gt;Separates paragraphs&lt;/th&gt;
&lt;th&gt;Separates pages&lt;/th&gt;
&lt;th&gt;Spaces and indentation&lt;/th&gt;
&lt;th&gt;Font size&lt;/th&gt;
&lt;th&gt;Italic, bold&lt;/th&gt;
&lt;th&gt;Multi-column text&lt;/th&gt;
&lt;th&gt;Absolute position of elements&lt;/th&gt;
&lt;th&gt;Execution mode, files produced&lt;/th&gt;
&lt;/tr&gt;

&lt;tr&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;pdftotext&lt;/td&gt;
&lt;td&gt;xpdf v 3.00&lt;/td&gt;
&lt;td&gt;GPL&lt;/td&gt;
&lt;td&gt;PDF&lt;/td&gt;
&lt;td&gt;plain&lt;/td&gt;
&lt;td&gt;Yes (ISO-8859), but not for TeX&lt;/td&gt;
&lt;td&gt;May lose characters&lt;/td&gt;
&lt;td&gt;No, everything is run together, including headings&lt;/td&gt;
&lt;td&gt;^L&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;Text in sequential order, but it can get confused&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;Produces a txt file: nome.pdf -&gt; nome.txt&lt;/td&gt;
&lt;/tr&gt;

&lt;tr&gt;
&lt;td&gt;2&lt;/td&gt;
&lt;td&gt;pdftotext -layout&lt;/td&gt;
&lt;td&gt;xpdf v 3.00&lt;/td&gt;
&lt;td&gt;GPL&lt;/td&gt;
&lt;td&gt;PDF&lt;/td&gt;
&lt;td&gt;plain&lt;/td&gt;
&lt;td&gt;Yes (ISO-8859), but not for TeX&lt;/td&gt;
&lt;td&gt;May introduce spurious lines and
spaces within words&lt;/td&gt;
&lt;td&gt;No, separates them only when they are vertically and uniformly spaced
(unreliable)&lt;/td&gt;
&lt;td&gt;^L&lt;/td&gt;
&lt;td&gt;Yes&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;The column layout is preserved&lt;/td&gt;
&lt;td&gt;Approximate (plain text with layout)&lt;/td&gt;
&lt;td&gt;Produces a txt file: nome.pdf -&gt; nome.txt.&lt;/td&gt;
&lt;/tr&gt;

&lt;tr&gt;
&lt;td&gt;3&lt;/td&gt;
&lt;td&gt;pdftohtml -nomerge -noframes&lt;/td&gt;
</description>
 <pubDate>Fri, 22 Apr 2005 16:05:53 +0200</pubDate>
</item>
<item>
 <title>Citation Extraction: extracting text from PDF/PS</title>
 <link>http://www.hjournal.org/aci/citationextraction/pdfps2text</link>
 <description>&lt;h2&gt;The candidates&lt;/h2&gt;

&lt;h3&gt;pdftotext (from xpdf v 3.00) (GPL)&lt;/h3&gt;

&lt;pre&gt;
$ pdftotext articolo.pdf
&lt;/pre&gt;

&lt;p&gt;
It preserves no layout information apart from page breaks
(^L), which can be removed with the &lt;code&gt;-nopgbrk&lt;/code&gt; option. All lines and
paragraphs are printed one after another.
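&lt;p&gt;Since the page breaks are the only structure left in this output, they can at least be used to recover the pages. A minimal Python sketch, assuming the text file produced by pdftotext has already been read into a string and that each page ends with a form feed character (^L); the function name is hypothetical:&lt;/p&gt;

```python
def split_pages(text):
    """Split pdftotext output into pages at form feed characters (^L)."""
    pages = text.split("\f")
    # A trailing form feed leaves an empty last chunk, which is dropped here.
    if pages and not pages[-1].strip():
        pages.pop()
    return pages


# Made-up sample standing in for the contents of articolo.txt.
sample = "first page\n\fsecond page\n\f"
print(len(split_pages(sample)))
```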

&lt;pre&gt;
$ pdftotext -layout articolo.pdf
&lt;/pre&gt;

&lt;p&gt;
Some layout information is preserved
(line breaks, but not paragraph breaks, and indentation). It is still
difficult to identify footnotes.

&lt;h3&gt;pdftohtml 0.36 (GPL)&lt;/h3&gt;

&lt;pre&gt;
$ pdftohtml -nomerge -noframes articolo.pdf
&lt;/pre&gt;

&lt;p&gt;
Returns HTML without paragraphs, with layout information
comparable to that of pdftotext -layout. In fact, it does not
even preserve the different text sizes.

&lt;pre&gt;
$ pdftohtml -nomerge -noframes -c articolo.pdf
&lt;/pre&gt;

&lt;p&gt;
Returns "complex" HTML. The PDF layout is recreated as
accurately as possible, but through absolute positioning of every
line and a background image on every page. Excerpt from the output:

&lt;pre&gt;
&amp;lt;DIV style=&amp;quot;position:absolute;top:669;left:191&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft2&amp;quot;&amp;gt;BIBLIOGRAFIA &amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;

&amp;lt;DIV style=&amp;quot;position:absolute;top:700;left:191&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft2&amp;quot;&amp;gt; &amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;
&amp;lt;DIV style=&amp;quot;position:absolute;top:731;left:234&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft2&amp;quot;&amp;gt;B&amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;
&amp;lt;DIV style=&amp;quot;position:absolute;top:735;left:246&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft3&amp;quot;&amp;gt;INKOFSKI&amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;
&amp;lt;DIV style=&amp;quot;position:absolute;top:731;left:312&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft2&amp;quot;&amp;gt;,&amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;
&amp;lt;DIV style=&amp;quot;position:absolute;top:735;left:317&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft3&amp;quot;&amp;gt; &amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;
&amp;lt;DIV style=&amp;quot;position:absolute;top:731;left:321&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft2&amp;quot;&amp;gt;F.&amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;
&amp;lt;DIV style=&amp;quot;position:absolute;top:735;left:336&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft3&amp;quot;&amp;gt; &amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;
&amp;lt;DIV style=&amp;quot;position:absolute;top:731;left:340&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft2&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;
&amp;lt;DIV style=&amp;quot;position:absolute;top:735;left:349&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft3&amp;quot;&amp;gt; &amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;

&amp;lt;DIV style=&amp;quot;position:absolute;top:731;left:353&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft2&amp;quot;&amp;gt;B&amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;
&amp;lt;DIV style=&amp;quot;position:absolute;top:735;left:365&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft3&amp;quot;&amp;gt;UCCINO&amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;
&amp;lt;DIV style=&amp;quot;position:absolute;top:731;left:420&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft2&amp;quot;&amp;gt;,&amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;
&amp;lt;DIV style=&amp;quot;position:absolute;top:735;left:425&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft3&amp;quot;&amp;gt; &amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;
&amp;lt;DIV style=&amp;quot;position:absolute;top:731;left:429&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft2&amp;quot;&amp;gt;G.&amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;
&amp;lt;DIV style=&amp;quot;position:absolute;top:735;left:446&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft3&amp;quot;&amp;gt; &amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;
&amp;lt;DIV style=&amp;quot;position:absolute;top:731;left:451&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft2&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;
&amp;lt;DIV style=&amp;quot;position:absolute;top:735;left:460&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft3&amp;quot;&amp;gt; &amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;
&amp;lt;DIV style=&amp;quot;position:absolute;top:731;left:464&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft2&amp;quot;&amp;gt;P&amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;

&amp;lt;DIV style=&amp;quot;position:absolute;top:735;left:474&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft3&amp;quot;&amp;gt;OSSE&amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;
&amp;lt;DIV style=&amp;quot;position:absolute;top:731;left:509&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft2&amp;quot;&amp;gt;,&amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;
&amp;lt;DIV style=&amp;quot;position:absolute;top:735;left:513&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft3&amp;quot;&amp;gt; &amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;
&amp;lt;DIV style=&amp;quot;position:absolute;top:731;left:518&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft2&amp;quot;&amp;gt;S. et al. (1999), &amp;lt;i&amp;gt;
A frontal &amp;lt;/i&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;
&amp;lt;DIV style=&amp;quot;position:absolute;top:762;left:191&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft4&amp;quot;&amp;gt;&amp;lt;i&amp;gt;parietal-circuit 
for object manipulation in man: evidence from an &amp;lt;/i&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;
&amp;lt;DIV style=&amp;quot;position:absolute;top:793;left:191&amp;quot;&amp;gt;&amp;lt;nobr&amp;gt;&amp;lt;span class=&amp;quot;ft4&amp;quot;&amp;gt;&amp;lt;i&amp;gt;fMRI study&amp;lt;/i&amp;gt;, 
Eur. Journal of Neuroscience, 11, 3276-3286. &amp;lt;/span&amp;gt;&amp;lt;/nobr&amp;gt;&amp;lt;/DIV&amp;gt;&lt;/pre&gt;

&lt;h3&gt;ps2ascii 8.01 (from GhostScript package) (GPL)&lt;/h3&gt;

&lt;pre&gt;
$ ps2ascii articolo.pdf &gt; articolo.txt
&lt;/pre&gt;

&lt;p&gt;
It preserves nothing of the layout, but &lt;em&gt;it recognizes paragraphs and marks them with
2 newlines&lt;/em&gt;. Unfortunately, this works on some documents and fails on others.

&lt;h3&gt;pstotext 1.9 of 2003-01-09&lt;/h3&gt;

&lt;pre&gt;
$ pstotext articolo.ps &gt; articolo.txt
&lt;/pre&gt;

&lt;p&gt;
It does not separate paragraphs.

&lt;pre&gt;
$ pstotext -bboxes articolo.ps &gt; articolo.txt
&lt;/pre&gt;

&lt;p&gt;
Returns one line per word, with the bounding box of the word.
Example:
&lt;pre&gt;
   147     359     225     373  BIBLIOGRAFIA

   173     320     224     334  BINKOFSKI,
   228     320     236     334  F.
   240     320     245     334  --
   248     320     292     334  BUCCINO,
   296     320     306     334  G.
   309     320     314     334  --
   318     320     348     334  POSSE,
   351     320     359     334  S.
   364     320     371     334  et
   375     320     385     334  al.
   389     320     421     334  (1999),
&lt;/pre&gt;
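&lt;p&gt;Output of this kind can be regrouped into text lines. The Python sketch below is only an illustration: it assumes the four numbers are the left, bottom, right and top coordinates of the box (an interpretation of the sample above, not something we verified), that words of one text line share exactly the same bottom coordinate, and that larger vertical coordinates are higher on the page. The function names are hypothetical.&lt;/p&gt;

```python
from collections import defaultdict

# A few lines copied from the sample output above.
SAMPLE = """\
   147     359     225     373  BIBLIOGRAFIA

   173     320     224     334  BINKOFSKI,
   228     320     236     334  F.
   240     320     245     334  --
   248     320     292     334  BUCCINO,
   296     320     306     334  G.
"""


def parse_bboxes(text):
    """Parse lines made of four integers followed by a word."""
    words = []
    for line in text.splitlines():
        parts = line.split(None, 4)
        # Skip blank lines and anything that does not start with four integers.
        if len(parts) != 5 or not all(p.isdigit() for p in parts[:4]):
            continue
        left, bottom, right, top = (int(p) for p in parts[:4])
        words.append((left, bottom, right, top, parts[4]))
    return words


def group_into_lines(words):
    """Group words sharing a bottom coordinate and order them left to right."""
    lines = defaultdict(list)
    for left, bottom, right, top, word in words:
        lines[bottom].append((left, word))
    # Larger bottom coordinate first: assumed to be higher on the page.
    ordered = sorted(lines.items(), reverse=True)
    return [" ".join(word for _, word in sorted(items)) for _, items in ordered]


for text_line in group_into_lines(parse_bboxes(SAMPLE)):
    print(text_line)
```

&lt;p&gt;Grouping by an exact coordinate is deliberately simplistic; real documents would probably need a tolerance on the vertical position.&lt;/p&gt;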

&lt;h3&gt;PreScript 0.1 and 2.2&lt;/h3&gt;

&lt;p&gt;They are available &lt;a href="http://www.nzdl.org/technology/prescript.html"&gt;here&lt;/a&gt;.
The text converter seems to separate paragraphs fairly well (with newlines), 
but it cannot handle accented characters or any kind of special character.
The HTML converter is unremarkable; it does more or less what &lt;i&gt;pdftohtml&lt;/i&gt; 
does in normal mode.

&lt;h3&gt;ps2html_beta&lt;/h3&gt;

&lt;p&gt;
I downloaded &lt;i&gt;ps2html_beta&lt;/i&gt; from 
&lt;a href="http://www.csd.uch.gr/~nikop/thesis.html"&gt;this site&lt;/a&gt;.

&lt;p&gt;
It is very promising (it even claims to recognize footnotes),
but it has to be made to work first. It does not run, probably because
the program dates from 1999: it gives an error when invoking GhostScript.

&lt;br/&gt;&lt;br/&gt;

&lt;h2&gt;Exhaustive tests and results&lt;/h2&gt;

&lt;p&gt;
I tried 10 use cases of the programs above
on a set of 5 PDF documents that differ in how they were created
(PDF rendered from DOC documents, TeX documents, etc.), in content
(footnotes rather than a bibliography) and in structure
(one of them was laid out in two columns). Conversion to PostScript, when
needed, was done with the &lt;i&gt;pdftops&lt;/i&gt; program.

&lt;p&gt;
Here is the
&lt;a href="aci/citationextraction/pdfps2text/table"&gt;
 results table&lt;/a&gt;.

&lt;br/&gt;&lt;br/&gt;

&lt;h2&gt;Conclusions&lt;/h2&gt;

&lt;p&gt;
So far, the best combination seems to be:
&lt;ul&gt;
&lt;li&gt;
&lt;i&gt;pdftohtml -nomerge -noframes -c&lt;/i&gt; to extract the text and the information on font sizes and positioning;
&lt;/li&gt;
&lt;li&gt;
&lt;i&gt;prescript html&lt;/i&gt; to separate paragraphs and to handle multi-column text.
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;All of these tools are released under the GPL license.
</description>
 <pubDate>Wed, 27 Apr 2005 12:12:03 +0200</pubDate>
</item>
<item>
 <title>Autonomous Citation Indexing: Bibliography</title>
 <link>http://www.hjournal.org/aci/bibliography</link>
 <description>&lt;h3&gt;Citeseer&lt;/h3&gt;
&lt;p&gt;
&lt;a href="http://www.citeseer.org"&gt;Citeseer&lt;/a&gt; is an archive of computer
science articles that automatically maintains and updates the
graph of the citations found in those articles.
&lt;p&gt;
Citeseer is capable of both citation extraction and citation matching.

&lt;p&gt;
An introduction to the system, with a description of the principles used for
recognizing citations, is
&lt;a href="http://citeseer.ist.psu.edu/aci-computer/aci-computer99.html"&gt;
here&lt;/a&gt;.
&lt;p&gt;
A more in-depth introduction is
&lt;a href="http://citeseer.ist.psu.edu/lawrence99indexing.html"&gt;
 Steve Lawrence, Kurt Bollacker, C. Lee Giles,
Indexing and Retrieval of Scientific Literature (1999)
&lt;/a&gt;.

An article that describes the architecture and the algorithms
used by CiteSeer in more detail is
&lt;a href="http://citeseer.ist.psu.edu/bollacker98citeseer.html"&gt;
 Kurt D. Bollacker, Steve Lawrence, C. Lee Giles,
CiteSeer: An Autonomous Web Agent for Automatic Retrieval 
and Identification of Interesting Publications (1998)
&lt;/a&gt;. This article gives a fairly good description of the algorithm used
to determine whether several citations refer to the same article.

&lt;p&gt;The algorithm used for citation matching is described in
&lt;a href="http://citeseer.ist.psu.edu/lawrence99autonomous.html"&gt;
Autonomous Citation Matching (1999)
&lt;/a&gt;.

&lt;h3&gt;&lt;a href="http://cscl.iis.sinica.edu.tw/documents/bobby/Extracting Citation Metadata from Online Publication Lists Using BLAST.pdf"&gt;
</description>
 <pubDate>Tue, 28 Feb 2006 14:55:52 +0100</pubDate>
</item>
<item>
 <title>Citation extraction project page</title>
 <link>http://www.hjournal.org/aci/citationextraction</link>
 <description>&lt;b&gt;Citation extraction&lt;/b&gt; works by transforming various document formats (PDF, PostScript, DOC) into an internal representation, and then extracting the citations from the document text.

&lt;p&gt;&lt;a href="/aci/citationextraction/pdfps2text"&gt;Extracting text from PDF/PS&lt;/a&gt;.

&lt;p&gt;&lt;a href="http://85.39.151.142/~pier/"&gt;Here&lt;/a&gt;'s a repository to gather statistics about documents collected from the Internet. Data from these documents will serve to improve the algorithms and to test their effectiveness.</description>
 <pubDate>Fri, 06 Oct 2006 18:27:14 +0200</pubDate>
</item>
<item>
 <title>Autonomous Citation Indexing</title>
 <link>http://www.hjournal.org/aci</link>
 <description>&lt;p&gt;
&lt;b&gt;Autonomous Citation Indexing&lt;/b&gt; for HyperJournal will add two features to the article submission process: &lt;i&gt;citation extraction&lt;/i&gt; and &lt;i&gt;citation matching&lt;/i&gt;.

&lt;h3&gt;Definitions&lt;/h3&gt;
&lt;dl&gt;
&lt;dt&gt;Citation extraction&lt;/dt&gt;
&lt;dd&gt;Retrieval of all the bibliographic references from a non-structured document. Typically documents contain references in a separate section (the Bibliography) or in the footnotes.&lt;/dd&gt;
&lt;dt&gt;Citation matching&lt;/dt&gt;
&lt;dd&gt;Mapping a bibliographic reference to the resource it points to. This is done by comparing relevant information about the reference against the base of known bibliographic entries.
&lt;/dd&gt;
&lt;/dl&gt;
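&lt;p&gt;To make the idea of citation matching more concrete, here is a deliberately naive Python sketch. It is not the algorithm planned for HyperJournal nor the one used by any existing system: it simply compares a normalized reference string against each known entry with the general-purpose similarity measure from Python's &lt;i&gt;difflib&lt;/i&gt; module. The function names and the sample reference string are hypothetical.&lt;/p&gt;

```python
from difflib import SequenceMatcher


def normalize(reference):
    """Lowercase and keep only letters, digits and single spaces."""
    cleaned = "".join(ch if ch.isalnum() else " " for ch in reference.lower())
    return " ".join(cleaned.split())


def best_match(reference, known_entries):
    """Return (entry, score) for the known entry most similar to the reference.

    known_entries must not be empty.
    """
    target = normalize(reference)
    scored = [
        (SequenceMatcher(None, target, normalize(entry)).ratio(), entry)
        for entry in known_entries
    ]
    score, entry = max(scored)
    return entry, score


# A small base of known bibliographic entries.
known = [
    "Lawrence, Bollacker, Giles. Indexing and Retrieval of Scientific Literature (1999)",
    "Bollacker, Lawrence, Giles. CiteSeer: An Autonomous Web Agent for Automatic "
    "Retrieval and Identification of Interesting Publications (1998)",
]
# A reference as it might appear, formatted differently, in a submitted article.
reference = "S. Lawrence, K. Bollacker, C. L. Giles, Indexing and retrieval of scientific literature, 1999"
entry, score = best_match(reference, known)
print(entry)
```

&lt;p&gt;A real matcher would probably compare separate fields (authors, title, year) rather than whole strings, and would reject matches whose score is too low.&lt;/p&gt;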

&lt;p&gt;
These should be thought of as separate tasks; a separate project page is maintained for each of the two:

&lt;p&gt;
&lt;a href="/aci/citationextraction"&gt;Citation extraction project page&lt;/a&gt;.

&lt;p&gt;
A bibliography consisting of commented articles about existing work is &lt;a href="aci/bibliography"&gt;here&lt;/a&gt;.
</description>
 <pubDate>Tue, 26 Apr 2005 15:03:41 +0200</pubDate>
</item>
<item>
 <title>Features</title>
 <link>http://www.hjournal.org/features</link>
 <description>&lt;h3&gt;&lt;i&gt;Version 0.3&lt;/i&gt;&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Graphic design which is user-friendly and easily customisable&lt;/li&gt;
&lt;li&gt;&lt;a href="features/#context"&gt;Dynamic contextualization&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Automatic Backup&lt;/li&gt;
&lt;li&gt;Anonymous article submission&lt;/li&gt;
&lt;li&gt;&lt;a href="features/#peer"&gt;Anonymous on-line peer-review&lt;/a&gt;&lt;/li&gt; 
&lt;li&gt;Category and issue management&lt;/li&gt;
&lt;li&gt;&lt;a href="features/#committees"&gt;An unlimited number of scientific and editorial committees&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;A wide range of permitted file formats (which can be restricted according to your needs): XML, HTML, LaTeX, MS Word, PDF, PS, JPEG, GIF, PNG.&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;&lt;i&gt;Forthcoming features (version 0.4)&lt;/i&gt;&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="features/#oai"&gt;Compliance with the Open Archive OAI-PMH protocol&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;A customisable editing-publishing stage&lt;/li&gt;
&lt;li&gt;Conversion of XML or LaTeX articles into HTML or PDF.&lt;/li&gt;
&lt;li&gt;Entirely modifiable style sheets for visualizing and printing articles.&lt;/li&gt;
&lt;li&gt;Validation and article conversion tools for registered users.&lt;/li&gt; 
&lt;/ul&gt;
&lt;a href="http://www.hjournal.org/overview"&gt;Hyperjournal Overview&lt;/a&gt;&lt;br/&gt;
&lt;hr size="1" /&gt;

&lt;a name="context"&gt;&lt;h4&gt;&lt;font size="3"&gt;Dynamic Contextualization&lt;/font&gt;&lt;/h4&gt;&lt;/a&gt;
Dynamic Contextualization is a P2P tool, based on semantic web technologies, which lets readers see, while reading an article, all the articles it cites and all the articles that cite it.
</description>
 <pubDate>Wed, 13 Jul 2005 10:29:39 +0200</pubDate>
</item>
<item>
 <title>How to Help</title>
 <link>http://www.hjournal.org/howtohelp</link>
 <description>&lt;p&gt;
HyperJournal is a collaborative, open source software development project. As such, we welcome contributions from other developers.
&lt;/p&gt;
&lt;p&gt;
We'd like to note, however, that &lt;b&gt;you don't need to be a programmer to contribute.&lt;/b&gt; Here is a list of other ways in which you can help:
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="files/brochure_hj.pdf"&gt;Download the Brochure&lt;/a&gt;, print it out and distribute it to your friends and colleagues!!! &lt;/li&gt;
&lt;li&gt;Send us your translations of the documentation, user interface or this site&lt;/li&gt; 
&lt;li&gt;Submit bug reports&lt;/li&gt;
&lt;li&gt;Let us know in the &lt;a href="http://www.hjournal.org/mailinglist"&gt;mailing list&lt;/a&gt; or at info (at) hjournal (dot) org what you find most confusing about the software, or which parts of the user interface could be improved, so we can make it easier to use&lt;/li&gt;
&lt;li&gt;Submit suggestions for making the documentation clearer in the &lt;a href="http://www.hjournal.org/mailinglist"&gt;mailing list&lt;/a&gt; or to info (at) hjournal (dot) org.&lt;/li&gt; 
&lt;li&gt;Suggest improvements or useful features. We recommend posting them to the &lt;a href="http://www.hjournal.org/mailinglist"&gt;mailing list&lt;/a&gt;, so others can have a chance to add their input&lt;/li&gt; 
&lt;li&gt;Write additional documentation / user guides&lt;/li&gt;
&lt;li&gt;Drop a line and let us know how the software is being used and where&lt;/li&gt; 
&lt;li&gt;Help us advertise the software by telling others in your industry or organization about it. More users means more potential contributors.&lt;/li&gt; 
&lt;/ul&gt;
</description>
 <pubDate>Sat, 18 Feb 2006 18:07:59 +0100</pubDate>
</item>
<item>
 <title>Demo</title>
 <link>http://www.hjournal.org/demo</link>
 <description>&lt;h1&gt;
&lt;font color="red"&gt;

The demo is temporarily off-line due to a server change. We apologize for the inconvenience. The demo will be available again as soon as possible.
&lt;/font&gt;
&lt;/h1&gt;

&lt;!--
&lt;h3&gt;Version 0.5 beta&lt;/h3&gt;
&lt;a href="http://85.39.151.142/hj05b/" target="_blank"&gt;Full Featured Demo (0.5 beta "Snowdomes")&lt;/a&gt; (Free access to all hyperjournal including the administrative area, tough some features are disabled. The content is automatically restored once a day. So if you find it screwed-up try tomorrow, you may have better luck! :-))
&lt;br /&gt;&lt;br /&gt;
&lt;h3&gt;Version 0.4 stable&lt;/h3&gt;
&lt;a href="http://85.39.151.142/hj04stable/" target="_blank"&gt;Full Featured Demo (0.4 stable "fever")&lt;/a&gt; (Free access to all hyperjournal including the administrative area, tough some features are disabled. The content is automatically restored once a day. So if you find it screwed-up try tomorrow, you may have better luck! :-))
--&gt;</description>
 <pubDate>Mon, 15 Oct 2007 14:22:53 +0200</pubDate>
</item>
<item>
 <title>Bugtracker</title>
 <link>http://www.hjournal.org/bugtrack</link>
 <description>To submit &lt;b&gt;bugs and suggestions&lt;/b&gt; on how to improve HyperJournal, use the &lt;a href="http://85.39.151.139/hj_bugtrack/"&gt;bugtracker&lt;/a&gt;. 
The bugtracker login page is at &lt;a href="http://85.39.151.139/hj_bugtrack/"&gt;http://85.39.151.139/hj_bugtrack/&lt;/a&gt;.
Thank you!</description>
 <pubDate>Fri, 06 May 2005 11:10:47 +0200</pubDate>
</item>
<item>
 <title>Developers</title>
 <link>http://www.hjournal.org/doc_dev</link>
 <description>&lt;p&gt;&lt;strong&gt;Code, Code Documentation, Developers Tutorial&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt; &lt;a href="/doc_dev/translating"&gt;Translating HyperJournal&lt;/a&gt; &lt;/li&gt;
&lt;li&gt; &lt;a href="http://85.39.151.140/~hj/cgi-bin/viewcvs/" target="_blank"&gt;CVS&lt;/a&gt; web access&lt;/li&gt;
&lt;li&gt;&lt;a href="http://85.39.151.140/~michele/" target="_blank"&gt;API&lt;/a&gt; Beta Documentation&lt;/li&gt;
&lt;/ul&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Database schema&lt;/strong&gt; (images are in .emf format) [&lt;a href="http://80.19.150.243/~didonato/hjournal/DB.html" target="_blank"&gt;html&lt;/a&gt;]&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Use cases, Class Diagrams, Interaction Diagrams&lt;/strong&gt; [&lt;a href="http://80.19.150.243/~didonato/hjournal/use.html" target="_blank"&gt;html&lt;/a&gt;]&lt;/li&gt;
&lt;li&gt;G. Tummarello et al., &lt;strong&gt;Contextualization Engine description&lt;/strong&gt; [&lt;a href="http://www.hyperl.org/files/15-RDFGROWth_workshopISWC2004.pdf" target="_blank"&gt;PDF&lt;/a&gt;]&lt;/li&gt;
&lt;/ul&gt;
</description>
 <pubDate>Fri, 06 May 2005 11:08:04 +0200</pubDate>
</item>
<item>
 <title>Howtos</title>
 <link>http://www.hjournal.org/doc_howto</link>
 <description>&lt;p&gt;On this page you will find some preliminary Howtos:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Documentation Howto [&lt;a href="http://80.19.150.243/~didonato/hjournal/dochowto.html" target="_blank"&gt;it&lt;/a&gt; - en]&lt;/li&gt;
&lt;li&gt;Collaborate: guidelines [&lt;a href="http://80.19.150.243/~didonato/hjournal/istruzioni_autori_it.html" target="_blank"&gt;it&lt;/a&gt; - en]&lt;/li&gt;
&lt;li&gt;Tools for writing Documentation [&lt;a href="http://80.19.150.243/~didonato/hjournal/jaxe.html" target="_blank"&gt;it&lt;/a&gt; - en]&lt;/li&gt;
&lt;/ul&gt;
</description>
 <pubDate>Sun, 14 Nov 2004 19:20:12 +0100</pubDate>
</item>
</channel>
</rss>
