goon's Journal http://use.perl.org/~goon/journal/ goon's use Perl Journal en-us use Perl; is Copyright 1998-2006, Chris Nandor. Stories, comments, journals, and other submissions posted on use Perl; are Copyright their respective owners. 2012-01-25T02:43:30+00:00 pudge pudge@perl.org Technology hourly 1 1970-01-01T00:00+00:00 goon's Journal http://use.perl.org/images/topics/useperl.gif http://use.perl.org/~goon/journal/ m$word-pdf, maestro and australia day http://use.perl.org/~goon/journal/17027?from=rss <p><b>doc-&gt;pdf cont.</b> </p><p>looks like I have <a href="http://www.winfield.demon.nl/">found</a> a possible quicker path to get word docs to pdf. (wonder if it can handle complex tables and embedded images?).</p><p><em>update:</em><br><a href="http://use.perl.org/~Matts">matts</a> (<a href="http://www.axkit.org/">axkit</a>) just left a message on my <a href="http://use.perl.org/comments.pl?sid=18070">use.perl journal</a> reporting that he recommends using <a href="http://www.easysw.com/htmldoc/">HTMLDOC</a> for converting html to pdf. It also has an <a href="http://www.axkit.org/">Axkit</a> plugin. I checked out the <a href="http://www.easysw.com/htmldoc/htmldoc-man.html">man</a> and <a href="http://www.easysw.com/htmldoc/faq.php">faq</a>. GNU licensing, perl bindings and end user support available.</p><p><b>maestro</b> </p><p>downloading <a href="http://mars.telascience.org/softwaredownload">maestro </a> (38.5Mb) to check out <a href="http://mars.telascience.org/data1.html">mars rover data</a>. reminds me of the 3d <a href="http://slashdot.org/comments.pl?sid=93883&amp;cid=8059781">pathfinder back in 97</a>. posted detailed comparison of <a href="http://slashdot.org/comments.pl?sid=94235&amp;cid=8085074">rover ground images of opportunity and spirit</a>.</p><p><b>australia day</b></p><p> <a href="http://www.theage.com.au/articles/2004/01/26/1075087943566.html">25 Jan</a>. all I can say is <em>crack a tube! (Sound of cans opening)</em> and <a href="http://orangecow.org/pythonet/sketches/bruces.htm">Australia, Australia, Australia, Australia, we love you amen!</a>.</p><p><em>philosophers song - monty pythons flying circus - Episode <a href="http://orangecow.org/pythonet/sketches/two.htm#22">22</a> </em><br>Immanuel Kant was a real piss-ant who was very rarely stable.<br>Heideggar, Heideggar was a boozy beggar who could think you under the table.<br>David Hume could out-consume Wilhelm Freidrich Hegel.<br>And Whittgenstein was a beery swine who was just as sloshed as Schlegel.<br>There's nothing Nieizsche couldn't teach 'ya 'bout the raising of the wrist.<br>Socrates, himself, was permanently pissed.<br>John Stewart Mill, of his own free will, after half a pint of shanty was particularly ill.<br>Plato, they say, could stick it away, half a crate of whiskey every day!<br>Aristotle, Aristotle was a bugger for the bottle,<br>And Hobbes was fond of his Dram.<br>And Rene Descartes was a drunken fart:<br>"I drink, therefore I am."<br>Yes, Socrates himself is particularly missed;<br>A lovely little thinker, but a bugger when he's pissed.</p> goon 2004-01-27T05:52:32+00:00 others complex HTML--&gt;???--&gt;PDF? http://use.perl.org/~goon/journal/16986?from=rss <p> <b> <a href="http://www.perlmonks.org/index.pl?node_id=323745">the problem</a> </b> </p><ul> <li><em>but the current Word doc (the catalogue) has tables, graphics and was 'built' with Word templates, so I have no idea how ell it would all convert.</em></li> </ul><p>the site that got me interested in pdf was <a href="http://www.stason.org/me.html">Stas Beckmans</a> site, <a href="http://www.stason.org/">www.stason.org</a>. He gave a talk to the <a href="http://melbourne.pm.org/">melbourne pm</a> last year. Through the course of his talk on mod_perl 2 he showed the notes from his site in html with pdf downloads of the site.</p><p>So I tried to re-create this html-&gt;ps-&gt;pdf so that I too could have a printable version of a project I'm working on called Ratpile (make a directory that has *stuff* stored in it searchable by stuffing information about it into a relational database - data mining some may call it.) using perl+DBI+<a href="http://www.template-toolkit.org/">TT2</a>. The template I created is a <b>*bare bones*</b> html page sans images. This is the technique Stas is using with his <a href="http://www.stason.org/works/modules.html#docset">docset</a>.</p><p>the point I guess I'm trying to make is I've used text only and not images. I've done a bit of research and this is what I've come up with...</p><ul> <li>graphics are supported in <a href="http://www.webopedia.com/TERM/P/PostScript.html">postscript</a> (3?)</li> <li>others better (<a href="http://perlmonks.thepen.com/84237.html">ybiC</a>) than I, have hacked together <a href="http://perlmonks.thepen.com/84237.html">html-&gt;PS-&gt;PDF</a> code and appears to handle images via html2ps but not html tables (Create PostScript and PDF versions of all HTML files in given directory )</li> <li>one approach could be to use <a href="http://use.perl.org/~Matts/">Matt Sergeants</a>, PDFLib (<a href="http://search.cpan.org/~msergeant/PDFLib-0.12/PDFLib.pm#load_image(...)">load_image method</a>) a oo wrapper around pdflib by <a href="http://www.pdflib.com/products/pdflib/">www.pdflib.com</a>. but I seem to remember has restrictions for use under <a href="http://www.opensource.org/">OSI</a> (has to be opensource, private use or researcher).</li> <li>or use <a href="http://search.cpan.org/search?query=Alfred+Reibenschuh+&amp;mode=all">Alfred Reibenschuhs</a> - Text::PDF::API where I found via an old page <a href="http://search.cpan.org/~areibens/PDF-API2-0.3r77/lib/PDF/API2.pm">PDF-API2-0</a> which has some image (jpg,png,handleing capabilities</li> <li> <a href="http://www.logreport.org/lire/roadmap-2.0.php">logreport</a> has an interesting set of observations about html-&gt;PDF generation. Namely problems with html formatting and tables</li> </ul><p> <b>building html-&gt;PDF with images and troublesome html tables</b> </p><p>now given what we have found above I would suggest the following (unless anyone has a better idea) of using:</p><ul> <li>extract word document to html</li> <li>extract table data (word document via OLE) or (via html via <a href="http://search.cpan.org/~msisk/HTML-TableExtract-1.08/lib/HTML/TableExtract.pm">Html-TableExtract</a> - like latter better.)</li> <li>remove html tables in html documents</li> <li>reinsert data into a simple table using &lt;pre&gt; tags for layout and html tags for bolding, emphasis. Or find some other method by experimentation in html for representing tables (text)</li> <li> <a href="http://search.cpan.org/~areibens/PDF-API2-0.3r77/lib/PDF/API2.pm">PDF-API2</a> as the PDF renderer. This can all be done in code.</li> </ul><p>the real problem maybe rendering the tables generated from word. complicated layout in word (re-rendered to html) will have to be modified to the postscript syntax then rendered to PDF. The problem is defined by converting the html tables to pdf.</p><p>it is not rocket science to create a bit of code to extract the data from the table, re-create a table using PDF-API (and its child modules).</p><p> <b>but is there a shorcut?</b> </p><p>of course you could forget all the above and take your chances with Michael Frankl's <a href="http://search.cpan.org/~mfrankl/HTML-HTMLDoc-0.07/lib/HTML/HTMLDoc.pm">HTML-HTMLDOC</a> and convert you html files directly to PDF<nobr> <wbr></nobr>:)</p><p>the rocket science bit is trying to get this to work on cygwin or win32</p><p> <b>credits</b> </p><p>damn I love <a href="http://search.cpan.org/">cpan</a>.</p> goon 2004-01-25T03:17:00+00:00 markup Iain Truskett RIP 2003-12-29 http://use.perl.org/~goon/journal/16893?from=rss <p>update: 13JAN2003<a href="http://www.oreillynet.com/pub/wlg/4163"> Andy Lester (oreillynet)</a> writes<nobr> <wbr></nobr>... <em>I never met him, or even spoke to him except through email and IRC, but I feel like he was a friend. He was always willing to help out when possible, and at least discuss an idea when not.</em> </p><p> <em> <a href="http://www.perlmonks.org/index.pl?node_id=320541">perlmonks today</a> - </em> <em> Iain Campbell Truskett (<a href="http://use.perl.org/~koschei/">"Spoon"/"Koschei"</a>) RIP 2003-12-29 According to his girlfriend's blog, Koschei passed away from a cardiac arrest at the age of 24. He was a significant contributor to the CPAN, and frequently interacted with us on the Perl IRC channels. His wiki is still up for perusal. He will be missed. Rest In Peace, my friend. -- Randal L. Schwartz, Perl hacker </em> </p><p> <a href="http://www.google.com/search?hl=en&amp;ie=UTF-8&amp;oe=utf-8&amp;q=Iain+Truskett&amp;spell=1">Iain</a> will be missed. It was only <a href="http://slashdot.org/~goon/journal/51980">last November</a> that he gave up some time to present a talk on DateTime at the <a href="http://melbourne.pm.org/">melbourne.au PM</a>.</p><p>Iain talked in detail how the development was proceeding on the Datetime modules and how he spent the last 8 months hard work <em>crafting</em> ("hacking" is a poor choice of words. Conceptually DateTime is not for the weak of mind). The talk was well presented, informative and most of all amusing (especially the bit on the evolution of the <a href="http://search.cpan.org/dist/DateTime-Calendar-FrenchRevolutionary/">French Revolutionary Calander</a>)</p><p>I will certainly be thinking of him when I'm using any of his modules. His legacy is more than mere code.</p> goon 2004-01-18T22:09:21+00:00 newsnews