NOTE: use Perl; is on undef hiatus. You can read content, but you can't post it. More info will be forthcoming forthcomingly.
All the Perl that's Practical to Extract and Report
Stories, comments, journals, and other submissions on use Perl; are Copyright 1998-2006, their respective owners.
Antiword may be of some help (Score:1)
I've used antiword [demon.nl] in the past for reading MS Word docs, but I don't know how well it reads tables. You might want to give it a try.
Re:Antiword may be of some help (Score:2)
Awesome!!! This is entirely feasible! Thank you!
The tables come out into elements called <informaltable>. I can parse that XML, extract those, and convert them. In fact it looks like this is better than going to Excel because going to Excel provides several "phantom" blank cells which I have to ignore in my current program.
I'm not sure if I'm going to have to do this specific file again, but there's a good chance I might, and if I do I will attempt to program this process. If I don't for this file, I know I will again for another. So at some point there may be a table extractor utility available for everyone to use.
J. David works really hard, has a passion for writing good software, and knows many of the world's best Perl programmers
Reply to This
Parent
Re:Antiword may be of some help (Score:1)
Glad it's working out for you. I haven't used antiword in over a year but it was very helpful when I needed it.