<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: HTML Data Extraction By perl scripting in Operating System - HP-UX</title>
    <link>https://community.hpe.com/t5/operating-system-hp-ux/html-data-extraction-by-perl-scripting/m-p/3943352#M760667</link>
    <description>Really amazing... you have done a great job. I am very thankful to you.&lt;BR /&gt;For now it's really enough.&lt;BR /&gt;I think I will get more work on this script. Keep in touch.&lt;BR /&gt;Thanks to all.</description>
    <pubDate>Wed, 14 Feb 2007 01:10:19 GMT</pubDate>
    <dc:creator>Dodo_5</dc:creator>
    <dc:date>2007-02-14T01:10:19Z</dc:date>
    <item>
      <title>HTML Data Extraction By perl scripting</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/html-data-extraction-by-perl-scripting/m-p/3943343#M760658</link>
      <description>I have an HTML report file, attached (only a part of the whole report is attached).&lt;BR /&gt;I just want to separate the data, so that the first line reads:&lt;BR /&gt;&lt;BR /&gt;NHTEST-3848498958-NHTEST-10.2-no-baloo a&lt;BR /&gt;and so on for the whole report.&lt;BR /&gt;&lt;BR /&gt;How can I separate the data from tables in that kind of format using Perl (or Unix) scripting?&lt;BR /&gt;&lt;BR /&gt;Please help, guys. Please write the script as a whole; otherwise it will be difficult for me to understand.&lt;BR /&gt;It's urgent, please.</description>
      <pubDate>Tue, 13 Feb 2007 01:51:21 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/html-data-extraction-by-perl-scripting/m-p/3943343#M760658</guid>
      <dc:creator>Dodo_5</dc:creator>
      <dc:date>2007-02-13T01:51:21Z</dc:date>
    </item>
    <item>
      <title>Re: HTML Data Extraction By perl scripting</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/html-data-extraction-by-perl-scripting/m-p/3943344#M760659</link>
      <description>Hey&lt;BR /&gt;&lt;BR /&gt;You can make another snapshot:&lt;BR /&gt;9i:  @?/rdbms/admin/spreport&lt;BR /&gt;10g: @?/rdbms/admin/awrrpt.sql&lt;BR /&gt;&lt;BR /&gt;Then choose text as the output format; this is easier to modify.&lt;BR /&gt;&lt;BR /&gt;Regards</description>
      <pubDate>Tue, 13 Feb 2007 03:18:36 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/html-data-extraction-by-perl-scripting/m-p/3943344#M760659</guid>
      <dc:creator>Oviwan</dc:creator>
      <dc:date>2007-02-13T03:18:36Z</dc:date>
    </item>
    <item>
      <title>Re: HTML Data Extraction By perl scripting</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/html-data-extraction-by-perl-scripting/m-p/3943345#M760660</link>
      <description>Hi,&lt;BR /&gt;surely TCS has the resource/experience to do this ?!&lt;BR /&gt;&lt;BR /&gt;Take the report in HTML format and pull out the table rows (marked by TR and /TR).&lt;BR /&gt;Then remove all HTML markers; what you have left is the table data without HTML markers.&lt;BR /&gt;&lt;BR /&gt;Quick check on the web:&lt;BR /&gt;&lt;A href="http://www.thescripts.com/forum/thread49414.html" target="_blank"&gt;http://www.thescripts.com/forum/thread49414.html&lt;/A&gt;&lt;BR /&gt;&lt;A href="http://www.wdvl.com/Authoring/Languages/Perl/PerlfortheWeb/summarizer.html" target="_blank"&gt;http://www.wdvl.com/Authoring/Languages/Perl/PerlfortheWeb/summarizer.html&lt;/A&gt;&lt;BR /&gt;&lt;A href="http://www.unix.org.ua/orelly/perl/cookbook/ch20_07.htm" target="_blank"&gt;http://www.unix.org.ua/orelly/perl/cookbook/ch20_07.htm&lt;/A&gt;&lt;BR /&gt;&lt;A href="http://cpan.uwinnipeg.ca/htdocs/HTML-Strip/HTML/Strip.html" target="_blank"&gt;http://cpan.uwinnipeg.ca/htdocs/HTML-Strip/HTML/Strip.html&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Please also read:&lt;BR /&gt;&lt;A href="http://forums1.itrc.hp.com/service/forums/helptips.do?#33" target="_blank"&gt;http://forums1.itrc.hp.com/service/forums/helptips.do?#33&lt;/A&gt; on how to reward any useful answers given to your questions.&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Tue, 13 Feb 2007 03:20:05 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/html-data-extraction-by-perl-scripting/m-p/3943345#M760660</guid>
      <dc:creator>Peter Godron</dc:creator>
      <dc:date>2007-02-13T03:20:05Z</dc:date>
    </item>
    <item>
      <title>Re: HTML Data Extraction By perl scripting</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/html-data-extraction-by-perl-scripting/m-p/3943346#M760661</link>
      <description>When I tried to run the scripts, I got this error:&lt;BR /&gt;Can't locate HTML/TableExtract.pm in @INC (@INC contains: /usr/lib/perl5/5.8.5/i386-linux-thread-multi &lt;BR /&gt;&lt;BR /&gt;Actually, I don't have admin rights on the machine.&lt;BR /&gt;Can you please help me write a Perl script to extract data from the table in an HTML file that exists on my PC (read as a local file, not fetched from a URL)?</description>
      <pubDate>Tue, 13 Feb 2007 04:18:46 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/html-data-extraction-by-perl-scripting/m-p/3943346#M760661</guid>
      <dc:creator>Dodo_5</dc:creator>
      <dc:date>2007-02-13T04:18:46Z</dc:date>
    </item>
    <item>
      <title>Re: HTML Data Extraction By perl scripting</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/html-data-extraction-by-perl-scripting/m-p/3943347#M760662</link>
      <description>Please go through the source of the HTML file and send me a solution. It's urgent.&lt;BR /&gt;It's a part of the whole report.</description>
      <pubDate>Tue, 13 Feb 2007 06:50:41 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/html-data-extraction-by-perl-scripting/m-p/3943347#M760662</guid>
      <dc:creator>Dodo_5</dc:creator>
      <dc:date>2007-02-13T06:50:41Z</dc:date>
    </item>
    <item>
      <title>Re: HTML Data Extraction By perl scripting</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/html-data-extraction-by-perl-scripting/m-p/3943348#M760663</link>
      <description>Hi:&lt;BR /&gt;&lt;BR /&gt;&amp;gt; ...how to seperate the datas from tables that kind of format with the use of perl(or unix )scripting.  i dont have administrator rights in my pc.so pls send script without having such commands(which needs admin privelege)...please help guys..write the script as a whole pls. otherwise it will be difficult to understand for me&lt;BR /&gt;its urgent plsss...&lt;BR /&gt;&lt;BR /&gt;Without payment for doing your job, I don't think anyone is going to write a solution that earns you your pay.&lt;BR /&gt;&lt;BR /&gt;use Perl;&lt;BR /&gt;&lt;BR /&gt;That said, however, you don't need administrator rights to install modules locally in directories to which you have write access.&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://www.cpan.org/modules/INSTALL.html" target="_blank"&gt;http://www.cpan.org/modules/INSTALL.html&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;As for parsing the HTML, you should look at modules like HTML::Parser, HTML::FormatText, and HTML::LinkExtor, just to name a few.  Fetch what you need from CPAN:&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://www.cpan.org/" target="_blank"&gt;http://www.cpan.org/&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Regards!&lt;BR /&gt;&lt;BR /&gt;...JRF...&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Tue, 13 Feb 2007 07:47:09 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/html-data-extraction-by-perl-scripting/m-p/3943348#M760663</guid>
      <dc:creator>James R. Ferguson</dc:creator>
      <dc:date>2007-02-13T07:47:09Z</dc:date>
    </item>
    <item>
      <title>Re: HTML Data Extraction By perl scripting</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/html-data-extraction-by-perl-scripting/m-p/3943349#M760664</link>
      <description>Hi, &lt;BR /&gt;&lt;BR /&gt;I'd like to second James's statement about doing your chores.&lt;BR /&gt;Parsing tagged markup is a bit more involved, especially if it's not well formed.&lt;BR /&gt;But there are standard Perl HTML parsers for the task.&lt;BR /&gt;Basically there seem to be two avenues.&lt;BR /&gt;Either use HTML::TreeBuilder&lt;BR /&gt;&lt;A href="http://search.cpan.org/~petek/HTML-Tree-3.23/lib/HTML/TreeBuilder.pm" target="_blank"&gt;http://search.cpan.org/~petek/HTML-Tree-3.23/lib/HTML/TreeBuilder.pm&lt;/A&gt;&lt;BR /&gt;or HTML::TokeParser&lt;BR /&gt;&lt;A href="http://search.cpan.org/~gaas/HTML-Parser-3.56/lib/HTML/TokeParser.pm" target="_blank"&gt;http://search.cpan.org/~gaas/HTML-Parser-3.56/lib/HTML/TokeParser.pm&lt;/A&gt;&lt;BR /&gt;If you can afford it, I would suggest trying both modules to get a sense of the different ways HTML can be treated.&lt;BR /&gt;Of course, almost every scripting language should have HTML parsers for this purpose.</description>
      <pubDate>Tue, 13 Feb 2007 07:58:38 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/html-data-extraction-by-perl-scripting/m-p/3943349#M760664</guid>
      <dc:creator>Ralph Grothe</dc:creator>
      <dc:date>2007-02-13T07:58:38Z</dc:date>
    </item>
    <item>
      <title>Re: HTML Data Extraction By perl scripting</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/html-data-extraction-by-perl-scripting/m-p/3943350#M760665</link>
      <description>Thanks, but I was expecting a little bit more from you experts.</description>
      <pubDate>Tue, 13 Feb 2007 07:59:58 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/html-data-extraction-by-perl-scripting/m-p/3943350#M760665</guid>
      <dc:creator>Dodo_5</dc:creator>
      <dc:date>2007-02-13T07:59:58Z</dc:date>
    </item>
    <item>
      <title>Re: HTML Data Extraction By perl scripting</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/html-data-extraction-by-perl-scripting/m-p/3943351#M760666</link>
      <description>Ok Dodo,&lt;BR /&gt;without any guarantee that this will be of any value, I tinkered up the tiny attached script, which uses Perl and the HTML::TokeParser module.&lt;BR /&gt;You will have to check for yourself what exactly your HTML looks like and what you really need to parse.&lt;BR /&gt;E.g. my script produces this:&lt;BR /&gt;&lt;BR /&gt;$ ./shp.pl&lt;BR /&gt;NHTEST&lt;BR /&gt;3848498958&lt;BR /&gt;NHTEST&lt;BR /&gt;1&lt;BR /&gt;10.2.0.2.0&lt;BR /&gt;NO&lt;BR /&gt;baloo_a&lt;BR /&gt;Begin Snap:&lt;BR /&gt;1728&lt;BR /&gt;02-Feb-07 20:00:35&lt;BR /&gt;20&lt;BR /&gt;3.1</description>
      <pubDate>Tue, 13 Feb 2007 10:04:19 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/html-data-extraction-by-perl-scripting/m-p/3943351#M760666</guid>
      <dc:creator>Ralph Grothe</dc:creator>
      <dc:date>2007-02-13T10:04:19Z</dc:date>
    </item>
    <item>
      <title>Re: HTML Data Extraction By perl scripting</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/html-data-extraction-by-perl-scripting/m-p/3943352#M760667</link>
      <description>Really amazing... you have done a great job. I am very thankful to you.&lt;BR /&gt;For now it's really enough.&lt;BR /&gt;I think I will get more work on this script. Keep in touch.&lt;BR /&gt;Thanks to all.</description>
      <pubDate>Wed, 14 Feb 2007 01:10:19 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/html-data-extraction-by-perl-scripting/m-p/3943352#M760667</guid>
      <dc:creator>Dodo_5</dc:creator>
      <dc:date>2007-02-14T01:10:19Z</dc:date>
    </item>
  </channel>
</rss>

