Hola, explorer. He estado probando yo con el resto de las páginas, y vale, en una de ellas
consigo sacar esto :
Using html4strict Syntax Highlighting
<h4>BeCode evo</h4>
<p>User-friendly handling through simple operation <a href="/index.php?option=com_content&view=article&id=2&Itemid=7&lang=en">more ...</a></p>
<div class="image-box">
<p><a href="/index.php?option=com_content&view=article&id=2&Itemid=7&lang=en"><img class="reflect rheight30 ropacity40" src="/images/stories/produkt-menu/becode-evo.jpg" border="0" width="202" height="218" style="border: 0pt none;" /></a></p>
</div><span class="article_separator"> </span> </li>
<li>
Coloreado en 0.002 segundos, usando
GeSHi 1.0.8.4
con este código:
Using perl Syntax Highlighting
while ($html =~ m{<h4>(.+?)</h4>.+?<p>(.+?)</p>}smg){
Coloreado en 0.001 segundos, usando
GeSHi 1.0.8.4
pero para sacar esto que lo que está entre <h2> "Tobe or not to be" es el título y <p> "all locking system..." es la descripción:
Using html4strict Syntax Highlighting
<h2 class="contentheading">
<img src="/templates/beloxx/images/h2-left_bg.png" alt="" title="" />
Products - To Be or not to Be <img src="/templates/beloxx/images/h2-right_bg.png" alt="" title="" />
</h2>
<p class="buttonheading">
</p>
<table border="0">
<tbody style="text-align: left;">
<tr style="text-align: left;">
<td style="text-align: left;" align="left" valign="top">
<div class="small">
<p>All locking systems in the Beloxx product range have one thing in common: They are different from standard locking systems because of the easy installation, well thought-out operation and consistent design. Furniture can be upgraded very attractively and locking can be absolutely easy.</p>
<p>Here, Beloxx builds on true innovations with the BeCode: Instead of keys, complex wiring and difficult programming, the well-known BeCode system is activated simply be turning and pressing. Up to 100.000 locking cycles with one battery, more than 1 million user-code variations and 10 billion master-code variations are possible.</p>
<p>The BeCode is compatible to almost all popular mechanical locking systems so that retrofitting and use in electronic variations are very easy. See for yourself!</p>
</div>
Coloreado en 0.002 segundos, usando
GeSHi 1.0.8.4
he utilizado:
Using perl Syntax Highlighting
while ($html =~ m{<h2 class="contentheading">(.+?)</h2>.+?<div class="samll"><p>(.+?)</p><\div>}smg){
Coloreado en 0.001 segundos, usando
GeSHi 1.0.8.4
pero no consigo sacar nada. ¿Está mal hecho el patrón de búsqueda, no?
y por último, para conseguir sacar el
link que hay en cada
<a href :
Using html4strict Syntax Highlighting
</table>
<div style="padding:0px; font-size:8px; "><br /></div>
<span style="font-size:8pt;">Klicken Sie auf die Produktbilder für mehr Informationen über die entsprechende Produktkategorie:</span><br />
<input name="ctl00$ctl00$CenterBlockContents$WebFormContents$dummy" type="text" onchange="javascript:setTimeout('__doPostBack(\'ctl00$ctl00$CenterBlockContents$WebFormContents$dummy\',\'\')', 0)" onkeypress="if (WebForm_TextBoxKeyHandler(event) == false) return false;" id="ctl00_ctl00_CenterBlockContents_WebFormContents_dummy" style="visibility:hidden; Z-INDEX: 101; POSITION: absolute; LEFT: 1px; TOP: 1px" />
<table id="Tabelle_01" width="731" height="904" border="0" cellpadding="0" cellspacing="0">
<tr>
<td>
<a href="/KatalogCdr.aspx?page=3&key=1047">
<img border="0" src="UebersichtsSeiten_2008/Uebersicht_Seite_3/Bilder/1047.jpg" width="149" height="106" alt=""></a></td>
<td colspan="5">
<a href="/KatalogCdr.aspx?page=3&key=1048">
<img border="0" src="UebersichtsSeiten_2008/Uebersicht_Seite_3/Bilder/1048.jpg" width="149" height="106" alt=""></a></td>
<td colspan="7">
<a href="/KatalogCdr.aspx?page=3&key=1062">
Coloreado en 0.002 segundos, usando
GeSHi 1.0.8.4
con esto no me lo saca bien porque me saca más cosas:
Using perl Syntax Highlighting
while ($html =~ m{<a href="(.+?)">(.+?)</a>}smg){
Coloreado en 0.001 segundos, usando
GeSHi 1.0.8.4