Multiple TextStart & TextEnd

Hello !

I’m trying to write a DA plugin to search some forums (Invision Power Board). However it doesn’t work very well… Part of the problem is that the HTML of a thread has the following form :

blablabla
Some fixed text before the firt message
blablabla

<!--BEGINNING OF POST-->
Some intersting text
<!--END OF POST-->
Some junk text

<!--BEGINNING OF POST-->
Some intersting text
<!--END OF POST-->
Some junk text

<!--BEGINNING OF POST-->
Some intersting text
<!--END OF POST-->
Some junk text

...

blablabla
Some fixed text after the last message
blablabla

At first I tried this setting :

  • TextStart=Some fixed text before the firt message
  • TextEnd=Some fixed text after the last message

But DA is confused by the repetitive junk between posts… So I tried this way :

  • TextStart=
  • TextEnd=

But now the results are really poor. My guess is that DA only inspects the first entry, and does not allow multiple TextStart and TextStop in a page…

Did I miss something (I’m a total newbie to DA !) ? Is there a workaround ? And if not… any chance to see it in a next release :unamused: ?

In this case Text{Start,End} don’t help you so you can just delete these from the plugin (don’t use empty values). Then DA will use the complete page for analysis.

As I’m having similar issues with a different project right now, maybe we should consider adding StartTag/EndTag options :wink: