Hello, I’m not sure this is the right place to ask, but if anyone knows…
I’m trying to build a plugin to search the Scirus engine (scirus.com), because it provides relevant results for my academic searches.
For the moment I wrote this:
DA performs the query well, but what I don’t understand is that DA analyses only the result pages, not the pages they link to. And of course, my aim is for DA to analyse the abstract page of each paper in order to keep the relevant ones.
If any of you has an idea…
Thank you very much in advance,
PS: maybe it would be a good idea to provide a space on your website where users could share their plugins (maybe I didn’t find it?).
Yes, in fact Scirus is a web crawler run by ScienceDirect, so when it searches journals, it searches ScienceDirect content. I have also tried to write a plugin to search ScienceDirect directly, but their search URL seems to be encoded, because the query doesn’t appear anywhere in the URL.
Scirus also searches academic web pages (&ds=web), but for now my plugin asks it to search journals only (&ds=jnl).
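To make the journals-only switch concrete, here is a minimal Python sketch of how such a search URL could be assembled. Only the `ds` parameter and its values `jnl` and `web` come from the post above; the endpoint and the `query` parameter name are placeholders I made up for illustration, not the real Scirus URL.

```python
from urllib.parse import urlencode

# Hypothetical endpoint and "query" parameter name (assumptions);
# only "ds" and its values "jnl" / "web" are taken from the post.
BASE_URL = "https://search.example.invalid/search"


def build_search_url(terms, source="jnl"):
    """Build a search URL restricted to journals ("jnl") or web pages ("web")."""
    if source not in ("jnl", "web"):
        raise ValueError("source must be 'jnl' or 'web'")
    # urlencode escapes the search terms (spaces become '+').
    return BASE_URL + "?" + urlencode({"query": terms, "ds": source})


print(build_search_url("protein folding"))
# → https://search.example.invalid/search?query=protein+folding&ds=jnl
```

Switching `source` to `"web"` would give the academic web pages search instead.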
That’s right, I can see the same “No match” pages in the log. But in the Scirus search result pages displayed in the digest, the links are valid, and I don’t understand why none of them appears in the digest.
I would like DA to process the abstracts behind these links instead of processing the Scirus result pages.
I did that, but even at the highest level, DA still gives me pages of Scirus search results in the digest.
I don’t understand where I am supposed to use a wildcard. I do not have the menu you’re talking about; I have a follow link option in the ‘settings’ tab, but there is no place to enter text. Do you mean that I have to use ‘*’ in the query itself? (I’ve tried, but it changes nothing.)
Of course, but access to abstracts is free at ScienceDirect; you need a subscription only to download the articles.
I’ve noticed no difference between browsing Scirus with DA and with Safari. I’ve noticed just one thing: the links often don’t work the first time you click them. You get a warning telling you that the page doesn’t exist, and the second time it works. It seems to be a random problem, and it also happens in Safari.
But while saying that, I’m wondering whether that might be the problem…
Thank you in advance for your help,
I would like to apologize for getting my versions mixed up. The testers are showing up in the forums, and they are just as eager to share what they know as the rest of you are to see the new public release. There have been many alpha releases, with interface changes, bug fixes, and many, many features added. It is worth the wait.
I have simply downgraded temporarily to the current public release in order to see what is going on. I think there may be problems.
I have a couple of general observations:
What you are trying to do is search pages from a variety of sources. When you use the keys “LinksStart” and “LinksEnd”, they should be found in all of the pages to be searched. They are there for efficiency, and in your case you don’t have many choices because of the variety of pages. You may want to omit these keys.
You are using the key “LinksNotMatching” with two Google-specific terms you copied from an example. Delete this key, and add a key “LinksMatching” with a * wildcard as its value.
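To illustrate why leftover exclusion terms can hide links while a lone * lets everything through, here is a small Python sketch of include/exclude wildcard filtering. This is not how DA implements “LinksMatching” or “LinksNotMatching” internally (I don’t know that); the function name, the example URLs, and the `*cache:*` pattern are all made up for the demonstration.

```python
from fnmatch import fnmatchcase


def keep_link(url, matching=None, not_matching=None):
    """Return True if url passes the include/exclude wildcard filters."""
    # An exclusion pattern wins: any hit drops the link.
    if not_matching and any(fnmatchcase(url, p) for p in not_matching):
        return False
    # With inclusion patterns, at least one must match.
    if matching:
        return any(fnmatchcase(url, p) for p in matching)
    return True


# Hypothetical links, for illustration only.
links = [
    "http://journal.example.invalid/abstract/123",
    "http://engine.example.invalid/search?q=cache:abc",
]

# A leftover exclusion pattern silently drops the second link...
print([u for u in links if keep_link(u, not_matching=["*cache:*"])])
# ...while a lone "*" inclusion pattern keeps every link.
print([u for u in links if keep_link(u, matching=["*"])])
```

The first print shows only the abstract link; the second shows both.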
Now, the bad news. I may be wrong about this.
If you examine the links, each one goes through a redirect to the source.
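For anyone who wants to check this outside DA, here is a rough Python sketch that pulls the destination out of a redirect-style link. It assumes the target is carried in a query parameter named `url`, which is purely a guess for illustration; the actual Scirus links may be built differently, and the example link is made up.

```python
from urllib.parse import urlparse, parse_qs


def redirect_target(link, param="url"):
    """Return the destination embedded in a redirect link, or None."""
    # parse_qs also decodes the percent-encoding of the embedded URL.
    values = parse_qs(urlparse(link).query).get(param)
    return values[0] if values else None


# Hypothetical redirect link; the "url" parameter name is an assumption.
link = (
    "http://engine.example.invalid/redirect"
    "?url=http%3A%2F%2Fpublisher.example.invalid%2Fabstract%2F42"
)
print(redirect_target(link))
# → http://publisher.example.invalid/abstract/42
```

If the redirect is done server-side with no target in the link itself, this returns None and the page has to be fetched to find out where it leads.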
After running a number of tests, when I examine the “Log”, there is only one page for each site, and it is cut off. For example:
I had the same conflict with the American Chemical Society publication service. When I tried a search from within DA using a crawler, they blocked my (legitimate) access by IP, with a similar notification message.