Url Finder

Posted by Harold Jeffries on 08 Feb, 2010 | Bids: 5

Description

A desktop app that searches for websites in the main search engines, then follows the URL through to the matching site.


We presently use a program written in Visual C++ (all source files are available). The original programmer is no longer alive. It works well but needs small modifications and better memory management. You may start from scratch or modify the original source files.




The core app:


Reading from editable text lists, the app visits the first listed search engine and searches for a listed keyword a given number of result pages deep, matching the results against the listed URL. If a match is found, it opens the link listed by the search engine in the embedded browser (unless it is a sponsored link).
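
As a rough illustration of the matching step, here is a minimal sketch of how a result link might be compared with a listed site. The helper names (toLower, hostOf, linkMatchesSite) are hypothetical and not taken from the existing source files. It assumes each result link is an absolute URL, and it does not try to detect sponsored links.

```cpp
#include <algorithm>
#include <cctype>
#include <string>

// Hypothetical helper: lower-case a copy of the string
// (host names are case-insensitive).
std::string toLower(std::string s) {
    std::transform(s.begin(), s.end(), s.begin(),
                   [](unsigned char c) { return static_cast<char>(std::tolower(c)); });
    return s;
}

// Extract the host from an absolute URL such as
// "http://www.mysite.com/page?x=1". Assumes no user:password@ prefix.
std::string hostOf(const std::string& url) {
    std::string::size_type start = url.find("://");
    start = (start == std::string::npos) ? 0 : start + 3;
    const std::string::size_type end = url.find_first_of("/?#:", start);
    const std::string::size_type len =
        (end == std::string::npos) ? std::string::npos : end - start;
    return toLower(url.substr(start, len));
}

// True when the result link points at `site` itself or at one of its
// subdomains ("www.mysite.com" matches "mysite.com", "notmysite.com" does not).
bool linkMatchesSite(const std::string& resultUrl, const std::string& site) {
    const std::string host = hostOf(resultUrl);
    const std::string wanted = toLower(site);
    if (host == wanted) return true;
    const std::string suffix = "." + wanted;
    return host.size() > suffix.size() &&
           host.compare(host.size() - suffix.size(), suffix.size(), suffix) == 0;
}
```

Comparing hosts rather than searching the whole link text avoids false matches where the site name only appears inside another site's path or query string.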


Random delays are set between searches and page visits, as search engines will block 'non-human' or 'bot' activity.


Hence the whole process needs to behave as much like a human user as possible.




Taking an example from the existing app, the editable files could look similar to this:


Search engines:


"http://www.bing.com/search?q=","&first=","",""


"http://uk.search.yahoo.com/search?p=","&b=","",""


"http://uk.altavista.com/web/results?q=","&stq=","",""




URLs:


mysite.com,articles/mypage.html,keyword,


anothersite.biz,,keyword,
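
A line from the URL list could be read roughly as follows. This is only a sketch: the Target struct and the function names are hypothetical, and the field order (site, page, keyword) is taken from the two example lines above. Stray spaces around a field are trimmed, and an empty page field is kept empty so the caller can treat it as "any page".

```cpp
#include <string>
#include <vector>

// Hypothetical record for one line of the URL list.
struct Target {
    std::string site;     // e.g. "mysite.com"
    std::string page;     // e.g. "articles/mypage.html"; empty = no page given
    std::string keyword;  // search term to use for this site
};

// Remove leading and trailing spaces and tabs.
std::string trimSpaces(const std::string& s) {
    const std::string::size_type first = s.find_first_not_of(" \t");
    if (first == std::string::npos) return "";
    const std::string::size_type last = s.find_last_not_of(" \t");
    return s.substr(first, last - first + 1);
}

// Parse "site,page,keyword," into `out`.
// Returns false when the site or the keyword is missing.
bool parseTarget(const std::string& line, Target& out) {
    std::vector<std::string> parts;
    std::string::size_type start = 0;
    while (true) {
        const std::string::size_type comma = line.find(',', start);
        const std::string::size_type len =
            (comma == std::string::npos) ? std::string::npos : comma - start;
        parts.push_back(trimSpaces(line.substr(start, len)));
        if (comma == std::string::npos) break;
        start = comma + 1;
    }
    if (parts.size() < 3 || parts[0].empty() || parts[2].empty()) return false;
    out.site = parts[0];
    out.page = parts[1];
    out.keyword = parts[2];
    return true;
}
```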




In the above example, the app would first search Bing XX pages deep for 'keyword', looking for mysite.com. If mysite.com is found, it follows the URL, then after XX seconds follows on to 'articles/mypage.html' on that site. After remaining there for XX seconds, the next search starts on Yahoo for the same site and keyword, followed by Altavista. Cookies and cache are then cleared. The app then starts again with Bing, searching the given 'keyword' for 'anothersite.biz', and so on. If the site URL is not found for any keyword after XX searches, the whole URL ('mysite.com') is searched for instead. If it is still not found, the app moves on to the next search.
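
The order of work described above could be sketched as a simple plan built from the two lists. All names here (SiteKeyword, Step, buildPlan) are hypothetical. The sketch assumes that cookies and cache are cleared once per site, after the last listed search engine, which is one reading of the description. The fallback to searching for the whole URL is left out.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical input: one site and the keyword to search for.
struct SiteKeyword {
    std::string site;
    std::string keyword;
};

// Hypothetical output: one search to run, in order.
struct Step {
    std::string engine;      // which search engine entry to use
    std::string site;        // site to look for in the results
    std::string query;       // term typed into the search engine
    bool clearCookiesAfter;  // true on the last engine for a site
};

// For each site in turn, visit every engine in list order.
std::vector<Step> buildPlan(const std::vector<std::string>& engines,
                            const std::vector<SiteKeyword>& targets) {
    std::vector<Step> plan;
    for (const SiteKeyword& t : targets) {
        for (std::size_t e = 0; e < engines.size(); ++e) {
            const bool lastEngine = (e + 1 == engines.size());
            plan.push_back({engines[e], t.site, t.keyword, lastEngine});
        }
    }
    return plan;
}
```

With three engines and two sites this yields six steps: Bing, Yahoo and Altavista for mysite.com, then the same three for anothersite.biz.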


NOTE: For 'anothersite.biz' the app would follow any page on that site at random, as no page is given.


The search engine and url lists will never be large (no more than 10 items long).


You will need to check the syntax used by the main search engines, though the entries above are correct.
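
For illustration, a search engine entry in the quoted, comma-separated form shown earlier might be read and turned into a results-page URL like this. It is a sketch under assumptions: the function names are hypothetical, the first field is taken to be the query prefix and the second the paging parameter, the meaning of the third and fourth fields is not stated in this description so they are simply kept, and the offset value each engine expects is left to the caller. The keyword is assumed to be URL-encoded already.

```cpp
#include <string>
#include <vector>

// Split a line such as "http://www.bing.com/search?q=","&first=","",""
// into its quoted fields, trimming stray spaces inside the quotes.
std::vector<std::string> parseQuotedFields(const std::string& line) {
    std::vector<std::string> fields;
    std::string::size_type pos = 0;
    while ((pos = line.find('"', pos)) != std::string::npos) {
        const std::string::size_type close = line.find('"', pos + 1);
        if (close == std::string::npos) break;  // unbalanced quote: stop
        const std::string field = line.substr(pos + 1, close - pos - 1);
        const std::string::size_type first = field.find_first_not_of(" \t");
        const std::string::size_type last = field.find_last_not_of(" \t");
        fields.push_back(first == std::string::npos
                             ? std::string()
                             : field.substr(first, last - first + 1));
        pos = close + 1;
    }
    return fields;
}

// Build a results-page URL. An offset of 0 means the first results page,
// so the paging parameter is left off.
std::string buildSearchUrl(const std::vector<std::string>& engine,
                           const std::string& keyword, int offset) {
    if (engine.size() < 2) return "";
    std::string url = engine[0] + keyword;
    if (offset > 0) url += engine[1] + std::to_string(offset);
    return url;
}
```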


We would manually edit the search engine and URL files, following the app's required syntax.




The control panel would have 'start', 'pause' and 'stop' buttons, and a place to enter these options:


1. Random delay, within XX seconds, between search engine queries when searching several result pages deep. (This setting also determines how long the app stays on the site's first page before continuing to the second page.)


2. Random or fixed delay option of XXX seconds to stay on the second page before starting a new search.
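
The two delay options might be modelled as below. This is a sketch, not the existing app's logic: the DelayOption struct and pickDelay function are hypothetical, and it assumes that a random delay is drawn uniformly between a minimum and a maximum number of seconds, while a fixed delay always uses the maximum.

```cpp
#include <chrono>
#include <random>

// Hypothetical settings entered in the control panel.
struct DelayOption {
    int minSeconds;  // lower bound, used only in random mode
    int maxSeconds;  // upper bound in random mode, exact value in fixed mode
    bool random;     // true = random delay, false = fixed delay
};

// Choose how long to wait before the next action.
std::chrono::seconds pickDelay(const DelayOption& opt, std::mt19937& rng) {
    if (!opt.random || opt.maxSeconds <= opt.minSeconds) {
        return std::chrono::seconds(opt.maxSeconds);
    }
    // uniform_int_distribution includes both end points.
    std::uniform_int_distribution<int> dist(opt.minSeconds, opt.maxSeconds);
    return std::chrono::seconds(dist(rng));
}
```

The returned value can be passed straight to std::this_thread::sleep_for, with the generator seeded once at start-up from std::random_device.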




The internet connection can sometimes drop for a couple of minutes, so the app should remain stable and carry on once the connection returns.
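
One simple way to ride out a short outage is to retry the failed step with a pause between attempts. The sketch below is an assumption about how that could look: retryUntilOnline is a hypothetical name, and `attempt` stands for whatever page load or connectivity check the app performs.

```cpp
#include <chrono>
#include <functional>
#include <thread>

// Call `attempt` until it reports success or `maxAttempts` is reached,
// sleeping for `wait` between tries. Returns true on success.
bool retryUntilOnline(const std::function<bool()>& attempt,
                      int maxAttempts,
                      std::chrono::milliseconds wait) {
    for (int i = 0; i < maxAttempts; ++i) {
        if (attempt()) return true;
        // No point sleeping after the final failed attempt.
        if (i + 1 < maxAttempts) std::this_thread::sleep_for(wait);
    }
    return false;
}
```

For an outage of a couple of minutes, something like 10 attempts at 30-second intervals would cover it; if every attempt fails, the app can skip to the next search rather than stop.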


It would be useful to spoof the user agent (though mimicking different user agents may be more difficult with an embedded browser).




If starting from scratch, you may want to consider a C++ desktop app with an embedded web browser, or EmbeddedWeb or WatiN browser automation.


I will be guided by your expert recommendation.




There are possible additions to this app, but first I need to find a developer who can build the app described above.


Project Bids

Expert, Created at

Getwebhelp, 08 Feb, 2010
Amitlogicnext, 08 Feb, 2010
Levilai, 08 Feb, 2010
Net4nuts, 09 Feb, 2010
Teamsp, 09 Feb, 2010

Project Buyer

User Name: Harold Jeffries

  • Member Since: 22 Jul, 2008
  • Projects Posted: 45
  • Total Purchased: $ 1,610 USD

Project Expert

User Name: Getwebhelp

  • Positive Rating: 100%
  • Projects Won: 17
  • Feedback: 11
  • Portfolio: 3
  • Earnings: $ 2,590 USD
