Show Bid Request
Domain Specific WebSearch Engine
Bid Request Id: 46039
|
|
|
Posted by: |
StarSE (1 ratings)
(Software buyer rating 10)
|
Non-action Ratio: |
Below Average - 50.00%
|
Posted: |
Jan 30, 2003 2:06:18 PM EDT
|
Bidding Closes: |
Feb 13, 2003 2:12:26 PM EDT
|
Viewed (by coders): |
226 times
|
Deadline: |
2/20/2003
TIME EXPIRED
|
|
|
|
Description:
Name: Web Search Engine Softwares: Java, C, C++, Perl, any software Database: Access, mysql
Main pieces of the Project 1. Inverted List Index 2. Store Index (built in Step1) in database or files 3. Searching Index built in step 1 4. Ranking Algorithm 5. Input Screen and Parsing 6. Display Output
Brief Explanantion: Project Description: Domain specific search engine.
5. Input Screen: Accept input from user and parse the string extract keywords to search. 1. Inverted List Index: This would be like a nightly process. Build a index database of all the keywords of the domain. Could restrict domain to one website like www.biology.edu. Have to use Inverted list index algorithms to build the index. 2. Store Index: The above built index has to be stored in a database or on a file. Sort it and keep it ready for the search process to aquire results 3. Searching Index: Keywords extractected in step 5 should be used to search the index and get pointers or links ( also stored in the database) to location of the webpage 4. Ranking Algorithm: Can use Boolean, Vector or Probabalistic models to rank the results of the search process 6. Display Output: After the ranking is done, the output has to be displayed best match first.
Deliverables: 1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
2) Installation package that will install the software (in ready-to-run condition) on the platform(s) specified in this bid request.
3) Complete ownership and distribution copyrights to all work purchased.
Platform:
Softwares: Java, C, C++, Perl, any software Database: Access, mysql
Must be 100% finished and received by buyer on:
Feb 20, 2003 EDT
Deadline legal notes: All times are expressed in the time zone of the site EDT (UT - 5). If the buyer omitted a time, then the deadline is 11:59:59 PM EDT on the indicated date.
Additional Files:
This bid request includes IMPORTANT additional attached files. Please download and read fully before bidding.
Remember that contacting the other party outside of the site (by email, phone, etc.) on all business projects < $500 (before the buyer's money is escrowed) is a violation of both the software buyer and seller agreements.
We monitor all site activity for such violations and can instantly expel transgressers on the spot, so we thank you in advance for your cooperation.
If you notice a violation please help out the site and report it. Thanks for your help.
|
|
Bidding/Comments:
|
All monetary amounts on the site are in United States dollars.
Rent a Coder is a closed auction, so coders can only see their own bids and comments. Buyers can view every posting made on their bid requests. |
See all rejected bids (and all comments)
Name |
Bid Amount |
Date |
Coder Rating |
|
|
This bid was accepted by the buyer!
|
$120 (USD)
|
Jan 30, 2003 5:53:02 PM EDT
|
10
(Excellent)
|
|
|
My solution is based on a MySQL database and three major components:
* spider (traverses all documents for a site) * indexer (creates / updates the index) * query interface (for parsing queries and displaying results)
The spider and indexer would be developed in Perl, while the query interface could be built in Perl or Java, depending on your needs.
Should the spider be site friendly (obey robots.txt) and should only HTML pages be downloaded for indexing?
Should the indexer look at text only, or should META keywords be used as well? What about stop words, embedded punctuation (e.g. 'short-term' , 'VAX/VMS') and morphology ('interesting', 'interests', 'interested',...).
I assume that you want a web interface for the query interface. I will produce a basic design for this if no design guidelines exist.
I have (as you can see from my resume) previously developed a free text indexer and query interface for a large news database.
I would develop on Linux, but could test on windows as well, if needed. My bid of $120 is based on the above questions being answered. I will need about a week from start to finish.
Hope to hear from you.
|
|
|
|
|
|