I do contract work for a living, which could include writing a program such as this. However, I don’t do people’s homework for them. That just robs them of an education.
You have my full permission to implement this project any way you please.
I don’t mean the web page was updated or created between those dates, I mean that the web page contains an internal date, e.g. a sentence of the form: “Mr. Bush said that he was totally innocent of any wrongdoing an Abu Ghraib on January 3, 2005. It was all the fault of a few bad apples.”
When you are spidering the web to create the indexes, you have to be clever about plucking dates. The article might have a date of the form 01/19/2005 and the date reference might be “last wednesday“ You have to handle all the many forms of dates and date references embedded in aricles and convert them to standard YYYY-MM-DD iso form.
You create a list of dates and the offsets at which they occur in the document.
You could demonstrate this with a local database of articles randomly selected from the web. Then show your demo to each of the search engine companies and see if you can sell your date-plucking code for them to incorporate. It had better be quick. Every document on the web will be repeatedly run through your algorithm.
![]() |
and suggestions to improve this page to Roedy Green : | ||
| Canadian Mind Products | |||
| mindprod.com IP:[65.110.21.43] | |||
| Your face IP:[38.103.63.17] | The information on this page is for non-military use only. | ||
| You are visitor number 2,896. | Military use includes use by defence contractors. | ||
| You can get a fresh copy of this page from: | or possibly from your local J: drive (Java virtual drive/Mindprod website mirror) | ||
| http://mindprod.com/project/datesearchengine.html | J:\mindprod\project\datesearchengine.html | ||