Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Code Project
CODE PROJECT For Those Who Code
  • Home
  • Articles
  • FAQ
Community
  1. Home
  2. Web Development
  3. Site Indexing

Site Indexing

Scheduled Pinned Locked Moved Web Development
databasecomalgorithmsquestion
2 Posts 2 Posters 0 Views 1 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • J Offline
    J Offline
    Jack Puppy
    wrote on last edited by
    #1

    Anyone have experience building a search index for documents on a Web site? I can't imagine I'll ever have more than a couple of hundred docs, most of which will be < 1500 words. All I was planning on doing was: - tally the # of word(s) in each document matching the search word(s) to determine a relevancy. - give more weight to words that appear in the document title (say 20x) - remove common words like "the", "an", etc. from the search These are the things I'm wondering about after browsing the web for info on the subject... - is it worth it to use stemming? (stay, stays, stayed, staying return the same results) - is it worth it to use a fancy word frequency algorithm that determines word weight on a per document basis?

    My 20 Favorite Films

    A 1 Reply Last reply
    0
    • J Jack Puppy

      Anyone have experience building a search index for documents on a Web site? I can't imagine I'll ever have more than a couple of hundred docs, most of which will be < 1500 words. All I was planning on doing was: - tally the # of word(s) in each document matching the search word(s) to determine a relevancy. - give more weight to words that appear in the document title (say 20x) - remove common words like "the", "an", etc. from the search These are the things I'm wondering about after browsing the web for info on the subject... - is it worth it to use stemming? (stay, stays, stayed, staying return the same results) - is it worth it to use a fancy word frequency algorithm that determines word weight on a per document basis?

      My 20 Favorite Films

      A Offline
      A Offline
      Ashraf Fathy
      wrote on last edited by
      #2

      u can use the indexing service, it will help u..it is so easy...u will be able to search for any word with a simple select statment Ashraf Fathy

      1 Reply Last reply
      0
      Reply
      • Reply as topic
      Log in to reply
      • Oldest to Newest
      • Newest to Oldest
      • Most Votes


      • Login

      • Don't have an account? Register

      • Login or register to search.
      • First post
        Last post
      0
      • Categories
      • Recent
      • Tags
      • Popular
      • World
      • Users
      • Groups