Help get this topic noticed by sharing it on Twitter, Facebook, or email.

Feature Request: Develop files supported in search results

Feature Request: We have approximately 200 pdfs on our site (hccm.med.harvard.edu), and currently they are not returned in search results. I would like to request that files be supported in search results - to improve user experience and site functionality. Thank you for considering. . .
1 person likes
this idea
+1
Reply
  • Users should probably then be given an option to exclude files from their search results.

    I guess Search doesn't have a filter by content type feature yet (Page, Event, People, etc.) but if it did, adding Files as a filter option could be a way of handling both avoiding files in search results and looking only at files in search results.
  • (some HTML allowed)
    How does this make you feel?
    Add Image
    I'm

    e.g. kidding, amused, unsure, silly happy, confident, thankful, excited indifferent, undecided, unconcerned sad, anxious, confused, frustrated

  • I'm also a big believer in using the content type LINKS.  This way you can have your own text referencing the pdf and the text you use is searchable.  Also you can tag files and/or links. 
    • If you're attaching a file to a link post, what do you enter for the URL? You can attached a file to a Page and write text as well, you'd just have to make sure to use at least one tag and a List of Posts somewhere so there's a way to navigate to them whereas link posts automatically have the Links app as a collector.

      I think Joshua is looking for full text searching of uploaded PDFs something Google could do if his particular site wasn't restricted to the Harvard Community (my guess is because it has info about conducting animal-based research).

      I think the platform uses Apache Solr as its search engine, which can index PDFs, Word docs, spreadsheets, etc. However, it looks like it does so by extracting the text to a separate file so in addition to figuring out how to present files in search results alongside posts, things like, deleting extracted text when the file is deleted, replacing it when a replacement file is uploaded, etc. would have to be a part of development.
  • (some HTML allowed)
    How does this make you feel?
    Add Image
    I'm

    e.g. kidding, amused, unsure, silly happy, confident, thankful, excited indifferent, undecided, unconcerned sad, anxious, confused, frustrated