Sachi7

Document Content Searchable - when does a document become searchable through its content?

I understand that enabling "Document Content Searchable" in Search Settings allows searching through the content of a document, and that there is a flag, Document Content Searchable (IsBodySearchable), on every document record.

 

The reference guide mentions that once a document has been completely processed by Salesforce, it becomes available for search through its content.

 

I would like to know how long it takes to process the content of a document. I uploaded a .docx file containing only one word, "chipmunks". It has been 2 hours now, and that document record's Document Content Searchable (IsBodySearchable) flag is still not set to TRUE.

 

Please let me know if there are any additional constraints, such as file type or MIME type, on making a document searchable through its content.

 

I also observed in my org that all the document records that have "Document Content Searchable" set to TRUE are available for public use (IsPublic = TRUE). Please confirm whether this is also a constraint.

Best Answer chosen by Admin (Salesforce Developers) 
Vinita_SFDC

Hi,

 

The document should be searchable within a few minutes. Please consider the following best practices:

When you search using keywords, each word in the search query is treated as a separate keyword. However, putting quotes around a group of words treats that group as a single keyword. By default, a search looks for the keywords in the document name, keywords, and description. It is also possible to search the contents of documents, but this must be enabled first. To enable document content search, follow these steps.

Go to Setup -> Customize -> Search -> Search Settings, then select 'Enable Document Content Search'.

The file types supported for content search include HTML, XHTML, PDF, PPT, Text (.css, .java, .txt, etc.), DOC, XLS, and XML. A few things to remember:

- Not all PDF documents are searchable. Specifically, a PDF in which Adobe's Find utility cannot locate a given word is not searchable; such PDFs are mostly created from non-textual data.
- Some PDF documents have security settings that make their contents non-searchable.
- Changing a file's extension from one searchable document type (such as .html) to another (such as .txt) makes the file unreadable to the system and generates errors.
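
As a rough pre-upload check, the supported-type list above can be expressed as a small lookup. This is only a sketch in Python: the mapping from type names to extensions (for example HTML to `.html`) is an assumption, the "etc." in the Text group is left unexpanded, and `.docx` is deliberately absent because of the known issue noted at the end of this answer. The name `is_content_searchable` is hypothetical and not part of any Salesforce API.

```python
from pathlib import PurePath

# Extensions taken from the list in this answer. The type-to-extension
# mapping is an assumption, and the list is not exhaustive ("etc.").
SUPPORTED_EXTENSIONS = frozenset({
    ".html", ".xhtml", ".pdf", ".ppt",
    ".css", ".java", ".txt",  # the "Text" group
    ".doc", ".xls", ".xml",
})


def is_content_searchable(filename: str) -> bool:
    """Return True if the file's extension is on the supported list.

    This only inspects the name; it cannot tell whether a PDF is
    image-only or protected by security settings.
    """
    return PurePath(filename).suffix.lower() in SUPPORTED_EXTENSIONS


if __name__ == "__main__":
    for name in ("notes.txt", "Report.PDF", "chipmunks.docx"):
        print(name, is_content_searchable(name))
```

Because the check looks only at the file name, a renamed file (say, an .html file saved as .txt) would still pass here even though the system may fail to read it.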

You can also upload documents, including their keywords, via the Data Loader. Keywords are treated as a text field, so watch the length of those keywords.

Also, if the file extension is .docx, the document will not be searchable. This is a known issue that R&D is working on.
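
To check whether a particular upload has been indexed, one option is to query the two flags discussed in the question, IsBodySearchable and IsPublic, on the Document object. The Python sketch below only builds the SOQL text. The helper names `escape_soql_literal` and `build_document_flag_query` are hypothetical, and how the query is run (Developer Console, Workbench, or an API client) is left open.

```python
def escape_soql_literal(value: str) -> str:
    """Escape backslashes and single quotes for a SOQL string literal."""
    return value.replace("\\", "\\\\").replace("'", "\\'")


def build_document_flag_query(document_name: str) -> str:
    """Build a SOQL query that shows the search-related flags of a Document."""
    name = escape_soql_literal(document_name)
    return (
        "SELECT Id, Name, Type, IsBodySearchable, IsPublic "
        "FROM Document "
        f"WHERE Name = '{name}'"
    )


if __name__ == "__main__":
    # "chipmunks" is the example document from the question.
    print(build_document_flag_query("chipmunks"))
```

Comparing IsBodySearchable against Type and IsPublic across several documents would show whether the flag follows the file type or the public setting in a given org.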

Hope this helps!

All Answers

Shaun Majlinger
Has the .docx issue been resolved?