dtSearch has introduced version 2026.01 beta, which simplifies how users see highlighted search results in PDF files. The new release eliminates the need for a separate PDF highlighter plug-in, a change that applies to dtSearch enterprise and developer products, including SDKs for Windows, Linux, and macOS. These products search terabytes of mixed online and offline data instantly, running on premises or in the cloud, such as on Azure or AWS.
The main feature of the new version is improved PDF hit highlighting. The new process highlights search hits by adding annotations directly to the PDF file. This means PDF files now work like other supported data types, such as Microsoft Office files and emails with attachments, displaying files with multicolor hit highlighting for any number of concurrent users.
dtSearch owner David Thede told SD Times in an interview that the old approach of using an Adobe Acrobat Reader plug-in became increasingly untenable in a browser environment. The new method provides a much cleaner way for people to add PDF highlighting in their applications. Thede explained how the system changed: “The key to getting that to work is that we needed to be able to add the highlights as annotations in the pdf file, so rather than generating html from pdf, we take an existing pdf and we stick the annotations on it, and then serve that.”
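To make the annotation idea concrete, here is a minimal sketch of what a highlight annotation looks like at the PDF level. It is not dtSearch's implementation: the `Hit` type and the `highlight_annotation` function are hypothetical names introduced for illustration. The sketch only builds the dictionary source for a standard PDF `/Highlight` annotation from a hit's bounding box. Attaching it to a page's `/Annots` array and rewriting the file is left out.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    """Bounding box of one search hit in PDF user-space units (hypothetical type)."""
    x0: float  # left edge
    y0: float  # bottom edge
    x1: float  # right edge
    y1: float  # top edge


def _nums(values):
    # Format numbers compactly, e.g. 72.0 -> "72".
    return " ".join(f"{v:g}" for v in values)


def highlight_annotation(hit: Hit, rgb=(1.0, 1.0, 0.0)) -> str:
    """Return PDF dictionary source for a /Highlight annotation over one hit.

    Illustrative only: a real writer would also add this object to the
    page's /Annots array and update the file's cross-reference data.
    """
    rect = (hit.x0, hit.y0, hit.x1, hit.y1)
    # Corner order is an assumption: top-left, top-right, bottom-left,
    # bottom-right, which is what many viewers expect in practice.
    quad = (hit.x0, hit.y1, hit.x1, hit.y1, hit.x0, hit.y0, hit.x1, hit.y0)
    return (
        "<< /Type /Annot /Subtype /Highlight "
        f"/Rect [{_nums(rect)}] "
        f"/QuadPoints [{_nums(quad)}] "
        f"/C [{_nums(rgb)}] >>"
    )


if __name__ == "__main__":
    # One hit highlighted in the default yellow.
    print(highlight_annotation(Hit(72, 700, 140, 712)))
```

Because the color is a per-annotation property (`/C`), passing a different `rgb` value for each search term is one way multicolor highlighting could be expressed in this scheme.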
In the new version, dtSearch has a way to work with browsers that use the open-source pdf.js project, Thede said. Firefox, like many browsers, has a JavaScript-based PDF viewer based on that project. “So, in our dtSearch desktop product we can embed a viewer window that has pdf.js used to display the pdf file. We can do the hit navigation and the hit highlighting on top of that, but we can also do it in our web-based products.”
dtSearch products include a Terabyte Indexer that can index a terabyte of text across many sources, including emails with nested attachments and online data. Indexed search is typically instantaneous, even when covering terabytes of data with concurrent users. The product line offers over 25 search features, including full-text and metadata options. It supports Unicode for hundreds of international languages and offers forensics-oriented options. SDKs are available with C++, Java, and .NET APIs, and they support SQL and NoSQL databases.
Thede stressed the value of the new PDF feature. He said, “Being able to highlight hits in PDF files after a search is a very nice thing to be able to do, because PDF is so widely used.” He noted that it is a big time saver for professionals, such as lawyers reviewing long documents.
Regarding AI integration, Thede confirmed that dtSearch does not include AI in its products. He noted this decision is tied to customer security concerns: “Our customers are typically institutions that are extremely concerned about confidentiality.” However, Thede added that dtSearch plans to look at ways to give users the tools to connect their search results with AI when they choose to do so.
