

dtSearch has introduced a version 2026.01 beta that simplifies how users see highlighted search results in PDF files. The new release eliminates the need for a separate PDF highlighter plug-in, a change that applies to dtSearch enterprise and developer products, including SDKs for Windows, Linux, and macOS. These products search terabytes of mixed online and offline data instantly, running on premises or in the cloud, such as on Azure or AWS.
The main feature of the new version is improved PDF hit highlighting. The new process highlights search hits by adding annotations directly to the PDF file. This means PDF files now work like other supported data types (such as Microsoft Office files and emails with attachments), displaying files with multicolor hit highlighting for any number of concurrent users.
dtSearch owner David Thede told SD Times in an interview that the old approach of using an Adobe Acrobat Reader plug-in became increasingly untenable in a browser environment. The new method provides a much cleaner way for people to add PDF highlighting in their applications. Thede explained how the system changed: “The key to getting that work is that we needed to be able to add the highlights as annotations in the pdf file, so rather than generating html from pdf, we take an existing pdf and we stick the annotations on it, and then serve that.”
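For readers unfamiliar with PDF annotations, the sketch below shows roughly what a highlight annotation looks like as a PDF object. It is an illustration of the general PDF mechanism only, not dtSearch's implementation or API: the function name `highlight_annotation` is hypothetical, and the quad-point ordering follows what common viewers accept rather than anything stated in the interview.

```python
def highlight_annotation(rect, color=(1.0, 1.0, 0.0)):
    """Return PDF object text for a /Highlight annotation covering rect.

    rect is (x0, y0, x1, y1) in PDF user-space units, with x0 < x1 and y0 < y1.
    Hypothetical helper for illustration; not part of any dtSearch API.
    """
    x0, y0, x1, y1 = rect

    # One quadrilateral per highlighted run of text. The corner order used
    # here (top-left, top-right, bottom-left, bottom-right) is an assumption
    # based on what widely used viewers expect.
    quad = [x0, y1, x1, y1, x0, y0, x1, y0]

    def fmt(nums):
        # Compact numeric formatting: 1.0 -> "1", 100.5 -> "100.5"
        return " ".join(f"{n:g}" for n in nums)

    return (
        "<< /Type /Annot /Subtype /Highlight "
        f"/Rect [{fmt(rect)}] "
        f"/QuadPoints [{fmt(quad)}] "
        f"/C [{fmt(color)}] >>"
    )


if __name__ == "__main__":
    # A yellow highlight over a word located one inch from the left margin.
    print(highlight_annotation((72, 700, 200, 712)))
```

An object like this would be referenced from a page's `/Annots` array, which leaves the original page content untouched. That is the contrast Thede draws with converting the PDF to HTML.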
In the new version, dtSearch has a way to work with browsers that use the open-source pdf.js project, Thede said. The Firefox browser, like many browsers, has a JavaScript-based PDF viewer based on that project. “So, in our dtSearch desktop product we can embed a viewer window that has pdf.js used to display the pdf file. We can do the hit navigation and the hit highlighting on top of that, but we can also do it in our web-based products.”
dtSearch products include a Terabyte Indexer that can index a terabyte of text across many sources, including emails with nested attachments and online data. Indexed search is typically instantaneous, even when covering terabytes of data with concurrent users. The product line offers over 25 search features, including full-text and metadata options. It supports Unicode for hundreds of international languages and offers forensics-oriented options. SDKs are available with C++, Java, and .NET APIs, and they support SQL and NoSQL databases.
Thede stressed the value of the new PDF feature. He said, “Being able to highlight hits in PDF files after a search is a very nice thing to be able to do, because PDF is so widely used.” He noted that it is a huge time saver for professionals, such as lawyers reviewing long documents.
Regarding AI integration, Thede confirmed that dtSearch does not include AI in its products. He noted this decision is tied to customer security concerns: “Our customers tend to be institutions that are extremely concerned about confidentiality.” However, Thede added that dtSearch plans to look at ways to give users the tools to connect their search results with AI when they choose to do so.
