Original | Industry News | Editor: 龚雪 | 2015-03-11 13:09:15 | 218 views
Overview: Many people have heard of enterprise search, but few know that what sits underneath enterprise search is the document filter.
If you looked at a Microsoft Word file in binary format (which is how a search engine has to read it), you would find the file structure so complex that picking out the text is nearly impossible. MS Word documents include not only body text but also fields and, often, hidden metadata. MS Word files can also have a nested structure, embedding multiple layers of other documents within the Word file.
Delving through these levels of complexity requires code that embodies a deep understanding of each file structure. That is the job of document filters.
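To make the idea of a filter concrete, here is a minimal sketch using only the Python standard library. It is not dtSearch code. It assumes the modern .docx flavor of Word, which is a ZIP container holding XML parts, and it builds a tiny stand-in container in memory rather than reading a real file; the helper names `make_sample_docx` and `extract_docx_text` are hypothetical. A real filter would also have to handle the legacy binary .doc format, fields, metadata, and embedded objects.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

# Namespace used by WordprocessingML elements such as <w:p> and <w:t>.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def make_sample_docx() -> bytes:
    """Build a tiny stand-in for a .docx: a ZIP holding word/document.xml.

    This is only enough structure for the sketch; a file Word can open
    needs additional parts (content types, relationships).
    """
    body = (
        f'<w:document xmlns:w="{W_NS}"><w:body>'
        "<w:p><w:r><w:t>Hello</w:t></w:r><w:r><w:t> world</w:t></w:r></w:p>"
        "<w:p><w:r><w:t>Second paragraph</w:t></w:r></w:p>"
        "</w:body></w:document>"
    )
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        zf.writestr("word/document.xml", body)
    return buf.getvalue()


def extract_docx_text(data: bytes) -> list[str]:
    """Return one string per paragraph, joining the text runs inside it."""
    with zipfile.ZipFile(io.BytesIO(data)) as zf:
        root = ET.fromstring(zf.read("word/document.xml"))
    paragraphs = []
    for p in root.iter(f"{{{W_NS}}}p"):
        runs = [t.text or "" for t in p.iter(f"{{{W_NS}}}t")]
        paragraphs.append("".join(runs))
    return paragraphs


raw = make_sample_docx()
print(raw[:2])                  # raw container bytes, not readable text
print(extract_docx_text(raw))   # paragraph text recovered by the "filter"
```

The point of the sketch is the gap between the two print calls: the raw bytes say nothing useful to a search engine until format-aware code has unpacked the container and walked the markup.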
Document filters are a dynamic component. For example, every update that Microsoft makes to the MS Word format requires a matching adjustment to the document filters, which must still preserve backward compatibility with existing Word files.
One leading supplier of enterprise and developer text search software, dtSearch Corp., has spent over two decades building its own document filters. And the company continually upgrades its document filters to correspond with the release of new data formats.
In addition to Word, other MS Office file types that dtSearch supports include PowerPoint, Excel, Access, and OneNote. The document filters also support PDF, RTF, OpenOffice, HTML, XML, CSV, and many other file types, along with compression formats like RAR, ZIP, and GZIP/TAR. And the dtSearch document filters support recursively embedded versions of files, such as a Word file embedded in an Excel file contained in a ZIP attachment.
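Recursive embedding can be illustrated with a short sketch. The code below is an assumption-laden stand-in, not the dtSearch implementation: it treats every container as a ZIP archive (real filters recognize many container types, including Office documents and email attachments), and the names `make_zip` and `walk` are hypothetical. It descends into nested archives until it reaches leaf content, with a depth limit as a guard against pathological nesting.

```python
import io
import zipfile


def make_zip(members: dict[str, bytes]) -> bytes:
    """Pack the given name -> payload pairs into an in-memory ZIP archive."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        for name, payload in members.items():
            zf.writestr(name, payload)
    return buf.getvalue()


def walk(data: bytes, path: str = "", max_depth: int = 8):
    """Yield (path, bytes) for every leaf, descending into nested ZIPs."""
    if max_depth > 0 and zipfile.is_zipfile(io.BytesIO(data)):
        with zipfile.ZipFile(io.BytesIO(data)) as zf:
            for info in zf.infolist():
                if info.is_dir():
                    continue
                child = f"{path}/{info.filename}" if path else info.filename
                # Recurse: the member may itself be another container.
                yield from walk(zf.read(info), child, max_depth - 1)
    else:
        yield path, data


# Three levels of nesting, loosely mirroring "a file inside a file inside a ZIP".
inner = make_zip({"report.txt": b"quarterly totals"})
middle = make_zip({"sheet/embedded.zip": inner, "notes.txt": b"see attachment"})
outer = make_zip({"attachment.zip": middle})

for leaf_path, leaf_bytes in walk(outer):
    print(leaf_path, leaf_bytes)
```

Each leaf is reported with the full path of containers that had to be opened to reach it, which is the information a search result needs in order to point back at the right embedded item.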
The dtSearch document filters can also support browser-compatible images in files, including recursively embedded files. The document filters further include Unicode support covering hundreds of international languages.
With so much data now in emails, the dtSearch document filters also support email formats like MS Outlook, Exchange, and Thunderbird. Support extends beyond the email body and metadata to cover multi-layered nested attachments, including recursively embedded images.
The dtSearch Engine APIs can also work with database data such as SQL. While SQL itself is not a file format, a SQL database can include BLOB data consisting of embedded documents. The same integrated support for recursively embedded documents, metadata, images, and the like applies to this BLOB data.
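As a rough illustration of documents living inside database BLOBs, the following sketch uses Python's built-in `sqlite3` module. It does not use the dtSearch Engine API. The table name `attachments`, its columns, and the helper `blob_text` are hypothetical, and the "document filter" is reduced to a single check for a ZIP container.

```python
import io
import sqlite3
import zipfile


def zip_bytes(name: str, text: str) -> bytes:
    """Create an in-memory ZIP archive containing one text member."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        zf.writestr(name, text)
    return buf.getvalue()


def blob_text(blob: bytes) -> str:
    """Toy filter: unpack ZIP blobs, otherwise decode the bytes as text."""
    if zipfile.is_zipfile(io.BytesIO(blob)):
        with zipfile.ZipFile(io.BytesIO(blob)) as zf:
            return " ".join(zf.read(n).decode("utf-8") for n in zf.namelist())
    return blob.decode("utf-8", errors="replace")


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE attachments (id INTEGER PRIMARY KEY, title TEXT, body BLOB)"
)
conn.executemany(
    "INSERT INTO attachments (title, body) VALUES (?, ?)",
    [
        ("contract", zip_bytes("contract.txt", "termination clause")),
        ("memo", b"plain memo text"),
    ],
)

# Each BLOB comes back as raw bytes and is passed through the toy filter.
rows = conn.execute("SELECT title, body FROM attachments ORDER BY id").fetchall()
extracted = {title: blob_text(body) for title, body in rows}
print(extracted)
```

The database only stores and returns bytes. Deciding what those bytes are, and how to get text out of them, remains the filter's job.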
Finally, the dtSearch Spider supports static and dynamic Web data (SharePoint, PHP, ASP.NET, CMS, etc.). This data can further consist of (or simply embed) document data such as HTML, PDF, XSL/XML, or even Office files, all of which require the document filters.
dtSearch enterprise and developer products can index more than a terabyte of data in a single index. A single index can span multiple file directories, emails and attachments, online data, and other databases. The products can create and search any number of indexes.
After indexing, the product line supports highly concurrent, multithreaded searching. Indexed search time is typically less than a second, even across terabytes of data. dtSearch products offer more than 25 search options.
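Why indexed search is fast, and why it lends itself to concurrency, can be sketched with a toy inverted index. This is a simplified illustration under stated assumptions, not a description of dtSearch internals: the sample documents, `tokenize`, `build_index`, and `search` are all hypothetical, queries are plain AND queries, and the index lives in memory.

```python
import re
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor

DOCS = {
    "a.txt": "Document filters parse file formats",
    "b.txt": "Search engines index text from filters",
    "c.txt": "Email attachments can nest other documents",
}


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs: dict[str, str]) -> dict[str, set[str]]:
    """Map each word to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in tokenize(text):
            index[word].add(doc_id)
    return index


def search(index: dict[str, set[str]], query: str) -> list[str]:
    """Return the sorted ids of documents containing every query word."""
    words = tokenize(query)
    if not words:
        return []
    hits = set.intersection(*(index.get(w, set()) for w in words))
    return sorted(hits)


INDEX = build_index(DOCS)  # built once, then only read

queries = ["filters", "documents", "index text", "missing"]
with ThreadPoolExecutor(max_workers=4) as pool:
    # Searches only read the index, so they can run side by side.
    results = dict(zip(queries, pool.map(lambda q: search(INDEX, q), queries)))

for query, hits in results.items():
    print(query, "->", hits)
```

The expensive work, running every document through a filter and tokenizing it, happens once at indexing time. A query then touches only the entries for its own words, and because searching never modifies the index, many queries can share it at once.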
For federated searching, dtSearch products support integrated relevancy ranking across both online and offline repositories. Following a search, the document filters enable hit-highlighting of federated search content.
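The following sketch shows one way to think about federated ranking and hit-highlighting. It is not how dtSearch computes relevancy, which the text above does not describe. As a stated assumption, the score here is simply the number of times the term occurs, and the repository names, documents, and functions are hypothetical.

```python
import re

REPOSITORIES = {
    "file-share": {
        "spec.docx": "filters filters and more filters",
        "readme.txt": "no match here",
    },
    "web": {
        "/blog/post": "document filters for search",
    },
}


def score(text: str, term: str) -> int:
    """Count whole-word, case-insensitive occurrences of the term."""
    return len(re.findall(rf"\b{re.escape(term)}\b", text, flags=re.IGNORECASE))


def federated_search(repos: dict[str, dict[str, str]], term: str):
    """Rank hits from every repository on one shared scale."""
    hits = []
    for repo, docs in repos.items():
        for doc_id, text in docs.items():
            s = score(text, term)
            if s:
                hits.append((s, repo, doc_id))
    # Highest score first; ties broken by repository and document id.
    hits.sort(key=lambda h: (-h[0], h[1], h[2]))
    return hits


def highlight(text: str, term: str) -> str:
    """Wrap each whole-word occurrence of the term in <b> tags."""
    return re.sub(
        rf"\b({re.escape(term)})\b", r"<b>\1</b>", text, flags=re.IGNORECASE
    )


for s, repo, doc_id in federated_search(REPOSITORIES, "filters"):
    print(s, repo, doc_id, highlight(REPOSITORIES[repo][doc_id], "filters"))
```

The key property is that scores from different repositories are comparable, so the merged list is a single ranking rather than one result list per source. Highlighting operates on extracted text, which is why it depends on the same filters that were used for indexing.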
In the dtSearch Engine, API filters and objects provide an even wider range of advanced data classification options. SDKs include native 64-bit and 32-bit APIs for C++, Java, and .NET (through current versions).
Unless marked as a repost, articles on this site are original or translated works of this site. Reposting in any form is welcome, provided that the source is credited and the links in the original text are left unmodified. If you have any objection to the content, please email chenjj@evget.com
Article source: 慧都控件网