IFilter broken for .docx files
it known microsoft ifilter .docx broken - when using offfiltx.dll ifilter extract text word document ignores line breaks , therefore extracting text following 2 line document snippet:
i computer
and i
returns "i computerand i"
this problem has been documented on several years , yet latest ifilter pack can find (office 2010 updatedin 2015) *still* has same deficiency.
as ifilters seemingly used within microsoft's own products don't understand how such fundamental deficiency has not yet been addressed , fixed.
will ever fixed? have ifilters been deprecated? see nothing saying have of ifilter packs support max win 2007 / server 2008 in ms doc on web.
it's not correct forum question.
and specific issue, yes, ifilter's gettext method returns text without line breaks. should resolved ifilters api.
msdn.microsoft.com/en-us/library/ms690992%28vs.85%29.aspx
https://msdn.microsoft.com/en-us/library/ms692540(v=vs.85).aspx
code in thread mihgt helpful: http://stackoverflow.com/questions/1939187/word-ifilter-for-docx-parser-error
Windows Server > Windows Server General Forum
Comments
Post a Comment