IFilter broken for .docx files


It is known that the Microsoft IFilter for .docx is broken: when using the offfiltx.dll IFilter to extract text from a Word document, it ignores line breaks. As a result, extracting text from the following two-line document snippet:

i computer

and i

returns "i computerand i"

This problem has been documented for several years, yet the latest IFilter pack I can find (Office 2010, updated in 2015) *still* has the same deficiency.

As IFilters are seemingly used within Microsoft's own products, I don't understand how such a fundamental deficiency has not yet been addressed and fixed.

Will this ever be fixed? Have IFilters been deprecated? I see nothing saying they have, but the IFilter packs support at most Win 2007 / Server 2008 according to the MS documentation on the web.

This is not the correct forum for this question.

As for the specific issue: yes, the IFilter's GetText method returns text without line breaks. This should be resolved through the IFilter API:

msdn.microsoft.com/en-us/library/ms690992%28vs.85%29.aspx

https://msdn.microsoft.com/en-us/library/ms692540(v=vs.85).aspx
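
To illustrate what resolving this through the API could look like: in the IFilter interface, GetChunk fills in a STAT_CHUNK whose breakType field (a CHUNK_BREAKTYPE value) describes the break between the previous chunk and the current one, and the caller is expected to insert that break itself. The Python sketch below only simulates that joining logic on hand-written chunks. It does not call the real COM interface, and it assumes the .docx filter reports each paragraph as its own chunk with CHUNK_EOP, which I have not verified for offfiltx.dll. The names `ChunkBreak`, `SEPARATORS` and `join_chunks` are hypothetical, and the separator chosen for each break type is my own choice.

```python
from enum import IntEnum


class ChunkBreak(IntEnum):
    """Mirrors the CHUNK_BREAKTYPE values reported in STAT_CHUNK.breakType."""
    NO_BRK = 0  # no break between previous and current chunk
    EOW = 1     # word break
    EOS = 2     # sentence break
    EOP = 3     # paragraph break
    EOC = 4     # chapter break


# Separator to emit for each break type (an illustrative choice, not mandated by the API).
SEPARATORS = {
    ChunkBreak.NO_BRK: "",
    ChunkBreak.EOW: " ",
    ChunkBreak.EOS: " ",
    ChunkBreak.EOP: "\n",
    ChunkBreak.EOC: "\n\n",
}


def join_chunks(chunks):
    """Join (break_type, text) pairs, inserting the break that precedes each chunk.

    Each pair stands in for one GetChunk call plus the text collected from
    the GetText calls for that chunk.
    """
    parts = []
    for break_type, text in chunks:
        if parts:  # the break separates this chunk from the previous one
            parts.append(SEPARATORS[ChunkBreak(break_type)])
        parts.append(text)
    return "".join(parts)


if __name__ == "__main__":
    # Simulated chunks for the two-line snippet from the question.
    simulated = [(ChunkBreak.NO_BRK, "i computer"), (ChunkBreak.EOP, "and i")]
    print(join_chunks(simulated))
```

If the filter really does report every paragraph with CHUNK_NO_BRK, or returns the whole document as a single chunk, this approach cannot recover the line breaks and the text would have to be obtained another way.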


The code in this thread might be helpful: http://stackoverflow.com/questions/1939187/word-ifilter-for-docx-parser-error

