eZFind and pdf indexing

Gwenal Le Bihan

Friday 19 September 2008 2:30:33 am

Hello,

I've been trying to index PDF files using eZ Publish 4.0.1, eZ Find and this binary file parser http://ez.no/developer/articles/indexing_multiple_binary_file_types/installation and it's working OK except for very large PDF files.

I tried it on a 2.7 MB PDF file (approx. 140,000 characters). There are no error messages in any log, but the file is only indexed up to a certain point and then nothing (approx. 66% of the words are indexed).

Is there a limit (size, number of characters, ...) set in some configuration file such as solrconfig.xml? I don't really understand everything in it and don't want to mess with it.

Is there anybody out there with the same problem?

Thanks a lot

gwenal

Gwenal Le Bihan

Friday 19 September 2008 3:54:56 am

PS: In ezbinaryparser.php, I've commented out this line:

//               $sData = substr($sData, 0, $iCharacterLimit);

so that I don't get a partial index for my file during the original upload.
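
For anyone following along, here is a minimal sketch of what that substr() line does to the extracted text. The sample text and the limit value are made up for illustration and are not taken from ezbinaryparser.php; only the variable names $sData and $iCharacterLimit come from the line above.

```php
<?php
// Hypothetical sample: 10 repetitions of "word " = 50 characters of extracted text.
$sData = str_repeat('word ', 10);

// Hypothetical limit, chosen only for this illustration.
$iCharacterLimit = 20;

// Same call as in the parser line: keep only the first $iCharacterLimit characters.
$sTruncated = substr($sData, 0, $iCharacterLimit);

echo strlen($sData), " characters extracted\n";
echo strlen($sTruncated), " characters passed on for indexing\n";
```

Anything after the limit never reaches the indexer, so words from the end of a long file cannot be found by search. With the line commented out this truncation no longer happens, so the cut-off I still see at around 66% presumably comes from somewhere else.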