Solr Background Method Server throws errors and runs out of memory when indexing some xlsx and xlsm files in Windchill
Applies To
- Windchill PDMLink 10.0 F000 to M050
- Windchill PDMLink 10.1 F000 to M040
Description
- Solr Background Method Server stops when indexing some xlsx files (Excel Workbook)
- An out-of-memory error is thrown before the server stops
- Solr Background Method Server logs contain: java.lang.OutOfMemoryError: GC overhead limit exceeded
- Indexing some xlsm files (Excel Macro-Enabled Workbook) writes the following error to the logs:
ERROR : com.ptc.search.solr.contentReader.contentExtraction.TikaExtractor Administrator - Error parsing content of file filename.xlsm
org.apache.tika.exception.TikaException: Error creating OOXML extractor
at org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:90)
at org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:67)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:137)
at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:150)
at com.ptc.search.solr.contentReader.contentExtraction.TikaExtractor.extractContent(TikaExtractor.java:47)
at com.ptc.search.solr.contentReader.contentExtraction.StreamExtractor.run(StreamExtractor.java:69)
at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:662)
Caused by: org.apache.xmlbeans.impl.values.XmlValueOutOfRangeException: string value 'Text' is not a valid enumeration value for ST_ObjectType in namespace urn:schemas-microsoft-com:office:excel
at org.apache.xmlbeans.impl.values.JavaStringEnumerationHolderEx.set_text(JavaStringEnumerationHolderEx.java:39)
at org.apache.xmlbeans.impl.values.XmlObjectBase.update_from_wscanon_text(XmlObjectBase.java:1169)
at org.apache.xmlbeans.impl.values.XmlObjectBase.check_dated(XmlObjectBase.java:1308)
at org.apache.xmlbeans.impl.values.JavaStringEnumerationHolderEx.getEnumValue(JavaStringEnumerationHolderEx.java:60)
at schemasMicrosoftComOfficeExcel.impl.CTClientDataImpl.getObjectType(Unknown Source)
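
To confirm whether a given log shows the symptoms described above, the two signatures can be searched for directly. The following is a minimal sketch, not a PTC utility: the class name SolrLogScan and its methods are hypothetical, and the log path is whatever file you pass on the command line. It only matches the strings quoted in this article (the GC overhead OutOfMemoryError and the TikaExtractor "Error parsing content of file" line).

```java
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Windchill: scans log lines for the two
// signatures quoted in this article.
final class SolrLogScan {
    // Memory failure signature as it appears in the log.
    static final String OOM = "java.lang.OutOfMemoryError: GC overhead limit exceeded";
    // Text that precedes the file name in the TikaExtractor error line.
    static final String PARSE_ERROR = "Error parsing content of file ";

    /** Returns true if any line contains the GC overhead OutOfMemoryError. */
    static boolean hasOutOfMemory(List<String> lines) {
        for (String line : lines) {
            if (line.contains(OOM)) {
                return true;
            }
        }
        return false;
    }

    /** Returns the file names reported in TikaExtractor parse errors, in log order. */
    static List<String> failedFiles(List<String> lines) {
        List<String> files = new ArrayList<>();
        for (String line : lines) {
            int at = line.indexOf(PARSE_ERROR);
            if (at >= 0 && line.contains("TikaExtractor")) {
                files.add(line.substring(at + PARSE_ERROR.length()).trim());
            }
        }
        return files;
    }

    public static void main(String[] args) throws Exception {
        // args[0]: path to the log file to scan (an assumption; supply your own).
        // ISO-8859-1 maps every byte to a character, so decoding cannot fail.
        List<String> lines =
                Files.readAllLines(Paths.get(args[0]), StandardCharsets.ISO_8859_1);
        System.out.println("GC overhead OutOfMemoryError found: " + hasOutOfMemory(lines));
        for (String file : failedFiles(lines)) {
            System.out.println("Tika parse error for: " + file);
        }
    }
}
```

Compile with javac and run it as `java SolrLogScan <path-to-log>`. The files it lists are candidates to examine further; the sketch does not parse the stack trace itself.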