Couldn't find hyphenation pattern en_US

I experiment with transforming TEI to PDF using the file

http://tei.oucs.ox.ac.uk/ENRICH/gBank/gBank.xml?style=raw

The first error I get is

--8<---------------cut here---------------start------------->8---
SystemID: /home/jsbien/ENRICH/all/gBank.xml
Engine name: Saxon-B 9.1.0.7
Severity: error
Description: The process 'FOP' ended with code: 1. The error was:
2010-05-04 14:32:51 org.apache.fop.hyphenation.Hyphenator getHyphenationTree
SEVERE: Couldn't find hyphenation pattern en_US
--8<---------------cut here---------------end--------------->8---

The problem seems independent of the current locale. What is the remedy?

Best regards

JSB

--
dr hab. Janusz S. Bien, prof. UW - Uniwersytet Warszawski (Katedra Lingwistyki Formalnej)
Prof. Janusz S. Bien - Warsaw University (Department of Formal Linguistics)
jsbien@uw.edu.pl, jsbien@mimuw.edu.pl, http://fleksem.klf.uw.edu.pl/~jsbien/

Hello,

The remedy is to add a jar file with the hyphenation files for the language of the TEI file (en_US) to the [Oxygen-install-folder]/lib folder. Apache FOP does not embed hyphenation files due to licensing restrictions but these files can be downloaded from the OFFO project:

http://offo.sourceforge.net/hyphenation/index.html

However that error does not stop FOP processing on my computer in Oxygen 10.3 (which includes Saxon-B 9.1.0.7) and in later Oxygen versions. I get the following error in the Apache FOP output:

SEVERE: Couldn't find hyphenation pattern en_US

but the PDF output file is generated. I just saved the content of

http://tei.oucs.ox.ac.uk/ENRICH/gBank/gBank.xml?style=raw

as a local XML file and applied the predefined TEI P5 PDF transformation scenario. I get the same result if I apply the scenario directly to the URL.

What Oxygen build number do you use? It is available in the Help -> About dialog.

Do you run a custom transformation scenario?

Did you set an Apache FOP configuration file in Preferences?

Do you get the same result if you open the URL from menu File -> Open URL and apply the same scenario?

Regards,
Sorin

On 5/4/2010 8:23 PM, Janusz S. Bień wrote:
> I experiment with transforming TEI to PDF using the file
> http://tei.oucs.ox.ac.uk/ENRICH/gBank/gBank.xml?style=raw
> The first error I get is
> --8<---------------cut here---------------start------------->8---
> SystemID: /home/jsbien/ENRICH/all/gBank.xml
> Engine name: Saxon-B 9.1.0.7
> Severity: error
> Description: The process 'FOP' ended with code: 1. The error was:
> 2010-05-04 14:32:51 org.apache.fop.hyphenation.Hyphenator getHyphenationTree
> SEVERE: Couldn't find hyphenation pattern en_US
> --8<---------------cut here---------------end--------------->8---
> The problem seems independent of the current locale. What is the remedy?
> Best regards
> JSB
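
Before downloading anything, it may help to check whether any jar already in the lib folder carries a pattern file for the language in question. The following is a minimal sketch, not part of the original exchange: the function name is hypothetical, and the file names it looks for (`en_US.hyp` for a compiled pattern, `en_US.xml` for a source pattern) are an assumption about how hyphenation patterns are usually packaged for FOP.

```python
import zipfile
from pathlib import Path


def find_hyphenation_patterns(lib_dir, lang="en_US"):
    """Return (jar name, entry name) pairs for pattern files matching lang.

    Assumption: patterns are stored as <lang>.hyp or <lang>.xml somewhere
    inside a jar; adjust the names if your distribution differs.
    """
    wanted = {f"{lang}.hyp", f"{lang}.xml"}
    hits = []
    for jar in sorted(Path(lib_dir).glob("*.jar")):
        try:
            with zipfile.ZipFile(jar) as zf:
                for entry in zf.namelist():
                    # Compare only the file name, ignoring the folder inside the jar.
                    if entry.rsplit("/", 1)[-1] in wanted:
                        hits.append((jar.name, entry))
        except zipfile.BadZipFile:
            # Skip files that carry a .jar extension but are not valid archives.
            continue
    return hits
```

Calling `find_hyphenation_patterns("[Oxygen-install-folder]/lib")` with the real install path returns an empty list when no jar in that folder provides a matching pattern file.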

On Wed, 05 May 2010 Sorin Ristache <sorin@oxygenxml.com> wrote:
> Hello,
> The remedy is to add a jar file with the hyphenation files for the language of the TEI file (en_US) to the [Oxygen-install-folder]/lib folder. Apache FOP does not embed hyphenation files due to licensing restrictions but these files can be downloaded from the OFFO project:

As this appeared not to be crucial, I will do it later.

> However that error does not stop FOP processing on my computer in Oxygen 10.3 (which includes Saxon-B 9.1.0.7) and in later Oxygen versions. I get the following error in the Apache FOP output:
> SEVERE: Couldn't find hyphenation pattern en_US
> but the PDF output file is generated. I just saved the content of
> http://tei.oucs.ox.ac.uk/ENRICH/gBank/gBank.xml?style=raw
> as a local XML file and applied the predefined TEI P5 PDF transformation scenario. I get the same result if I apply the scenario directly to the URL.

I *can* reproduce this and generate the document *without images*.

> What Oxygen build number do you use? It is available in the Help -> About dialog.

XML Editor 10.3, build 2009121115

> Do you run a custom transformation scenario?

No, I use Document -> Transformation -> Apply Transformation Scenario. Earlier I somehow preselected TEI P5 PDF, and I'm no longer asked to choose the scenario.

> Did you set an Apache FOP configuration file in Preferences?

No, I don't even know where exactly this option is located.

> Do you get the same result if you open the URL from menu File -> Open URL and apply the same scenario?
I have all the images including even those missing in

http://tei.oucs.ox.ac.uk/ENRICH/gBank/gBankImages.zip

This seems to make the difference.

Enclosed please find the full list of error messages (I've removed warnings by hand).

Best regards

Janusz

--8<---------------cut here---------------start------------->8---
SystemID: /home/jsbien/ENRICH/all/gBank.xml
Engine name: Saxon-B 9.1.0.7
Severity: error
Description: The process 'FOP' ended with code: 1. The error was:
2010-05-06 17:23:42 org.apache.fop.hyphenation.Hyphenator getHyphenationTree
SEVERE: Couldn't find hyphenation pattern en_US
2010-05-06 17:23:43 org.apache.fop.fo.FONode attributeWarning
[...]
SEVERE: Error while rendering page 9
java.lang.UnsupportedOperationException: Unsupported DataBuffer type: java.awt.image.DataBufferUShort
    at org.apache.fop.pdf.AlphaRasterImage.outputContents(AlphaRasterImage.java:169)
    at org.apache.fop.pdf.PDFImageXObject.outputRawStreamData(PDFImageXObject.java:145)
    at org.apache.fop.pdf.AbstractPDFStream.encodeAndWriteStream(AbstractPDFStream.java:155)
    at org.apache.fop.pdf.AbstractPDFStream.output(AbstractPDFStream.java:199)
    at org.apache.fop.pdf.PDFImageXObject.output(PDFImageXObject.java:65)
    at org.apache.fop.pdf.PDFDocument.output(PDFDocument.java:937)
    at org.apache.fop.render.pdf.PDFRenderer.putImage(PDFRenderer.java:1767)
    at org.apache.fop.render.pdf.PDFRenderer.renderImage(PDFRenderer.java:1679)
    at org.apache.fop.render.AbstractRenderer.renderViewport(AbstractRenderer.java:743)
    at org.apache.fop.render.AbstractPathOrientedRenderer.renderViewport(AbstractPathOrientedRenderer.java:621)
    at org.apache.fop.render.AbstractRenderer.renderInlineArea(AbstractRenderer.java:626)
    at org.apache.fop.render.pdf.PDFRenderer.renderInlineArea(PDFRenderer.java:1345)
    at org.apache.fop.render.AbstractRenderer.renderLineArea(AbstractRenderer.java:601)
    at org.apache.fop.render.pdf.PDFRenderer.renderLineArea(PDFRenderer.java:1336)
    at org.apache.fop.render.AbstractRenderer.renderBlocks(AbstractRenderer.java:536)
    at org.apache.fop.render.AbstractRenderer.renderBlock(AbstractRenderer.java:573)
    at org.apache.fop.render.pdf.PDFRenderer.renderBlock(PDFRenderer.java:1329)
    at org.apache.fop.render.AbstractRenderer.renderBlocks(AbstractRenderer.java:526)
    at org.apache.fop.render.AbstractRenderer.renderBlock(AbstractRenderer.java:573)
    at org.apache.fop.render.pdf.PDFRenderer.renderBlock(PDFRenderer.java:1329)
    at org.apache.fop.render.AbstractRenderer.renderBlocks(AbstractRenderer.java:526)
    at org.apache.fop.render.AbstractPathOrientedRenderer.renderReferenceArea(AbstractPathOrientedRenderer.java:548)
    at org.apache.fop.render.AbstractRenderer.renderBlock(AbstractRenderer.java:560)
    at org.apache.fop.render.pdf.PDFRenderer.renderBlock(PDFRenderer.java:1329)
    at org.apache.fop.render.AbstractRenderer.renderBlocks(AbstractRenderer.java:526)
    at org.apache.fop.render.AbstractRenderer.renderBlock(AbstractRenderer.java:573)
    at org.apache.fop.render.pdf.PDFRenderer.renderBlock(PDFRenderer.java:1329)
    at org.apache.fop.render.AbstractRenderer.renderBlocks(AbstractRenderer.java:526)
    at org.apache.fop.render.AbstractRenderer.renderFlow(AbstractRenderer.java:424)
    at org.apache.fop.render.AbstractPathOrientedRenderer.renderFlow(AbstractPathOrientedRenderer.java:577)
    at org.apache.fop.render.AbstractRenderer.renderMainReference(AbstractRenderer.java:403)
    at org.apache.fop.render.AbstractRenderer.renderBodyRegion(AbstractRenderer.java:337)
    at org.apache.fop.render.AbstractRenderer.renderRegionViewport(AbstractRenderer.java:282)
    at org.apache.fop.render.AbstractRenderer.renderPageAreas(AbstractRenderer.java:255)
    at org.apache.fop.render.AbstractRenderer.renderPage(AbstractRenderer.java:229)
    at org.apache.fop.render.pdf.PDFRenderer.renderPage(PDFRenderer.java:801)
    at org.apache.fop.area.RenderPagesModel.addPage(RenderPagesModel.java:113)
    at org.apache.fop.layoutmgr.AbstractPageSequenceLayoutManager.finishPage(AbstractPageSequenceLayoutManager.java:312)
    at org.apache.fop.layoutmgr.PageSequenceLayoutManager.finishPage(PageSequenceLayoutManager.java:167)
    at org.apache.fop.layoutmgr.AbstractPageSequenceLayoutManager.makeNewPage(AbstractPageSequenceLayoutManager.java:284)
    at org.apache.fop.layoutmgr.PageBreaker.handleBreakTrait(PageBreaker.java:494)
    at org.apache.fop.layoutmgr.PageBreaker.startPart(PageBreaker.java:398)
    at org.apache.fop.layoutmgr.AbstractBreaker.addAreas(AbstractBreaker.java:421)
    at org.apache.fop.layoutmgr.AbstractBreaker.addAreas(AbstractBreaker.java:371)
    at org.apache.fop.layoutmgr.PageBreaker.doPhase3(PageBreaker.java:262)
    at org.apache.fop.layoutmgr.AbstractBreaker.doLayout(AbstractBreaker.java:346)
    at org.apache.fop.layoutmgr.AbstractBreaker.doLayout(AbstractBreaker.java:264)
    at org.apache.fop.layoutmgr.PageSequenceLayoutManager.activateLayout(PageSequenceLayoutManager.java:106)
    at org.apache.fop.area.AreaTreeHandler.endPageSequence(AreaTreeHandler.java:234)
    at org.apache.fop.fo.pagination.PageSequence.endOfNode(PageSequence.java:123)
    at org.apache.fop.fo.FOTreeBuilder$MainFOHandler.endElement(FOTreeBuilder.java:340)
    at org.apache.fop.fo.FOTreeBuilder.endElement(FOTreeBuilder.java:169)
    at net.sf.saxon.event.ContentHandlerProxy.endElement(ContentHandlerProxy.java:391)
    at net.sf.saxon.event.NamespaceReducer.endElement(NamespaceReducer.java:213)
    at net.sf.saxon.event.ReceivingContentHandler.endElement(ReceivingContentHandler.java:443)
    at org.apache.xerces.parsers.AbstractSAXParser.endElement(Unknown Source)
    at org.apache.xerces.xinclude.XIncludeHandler.endElement(Unknown Source)
    at org.apache.xerces.impl.XMLNSDocumentScannerImpl.scanEndElement(Unknown Source)
    at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher.dispatch(Unknown Source)
    at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source)
    at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
    at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
    at org.apache.xerces.parsers.XMLParser.parse(Unknown Source)
    at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
    at org.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknown Source)
    at net.sf.saxon.event.Sender.sendSAXSource(Sender.java:404)
    at net.sf.saxon.event.Sender.send(Sender.java:193)
    at net.sf.saxon.IdentityTransformer.transform(IdentityTransformer.java:30)
    at org.apache.fop.cli.InputHandler.transformTo(InputHandler.java:214)
    at org.apache.fop.cli.InputHandler.renderTo(InputHandler.java:125)
    at org.apache.fop.cli.Main.startFOP(Main.java:166)
    at org.apache.fop.cli.Main.main(Main.java:197)
2010-05-06 17:23:46 org.apache.fop.cli.Main startFOP
SEVERE: Exception
java.lang.UnsupportedOperationException: Unsupported DataBuffer type: java.awt.image.DataBufferUShort
    at org.apache.fop.cli.InputHandler.transformTo(InputHandler.java:217)
    at org.apache.fop.cli.InputHandler.renderTo(InputHandler.java:125)
    at org.apache.fop.cli.Main.startFOP(Main.java:166)
    at org.apache.fop.cli.Main.main(Main.java:197)
---------
java.lang.UnsupportedOperationException: Unsupported DataBuffer type: java.awt.image.DataBufferUShort
    at org.apache.fop.pdf.AlphaRasterImage.outputContents(AlphaRasterImage.java:169)
    at org.apache.fop.pdf.PDFImageXObject.outputRawStreamData(PDFImageXObject.java:145)
    at org.apache.fop.pdf.AbstractPDFStream.encodeAndWriteStream(AbstractPDFStream.java:155)
    at org.apache.fop.pdf.AbstractPDFStream.output(AbstractPDFStream.java:199)
    at org.apache.fop.pdf.PDFImageXObject.output(PDFImageXObject.java:65)
    at org.apache.fop.pdf.PDFDocument.output(PDFDocument.java:937)
    at org.apache.fop.render.pdf.PDFRenderer.putImage(PDFRenderer.java:1767)
    at org.apache.fop.render.pdf.PDFRenderer.renderImage(PDFRenderer.java:1679)
    at org.apache.fop.render.AbstractRenderer.renderViewport(AbstractRenderer.java:743)
    at org.apache.fop.render.AbstractPathOrientedRenderer.renderViewport(AbstractPathOrientedRenderer.java:621)
    at org.apache.fop.render.AbstractRenderer.renderInlineArea(AbstractRenderer.java:626)
    at org.apache.fop.render.pdf.PDFRenderer.renderInlineArea(PDFRenderer.java:1345)
    at org.apache.fop.render.AbstractRenderer.renderLineArea(AbstractRenderer.java:601)
    at org.apache.fop.render.pdf.PDFRenderer.renderLineArea(PDFRenderer.java:1336)
    at org.apache.fop.render.AbstractRenderer.renderBlocks(AbstractRenderer.java:536)
    at org.apache.fop.render.AbstractRenderer.renderBlock(AbstractRenderer.java:573)
    at org.apache.fop.render.pdf.PDFRenderer.renderBlock(PDFRenderer.java:1329)
    at org.apache.fop.render.AbstractRenderer.renderBlocks(AbstractRenderer.java:526)
    at org.apache.fop.render.AbstractRenderer.renderBlock(AbstractRenderer.java:573)
    at org.apache.fop.render.pdf.PDFRenderer.renderBlock(PDFRenderer.java:1329)
    at org.apache.fop.render.AbstractRenderer.renderBlocks(AbstractRenderer.java:526)
    at org.apache.fop.render.AbstractPathOrientedRenderer.renderReferenceArea(AbstractPathOrientedRenderer.java:548)
    at org.apache.fop.render.AbstractRenderer.renderBlock(AbstractRenderer.java:560)
    at org.apache.fop.render.pdf.PDFRenderer.renderBlock(PDFRenderer.java:1329)
    at org.apache.fop.render.AbstractRenderer.renderBlocks(AbstractRenderer.java:526)
    at org.apache.fop.render.AbstractRenderer.renderBlock(AbstractRenderer.java:573)
    at org.apache.fop.render.pdf.PDFRenderer.renderBlock(PDFRenderer.java:1329)
    at org.apache.fop.render.AbstractRenderer.renderBlocks(AbstractRenderer.java:526)
    at org.apache.fop.render.AbstractRenderer.renderFlow(AbstractRenderer.java:424)
    at org.apache.fop.render.AbstractPathOrientedRenderer.renderFlow(AbstractPathOrientedRenderer.java:577)
    at org.apache.fop.render.AbstractRenderer.renderMainReference(AbstractRenderer.java:403)
    at org.apache.fop.render.AbstractRenderer.renderBodyRegion(AbstractRenderer.java:337)
    at org.apache.fop.render.AbstractRenderer.renderRegionViewport(AbstractRenderer.java:282)
    at org.apache.fop.render.AbstractRenderer.renderPageAreas(AbstractRenderer.java:255)
    at org.apache.fop.render.AbstractRenderer.renderPage(AbstractRenderer.java:229)
    at org.apache.fop.render.pdf.PDFRenderer.renderPage(PDFRenderer.java:801)
    at org.apache.fop.area.RenderPagesModel.addPage(RenderPagesModel.java:113)
    at org.apache.fop.layoutmgr.AbstractPageSequenceLayoutManager.finishPage(AbstractPageSequenceLayoutManager.java:312)
    at org.apache.fop.layoutmgr.PageSequenceLayoutManager.finishPage(PageSequenceLayoutManager.java:167)
    at org.apache.fop.layoutmgr.AbstractPageSequenceLayoutManager.makeNewPage(AbstractPageSequenceLayoutManager.java:284)
    at org.apache.fop.layoutmgr.PageBreaker.handleBreakTrait(PageBreaker.java:494)
    at org.apache.fop.layoutmgr.PageBreaker.startPart(PageBreaker.java:398)
    at org.apache.fop.layoutmgr.AbstractBreaker.addAreas(AbstractBreaker.java:421)
    at org.apache.fop.layoutmgr.AbstractBreaker.addAreas(AbstractBreaker.java:371)
    at org.apache.fop.layoutmgr.PageBreaker.doPhase3(PageBreaker.java:262)
    at org.apache.fop.layoutmgr.AbstractBreaker.doLayout(AbstractBreaker.java:346)
    at org.apache.fop.layoutmgr.AbstractBreaker.doLayout(AbstractBreaker.java:264)
    at org.apache.fop.layoutmgr.PageSequenceLayoutManager.activateLayout(PageSequenceLayoutManager.java:106)
    at org.apache.fop.area.AreaTreeHandler.endPageSequence(AreaTreeHandler.java:234)
    at org.apache.fop.fo.pagination.PageSequence.endOfNode(PageSequence.java:123)
    at org.apache.fop.fo.FOTreeBuilder$MainFOHandler.endElement(FOTreeBuilder.java:340)
    at org.apache.fop.fo.FOTreeBuilder.endElement(FOTreeBuilder.java:169)
    at net.sf.saxon.event.ContentHandlerProxy.endElement(ContentHandlerProxy.java:391)
    at net.sf.saxon.event.NamespaceReducer.endElement(NamespaceReducer.java:213)
    at net.sf.saxon.event.ReceivingContentHandler.endElement(ReceivingContentHandler.java:443)
    at org.apache.xerces.parsers.AbstractSAXParser.endElement(Unknown Source)
    at org.apache.xerces.xinclude.XIncludeHandler.endElement(Unknown Source)
    at org.apache.xerces.impl.XMLNSDocumentScannerImpl.scanEndElement(Unknown Source)
    at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher.dispatch(Unknown Source)
    at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source)
    at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
    at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
    at org.apache.xerces.parsers.XMLParser.parse(Unknown Source)
    at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
    at org.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknown Source)
    at net.sf.saxon.event.Sender.sendSAXSource(Sender.java:404)
    at net.sf.saxon.event.Sender.send(Sender.java:193)
    at net.sf.saxon.IdentityTransformer.transform(IdentityTransformer.java:30)
    at org.apache.fop.cli.InputHandler.transformTo(InputHandler.java:214)
    at org.apache.fop.cli.InputHandler.renderTo(InputHandler.java:125)
    at org.apache.fop.cli.Main.startFOP(Main.java:166)
    at org.apache.fop.cli.Main.main(Main.java:197)
--8<---------------cut here---------------end--------------->8---

Hello,

The color depth of your PNG images is 32 bits + 8 bits for alpha channel and that is too large for Apache FOP. Author mode of the XML editor can render all types of PNG images but Apache FOP cannot render them if they have more than 24 bits depth. Your images are black and white and have small resolution (220 x 220 pixels) so 256 colors (8 bits depth) should be enough. If you can save the images to disk with 8 bits or 24 bits depth Apache FOP will handle them correctly.

You need to add one of the libraries JIMI or JAI to the FOP classpath for handling PNG images as specified in the User Manual:

http://www.oxygenxml.com/doc/ug-oxygen/transforming-documents.html#XSL-FO-pr...

Regards,
Sorin

On 5/6/2010 6:42 PM, Janusz S. Bień wrote:
> I have all the images including even those missing in
> http://tei.oucs.ox.ac.uk/ENRICH/gBank/gBankImages.zip
> This seems to make the difference.
> Enclosed please find the full list of error messages (I've removed warnings by hand).
> Best regards
> Janusz
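
To find out which images need re-saving, the bit depth can be read straight from each PNG header. The sketch below is an illustration added here, not something from the thread: it parses the IHDR chunk defined by the PNG specification using only the standard library, and the function name and the 24-bit threshold (taken from the reply above, not verified against FOP itself) are assumptions.

```python
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"
# Samples per pixel for each PNG color type (from the PNG specification).
CHANNELS = {0: 1, 2: 3, 3: 1, 4: 2, 6: 4}
# Color types that include an alpha channel.
ALPHA_TYPES = {4, 6}


def png_depth_info(data, max_bits=24):
    """Summarise the IHDR chunk found at the start of PNG file contents.

    max_bits is the total bits-per-pixel limit quoted in the reply above;
    treat it as an assumption rather than a documented FOP constant.
    """
    if len(data) < 26 or data[:8] != PNG_SIGNATURE or data[12:16] != b"IHDR":
        raise ValueError("not a PNG file with a leading IHDR chunk")
    # IHDR payload starts at byte 16: width, height, bit depth, color type.
    width, height, bit_depth, color_type = struct.unpack(">IIBB", data[16:26])
    if color_type not in CHANNELS:
        raise ValueError(f"unknown PNG color type {color_type}")
    bits_per_pixel = bit_depth * CHANNELS[color_type]
    return {
        "width": width,
        "height": height,
        "bit_depth": bit_depth,
        "has_alpha": color_type in ALPHA_TYPES,
        "bits_per_pixel": bits_per_pixel,
        "over_limit": bits_per_pixel > max_bits,
    }
```

Reading the first 26 bytes of each image, for example with `open(path, "rb").read(26)`, and passing them to `png_depth_info` is enough to list the files whose `over_limit` flag is set.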