i am having a problem with an indented XML File. The File looks like this:
<?xml version="1.0" encoding="UTF-8"?>
<TEI xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://www.i-d-e.de/ns/1.0">
<teiHeader>
<fileDesc>
<titleStmt>
<title>MS Einsiedeln
</title>
</titleStmt>
<publicationStmt>
<p>publicationsStmt empty
</p>
</publicationStmt><sourceDesc>
<p>sourceDesc empty
</p>
</sourceDesc></fileDesc>
</teiHeader>
<text>
<body>
<div>
<div>
<div>
<p><c>D
</c>ie gotheit iſt beſloſſen
<lb/>in dem vater n
<ex>atur
</ex>elich
dar
<lb/>vmbe iſt er alvermvgende
<lb/>vnd enpfat niht von ite
<lb
/>des
<gap reason=""/> er ſelber nit en iſt an
<lb/>ſiner go
<unclear
>tl
</unclear>icher macht wan
<lb/>ers weſelich i
<ex>n
</ex> ime vnd
an
<lb/>ime ſelben beſloſſen hat
<space unit="letters" quantity="1"
/></p> </div>
</div>
</div>
</body>
</text>
</TEI>
Now, using the following XPath 2.0 expression: //text(), the tabs are returned as text-nodes, for example the first tab before the tag <teiHeader>. In fact, my DTD does not allow #PCDATA inside <TEI>, but the document is validated without any problems. To me this seems kind of schizophrenic, or am I mistaken? Btw: the same file in XMLSpy with its build-in xslt engine as well as MS XML parser with the same xPath expression does not return the tabs as text-nodes.
Any ideas?
Philipp
PS: I am using Oxygen 9.3
--
Philipp Steinkrüger M.A.
Philosophisches Seminar der Universität zu Köln
Thomas-Institut
Universitätsstraße 22
50923 Köln
+49 221 4702394
UNIVERSITÄT ZU KÖLN
GUTE IDEEN. SEIT 1388.