NU IT
Northwestern University Information Technology
MorphAdorner Northwestern
 
MorphAdorner Server Services: Extract Sentences Service

Service name: teiadornedtosentences
Service description: Extract sentences from an adorned TEI XML file.
HTTP methods allowed: POST, OPTIONS
POST accepts as input: application/x-www-form-urlencoded
HTTP return codes: 200: service succeeded
400: service failed with an error

Query parameters

    media Result format. Only text allowed.
    resultsAsAttachedFile Allowed values are true to send the results as an attached file, and false to send the results as a data stream.
    mainTextOnly true to return sentences only from main text, false to return sentences from all of the text.
    teifile TEI input file.

Sample POST form

<form accept-charset="UTF-8" method="post" action="teiadornedtosentences"
      target="_blank"
      enctype="multipart/form-data" name="teiadornedtosentences">
<table cellpadding="0" cellspacing="5">
<tr>
<td>
<strong>Adorned TEI XML file:</strong>
</td>
<td>
<input type="file" name="teifile" size="50">
</td>
</tr>
<tr>
<td>&nbsp;</td>
<td>
<input type="checkbox" name="mainTextOnly" value="true"
       checked="checked"/>
Only return sentences in main text
</td>
</tr>
<tr>
<td>&nbsp;</td>
<td>
<input type="checkbox" name="resultsAsAttachedFile" value="true"
       checked="checked"/>
Send results as attached file
</td>
</tr>
<tr>
<td>
&nbsp;
</td>
<td>
&nbsp;
</td>
</tr>
<tr>
<td colspan="2">
<input type="submit" name="extract" value="Extract Sentences" />
</td>
</tr>
</table>
</form>

Output

The output is return as a sequence of utf-8 encoded text lines, one sentence per line. When mainTextOnly is true, at least the first word of a sentence must be present in the main part of the text.

Home
 
Announcements and News
 
Documentation
 
Download MorphAdorner
 
Glossary
 
Helpful References
 
Licenses
 
Server
 
Talks
 
Tech Talk