Northwestern University Information Technology
MorphAdorner Northwestern
 
Sampling Text Files

MorphAdorner provides two utilities for sampling lines from text files: ExactlySampleTextFile and RandomlySampleTextFile.

ExactlySampleTextFile usage:

exactlysampletextfile input.txt output.txt samplecount

where

  • input.txt -- input text file to be sampled.
  • output.txt -- output text file.
  • samplecount -- size of the exact random sample to extract. Must be a positive integer.

The output file is a text file containing the sampled lines from the input file. Both the input and the output files must be UTF-8 encoded.
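To illustrate what an exact random sample of lines means, here is a minimal Python sketch. It is not MorphAdorner's implementation: the function names, the optional seed parameter, the choice to keep sampled lines in their original order, and the behavior when samplecount exceeds the number of lines are all assumptions made for illustration.

```python
import random


def exactly_sample_lines(lines, sample_count, seed=None):
    """Pick exactly sample_count lines uniformly at random, without replacement.

    Assumption: the sampled lines keep their original relative order, and a
    sample_count larger than the input simply returns every line.
    """
    if sample_count <= 0:
        raise ValueError("sample_count must be a positive integer")
    rng = random.Random(seed)
    k = min(sample_count, len(lines))
    # Choose k distinct line indices, then sort them to preserve file order.
    chosen = sorted(rng.sample(range(len(lines)), k))
    return [lines[i] for i in chosen]


def exactly_sample_file(input_path, output_path, sample_count, seed=None):
    """Read a UTF-8 text file, sample its lines, and write a UTF-8 output file."""
    with open(input_path, encoding="utf-8") as src:
        lines = src.readlines()
    sampled = exactly_sample_lines(lines, sample_count, seed)
    with open(output_path, "w", encoding="utf-8") as dst:
        dst.writelines(sampled)
    return len(sampled)
```

Under these assumptions, calling exactly_sample_file("input.txt", "output.txt", 100) would write 100 lines to output.txt whenever input.txt has at least 100 lines.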

RandomlySampleTextFile usage:

randomlysampletextfile input.txt output.txt samplingpercent

where

  • input.txt -- input text file to be sampled.
  • output.txt -- output text file.
  • samplingpercent -- percentage of input lines to sample, from 0 through 100.

The output file is a text file containing the sampled lines from the input file. Both the input and the output files must be UTF-8 encoded.
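One common way to sample by percentage is to keep each line independently with probability samplingpercent / 100, so the output size is only approximately that share of the input. The Python sketch below shows that approach. Whether RandomlySampleTextFile works this way is an assumption, and the function name and seed parameter are hypothetical.

```python
import random


def randomly_sample_lines(lines, sampling_percent, seed=None):
    """Keep each line independently with probability sampling_percent / 100.

    Assumption: per-line (Bernoulli) selection, so the number of lines
    returned varies from run to run around the requested percentage.
    """
    if not 0 <= sampling_percent <= 100:
        raise ValueError("sampling_percent must be from 0 through 100")
    rng = random.Random(seed)
    probability = sampling_percent / 100.0
    # rng.random() returns a value in [0, 1), so 0 keeps nothing and 100 keeps all.
    return [line for line in lines if rng.random() < probability]
```

The contrast with an exact sample is the design point: a percentage-based pass can process lines one at a time without knowing the file length in advance, at the cost of a sample size that is not fixed.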
