This web is for holding topics deemed as old or irrelevant for KitWiki. If you think the topic doesn't belong here, please check that it's named properly (is a WikiWord) and descriptively, contains relevant data, and is put back to a relevant web.

textfilters


textfilters, a collection of programs for working with text files

Description

The software consists of several tools for text and document processing:

  • aspell
  • awk
  • cat (1) - concatenate files and print on the standard output
  • cmp (1) - compare two files
  • cut (1) - remove sections from each line of files
  • detect-encoding - analyses the characters of a file
  • diff (1) - find differences between two files
  • dos2unix (1) - DOS/MAC to UNIX text file format converter
  • dos2unix [mac2unix] (1) - DOS/MAC to UNIX text file format converter
  • dos2unix, unix2dos can be used to translate between Unix and Win/Dos formats (available in some CSC machines, and hopefully in corpus.csc.fi)
  • egrep [grep] (1) - print lines matching a pattern
  • examine
  • fgrep [grep] (1) - print lines matching a pattern
  • file (1) - determine file type
  • gawk (1) - pattern scanning and processing language
  • gawk [pgawk] (1) - pattern scanning and processing language
  • grep (1) - print lines matching a pattern
  • groff (1) - front-end for the groff document formatting system, see man 7 groff for a short reference for the GNU roff language
  • grotty (1) - groff driver for typewriter-like devices
  • gs (1) - Ghostscript (PostScript and PDF language interpreter and previewer)
  • gzgrep
  • gzmore
  • head (1) - output the first part of files
  • iconv (1) - Convert encoding of given files from one encoding to another
  • less (1) - opposite of more
  • lesspipe.sh
  • linefreq - a tool for counting line counts using a hash table rather than sorting
  • locale (1) - Get locale - specific information
  • mac2unix
  • more (1) - file perusal filter for crt viewing
  • nroff (1) - emulate nroff command with groff
  • od - can be used to dump and see the contents of files in hexadecimal format e.g. od -c -tx1
  • perl (1) - Practical Extraction and Report Language
  • pico
  • quota (1) - display disk usage and limits
  • rev (1) - reverse lines of a file
  • sed (1) - manual page for sed version 4.1.2
  • sort (1) - sort lines of text files
  • sort (3pm) - perl pragma to control sort() behaviour
  • stty (1) - change and print terminal line settings
  • tac (1) - concatenate and print files in reverse
  • tail (1) - output the last part of files
  • tei2snt
  • tr (1) - translate or delete characters
  • troff (1) - the troff processor of the groff text formatting system
  • uniq (1) - remove duplicate lines from a sorted file
  • unix2mac
  • wc (1) - print the number of newlines, words, and bytes in files
  • zcat
  • xdvi
  • zmore

Version and Copyright Information

version:

copyright:

Usage

See description.

Help, Manuals and Documentation

help commands:
man

further information:

Bugs

License Text

Other Information

Field of science: Linguistics

Available:
corpus

License: LicenseTypePGnu

To be copied to: https://wwwk.csc.fi/english/research/software/textfilters
To be seen at: http://www.csc.fi/english/research/software/textfilters
See also: KitWiki.SuomenKielipankki:Dev:Linguistics_Software, Old.ToolResources
The users may add their own comments to: ToolResource_textfilters_Comments

When editing, please move cursor to the form below. Do not add anything here.
Topic revision: r11 - 2008-11-21 - HennaRiikkaLaitinen
 
This site is powered by the TWiki collaboration platform Powered by PerlCopyright © 2008-2018 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback