Difference: ToolResourceTextfilters (1 vs. 11)

Revision 11 - 2008-11-21 - HennaRiikkaLaitinen

Line: 1 to 1
 
META TOPICPARENT name="Old.ToolResources"
Changed:
<
<

Warning: Can't find topic KitWiki.ToolResourcesTemplateInclude

>
>
<-- 
  • ALLOWTOPICCHANGE = CscGroup
  • Please edit only editing.
-->

textfilters


textfilters, a collection of programs for working with text files

Description

The software consists of several tools for text and document processing:

  • aspell
  • awk
  • cat (1) - concatenate files and print on the standard output
  • cmp (1) - compare two files
  • cut (1) - remove sections from each line of files
  • detect-encoding - analyses the characters of a file
  • diff (1) - find differences between two files
  • dos2unix (1) - DOS/MAC to UNIX text file format converter
  • dos2unix [mac2unix] (1) - DOS/MAC to UNIX text file format converter
  • dos2unix, unix2dos - can be used to convert between Unix and Windows/DOS text file formats (available on some CSC machines, and hopefully on corpus.csc.fi)
  • egrep [grep] (1) - print lines matching a pattern
  • examine
  • fgrep [grep] (1) - print lines matching a pattern
  • file (1) - determine file type
  • gawk (1) - pattern scanning and processing language
  • gawk [pgawk] (1) - pattern scanning and processing language
  • grep (1) - print lines matching a pattern
  • groff (1) - front-end for the groff document formatting system, see man 7 groff for a short reference for the GNU roff language
  • grotty (1) - groff driver for typewriter-like devices
  • gs (1) - Ghostscript (PostScript and PDF language interpreter and previewer)
  • gzgrep
  • gzmore
  • head (1) - output the first part of files
  • iconv (1) - Convert encoding of given files from one encoding to another
  • less (1) - opposite of more
  • lesspipe.sh
  • linefreq - a tool for counting line frequencies using a hash table rather than sorting
  • locale (1) - Get locale-specific information
  • mac2unix
  • more (1) - file perusal filter for crt viewing
  • nroff (1) - emulate nroff command with groff
  • od - can be used to dump and view the contents of files in hexadecimal format, e.g. od -c -tx1
  • perl (1) - Practical Extraction and Report Language
  • pico
  • quota (1) - display disk usage and limits
  • rev (1) - reverse lines of a file
  • sed (1) - manual page for sed version 4.1.2
  • sort (1) - sort lines of text files
  • sort (3pm) - perl pragma to control sort() behaviour
  • stty (1) - change and print terminal line settings
  • tac (1) - concatenate and print files in reverse
  • tail (1) - output the last part of files
  • tei2snt
  • tr (1) - translate or delete characters
  • troff (1) - the troff processor of the groff text formatting system
  • uniq (1) - remove duplicate lines from a sorted file
  • unix2mac
  • wc (1) - print the number of newlines, words, and bytes in files
  • zcat
  • xdvi
  • zmore
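
The linefreq entry above describes counting lines with a hash table rather than by sorting. The interface of linefreq itself is not documented on this page, so the following is only a minimal sketch of that technique in Python; the function name line_frequencies and the sample lines are hypothetical, not part of the tool.

```python
from collections import Counter


def line_frequencies(lines):
    """Count how often each distinct line occurs.

    A Counter is a hash table, so each line is counted in one pass
    without sorting the input first (unlike `sort | uniq -c`).
    """
    counts = Counter()
    for line in lines:
        # Strip the trailing newline so "kissa\n" and "kissa" count as one line.
        counts[line.rstrip("\n")] += 1
    return counts


# Hypothetical sample input; in practice this could be an open file object.
sample = ["kissa\n", "koira\n", "kissa\n", "kala\n", "kissa\n"]
freq = line_frequencies(sample)

# Print the most frequent lines first.
for line, count in freq.most_common():
    print(count, line)
```

The hash-table approach needs memory proportional to the number of distinct lines, but it avoids sorting the whole input, which can matter for large corpus files.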

Version and Copyright Information

version:

copyright:

Usage

See description.
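
As an illustration of what the line-ending converters and the byte dump mentioned in the description do, here is a small Python sketch. It is not the implementation of dos2unix, unix2dos or od; the function names dos_to_unix, unix_to_dos and hex_dump and the sample text are assumptions made for this example.

```python
def dos_to_unix(data: bytes) -> bytes:
    """Convert DOS/Windows (CR LF) line endings to Unix (LF)."""
    return data.replace(b"\r\n", b"\n")


def unix_to_dos(data: bytes) -> bytes:
    """Convert Unix (LF) line endings to DOS/Windows (CR LF)."""
    # Normalise first so existing CR LF pairs are not turned into CR CR LF.
    return dos_to_unix(data).replace(b"\n", b"\r\n")


def hex_dump(data: bytes) -> str:
    """Return the bytes as space-separated hex values, similar in spirit
    to the per-byte view of `od -tx1`."""
    return " ".join(f"{b:02x}" for b in data)


# Hypothetical sample text with DOS line endings.
dos_text = b"yksi\r\nkaksi\r\n"
unix_text = dos_to_unix(dos_text)

# The 0d (CR) bytes visible in the first dump are absent from the second.
print(hex_dump(dos_text))
print(hex_dump(unix_text))
```

Working on bytes rather than decoded text keeps the sketch independent of the file's character encoding, which tools such as iconv or detect-encoding would handle separately.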

Help, Manuals and Documentation

help commands:
man

further information:

Bugs

License Text

Other Information

Field of science: Linguistics

Available:
corpus

License: LicenseTypePGnu

To be copied to: https://wwwk.csc.fi/english/research/software/textfilters
To be seen at: http://www.csc.fi/english/research/software/textfilters
See also: KitWiki.SuomenKielipankki:Dev:Linguistics_Software, Old.ToolResources
The users may add their own comments to: ToolResource_textfilters_Comments

  When editing, please move cursor to the form below. Do not add anything here.

Revision 10 - 2008-11-10 - HennaRiikkaLaitinen

Line: 1 to 1
Changed:
<
<
META TOPICPARENT name="ToolResources"

>
>
META TOPICPARENT name="Old.ToolResources"

Warning: Can't find topic KitWiki.ToolResourcesTemplateInclude

  When editing, please move cursor to the form below. Do not add anything here.
Line: 35 to 35
 
FORM FIELD FirstFewLinks FirstFewLinks
FORM FIELD ResourceMaintainer ResourceMaintainer CSC
FORM FIELD DeprecatedUrls DeprecatedUrls
Changed:
<
<
META TOPICMOVED by="AnssiYliJyra" date="1161860494" from="KitWiki.ToolResource_unixfiletools" to="KitWiki.ToolResource_textfilters"
>
>
META TOPICMOVED by="HennaRiikkaLaitinen" date="1226317658" from="KitWiki.ToolResource_textfilters" to="Old.ToolResourceTextfilters"

Revision 9 - 2008-11-10 - HennaRiikkaLaitinen

Line: 1 to 1
 
META TOPICPARENT name="ToolResources"

  • Set EDITBOXHEIGHT = 4
Line: 21 to 21
 
FORM FIELD HelpCommand HelpCommand man
FORM FIELD HelpText HelpText
FORM FIELD FullName FullName textfilters
Changed:
<
<
|*FORM FIELD Description*|Description|The software consists of several tools for text and document processing:
  • aspell
  • awk
  • cat (1) - concatenate files and print on the standard output
  • cmp (1) - compare two files
  • cut (1) - remove sections from each line of files
  • detect-encoding - analyses the characters of a file
  • diff (1) - find differences between two files
  • dos2unix (1) - DOS/MAC to UNIX text file format converter
  • dos2unix [mac2unix] (1) - DOS/MAC to UNIX text file format converter
  • dos2unix, unix2dos - can be used to convert between Unix and Windows/DOS text file formats (available on some CSC machines, and hopefully on corpus.csc.fi)
  • egrep [grep] (1) - print lines matching a pattern
  • examine
  • fgrep [grep] (1) - print lines matching a pattern
  • file (1) - determine file type
  • gawk (1) - pattern scanning and processing language
  • gawk [pgawk] (1) - pattern scanning and processing language
  • grep (1) - print lines matching a pattern
  • groff (1) - front-end for the groff document formatting system, see man 7 groff for a short reference for the GNU roff language
  • grotty (1) - groff driver for typewriter-like devices
  • gs (1) - Ghostscript (PostScript and PDF language interpreter and previewer)
  • gzgrep
  • gzmore
  • head (1) - output the first part of files
  • iconv (1) - Convert encoding of given files from one encoding to another
  • less (1) - opposite of more
  • lesspipe.sh
  • linefreq - a tool for counting line frequencies using a hash table rather than sorting
  • locale (1) - Get locale-specific information
  • mac2unix
  • more (1) - file perusal filter for crt viewing
  • nroff (1) - emulate nroff command with groff
  • od - can be used to dump and view the contents of files in hexadecimal format, e.g. od -c -tx1
  • perl (1) - Practical Extraction and Report Language
  • pico
  • quota (1) - display disk usage and limits
  • rev (1) - reverse lines of a file
  • sed (1) - manual page for sed version 4.1.2
  • sort (1) - sort lines of text files
  • sort (3pm) - perl pragma to control sort() behaviour
  • stty (1) - change and print terminal line settings
  • tac (1) - concatenate and print files in reverse
  • tail (1) - output the last part of files
  • tei2snt
  • tr (1) - translate or delete characters
  • troff (1) - the troff processor of the groff text formatting system
  • uniq (1) - remove duplicate lines from a sorted file
  • unix2mac
  • wc (1) - print the number of newlines, words, and bytes in files
  • zcat
  • xdvi
  • zmore
|
>
>
|*FORM FIELD Description*|Description|The software consists of several tools for text and document processing:
  • aspell
  • awk
  • cat (1) - concatenate files and print on the standard output
  • cmp (1) - compare two files
  • cut (1) - remove sections from each line of files
  • detect-encoding - analyses the characters of a file
  • diff (1) - find differences between two files
  • dos2unix (1) - DOS/MAC to UNIX text file format converter
  • dos2unix [mac2unix] (1) - DOS/MAC to UNIX text file format converter
  • dos2unix, unix2dos - can be used to convert between Unix and Windows/DOS text file formats (available on some CSC machines, and hopefully on corpus.csc.fi)
  • egrep [grep] (1) - print lines matching a pattern
  • examine
  • fgrep [grep] (1) - print lines matching a pattern
  • file (1) - determine file type
  • gawk (1) - pattern scanning and processing language
  • gawk [pgawk] (1) - pattern scanning and processing language
  • grep (1) - print lines matching a pattern
  • groff (1) - front-end for the groff document formatting system, see man 7 groff for a short reference for the GNU roff language
  • grotty (1) - groff driver for typewriter-like devices
  • gs (1) - Ghostscript (PostScript and PDF language interpreter and previewer)
  • gzgrep
  • gzmore
  • head (1) - output the first part of files
  • iconv (1) - Convert encoding of given files from one encoding to another
  • less (1) - opposite of more
  • lesspipe.sh
  • linefreq - a tool for counting line frequencies using a hash table rather than sorting
  • locale (1) - Get locale-specific information
  • mac2unix
  • more (1) - file perusal filter for crt viewing
  • nroff (1) - emulate nroff command with groff
  • od - can be used to dump and view the contents of files in hexadecimal format, e.g. od -c -tx1
  • perl (1) - Practical Extraction and Report Language
  • pico
  • quota (1) - display disk usage and limits
  • rev (1) - reverse lines of a file
  • sed (1) - manual page for sed version 4.1.2
  • sort (1) - sort lines of text files
  • sort (3pm) - perl pragma to control sort() behaviour
  • stty (1) - change and print terminal line settings
  • tac (1) - concatenate and print files in reverse
  • tail (1) - output the last part of files
  • tei2snt
  • tr (1) - translate or delete characters
  • troff (1) - the troff processor of the groff text formatting system
  • uniq (1) - remove duplicate lines from a sorted file
  • unix2mac
  • wc (1) - print the number of newlines, words, and bytes in files
  • zcat
  • xdvi
  • zmore
|
 
FORM FIELD Version Version
FORM FIELD Usage Usage See description.
FORM FIELD Bugs Bugs

Revision 8 - 2007-03-02 - EeroVitie

Line: 1 to 1
 
META TOPICPARENT name="ToolResources"

  • Set EDITBOXHEIGHT = 4
Line: 25 to 25
 
FORM FIELD Version Version
FORM FIELD Usage Usage See description.
FORM FIELD Bugs Bugs
Changed:
<
<
FORM FIELD ShortDescription ShortDescription a collection of unix tools to work with text files
>
>
FORM FIELD ShortDescription ShortDescription a collection of programs for working with text files
 
FORM FIELD Citation Citation
FORM FIELD InstallationProcedure InstallationProcedure
FORM FIELD CurrentConfiguration CurrentConfiguration

Revision 7 - 2007-03-02 - EeroVitie

Line: 1 to 1
 
META TOPICPARENT name="ToolResources"

  • Set EDITBOXHEIGHT = 4
Line: 7 to 7
 

META FORM name="ToolResourcesForm"
Changed:
<
<
FORM FIELD UnixLocation UnixLocation
FORM FIELD WebInterface WebInterface
FORM FIELD Group Group all
FORM FIELD Copyright Copyright
FORM FIELD LicenseType LicenseType LicenseTypePGnu
FORM FIELD License License GPL
FORM FIELD LicenseText LicenseText
FORM FIELD ProviderName ProviderName GNU
FORM FIELD LatestVersion LatestVersion
FORM FIELD HomePage HomePage
FORM FIELD ShortName ShortName textfilters
FORM FIELD HelpCommand HelpCommand man
FORM FIELD HelpText HelpText
FORM FIELD FullName FullName textfilters
|*FORM FIELD Description*|Description|The software consists of several tools for text and document processing:
  • aspell
  • awk
  • cat (1) - concatenate files and print on the standard output
  • cmp (1) - compare two files
  • cut (1) - remove sections from each line of files
  • detect-encoding - analyses the characters of a file
  • diff (1) - find differences between two files
  • dos2unix (1) - DOS/MAC to UNIX text file format converter
  • dos2unix [mac2unix] (1) - DOS/MAC to UNIX text file format converter
  • dos2unix, unix2dos - can be used to convert between Unix and Windows/DOS text file formats (available on some CSC machines, and hopefully on corpus.csc.fi)
  • egrep [grep] (1) - print lines matching a pattern
  • examine
  • fgrep [grep] (1) - print lines matching a pattern
  • file (1) - determine file type
  • gawk (1) - pattern scanning and processing language
  • gawk [pgawk] (1) - pattern scanning and processing language
  • grep (1) - print lines matching a pattern
  • groff (1) - front-end for the groff document formatting system, see man 7 groff for a short reference for the GNU roff language
  • grotty (1) - groff driver for typewriter-like devices
  • gs (1) - Ghostscript (PostScript and PDF language interpreter and previewer)
  • gzgrep
  • gzmore
  • head (1) - output the first part of files
  • iconv (1) - Convert encoding of given files from one encoding to another
  • less (1) - opposite of more
  • lesspipe.sh
  • linefreq - a tool for counting line frequencies using a hash table rather than sorting
  • locale (1) - Get locale-specific information
  • mac2unix
  • more (1) - file perusal filter for crt viewing
  • nroff (1) - emulate nroff command with groff
  • od - can be used to dump and view the contents of files in hexadecimal format, e.g. od -c -tx1
  • perl (1) - Practical Extraction and Report Language
  • pico
  • quota (1) - display disk usage and limits
  • rev (1) - reverse lines of a file
  • sed (1) - manual page for sed version 4.1.2
  • sort (1) - sort lines of text files
  • sort (3pm) - perl pragma to control sort() behaviour
  • stty (1) - change and print terminal line settings
  • tac (1) - concatenate and print files in reverse
  • tail (1) - output the last part of files
  • tei2snt
  • tr (1) - translate or delete characters
  • troff (1) - the troff processor of the groff text formatting system
  • uniq (1) - remove duplicate lines from a sorted file
  • unix2mac
  • wc (1) - print the number of newlines, words, and bytes in files
  • zcat
  • xdvi
  • zmore
|
FORM FIELD Version Version
FORM FIELD Usage Usage
FORM FIELD Bugs Bugs
FORM FIELD ShortDescription ShortDescription a local collection of tools and filters for text files
FORM FIELD Citation Citation
FORM FIELD InstallationProcedure InstallationProcedure
FORM FIELD CurrentConfiguration CurrentConfiguration
FORM FIELD Discussion Discussion
FORM FIELD CscContacts CscContacts
FORM FIELD ProviderContacts ProviderContacts
FORM FIELD FirstFewLinks FirstFewLinks
FORM FIELD ResourceMaintainer ResourceMaintainer CSC
>
>
FORM FIELD UnixLocation UnixLocation
FORM FIELD WebInterface WebInterface
FORM FIELD Group Group all
FORM FIELD Copyright Copyright
FORM FIELD LicenseType LicenseType LicenseTypePGnu
FORM FIELD License License GPL
FORM FIELD LicenseText LicenseText
FORM FIELD ProviderName ProviderName GNU
FORM FIELD LatestVersion LatestVersion
FORM FIELD HomePage HomePage
FORM FIELD ShortName ShortName textfilters
FORM FIELD HelpCommand HelpCommand man
FORM FIELD HelpText HelpText
FORM FIELD FullName FullName textfilters
|*FORM FIELD Description*|Description|The software consists of several tools for text and document processing:
  • aspell
  • awk
  • cat (1) - concatenate files and print on the standard output
  • cmp (1) - compare two files
  • cut (1) - remove sections from each line of files
  • detect-encoding - analyses the characters of a file
  • diff (1) - find differences between two files
  • dos2unix (1) - DOS/MAC to UNIX text file format converter
  • dos2unix [mac2unix] (1) - DOS/MAC to UNIX text file format converter
  • dos2unix, unix2dos - can be used to convert between Unix and Windows/DOS text file formats (available on some CSC machines, and hopefully on corpus.csc.fi)
  • egrep [grep] (1) - print lines matching a pattern
  • examine
  • fgrep [grep] (1) - print lines matching a pattern
  • file (1) - determine file type
  • gawk (1) - pattern scanning and processing language
  • gawk [pgawk] (1) - pattern scanning and processing language
  • grep (1) - print lines matching a pattern
  • groff (1) - front-end for the groff document formatting system, see man 7 groff for a short reference for the GNU roff language
  • grotty (1) - groff driver for typewriter-like devices
  • gs (1) - Ghostscript (PostScript and PDF language interpreter and previewer)
  • gzgrep
  • gzmore
  • head (1) - output the first part of files
  • iconv (1) - Convert encoding of given files from one encoding to another
  • less (1) - opposite of more
  • lesspipe.sh
  • linefreq - a tool for counting line frequencies using a hash table rather than sorting
  • locale (1) - Get locale-specific information
  • mac2unix
  • more (1) - file perusal filter for crt viewing
  • nroff (1) - emulate nroff command with groff
  • od - can be used to dump and view the contents of files in hexadecimal format, e.g. od -c -tx1
  • perl (1) - Practical Extraction and Report Language
  • pico
  • quota (1) - display disk usage and limits
  • rev (1) - reverse lines of a file
  • sed (1) - manual page for sed version 4.1.2
  • sort (1) - sort lines of text files
  • sort (3pm) - perl pragma to control sort() behaviour
  • stty (1) - change and print terminal line settings
  • tac (1) - concatenate and print files in reverse
  • tail (1) - output the last part of files
  • tei2snt
  • tr (1) - translate or delete characters
  • troff (1) - the troff processor of the groff text formatting system
  • uniq (1) - remove duplicate lines from a sorted file
  • unix2mac
  • wc (1) - print the number of newlines, words, and bytes in files
  • zcat
  • xdvi
  • zmore
|
FORM FIELD Version Version
FORM FIELD Usage Usage See description.
FORM FIELD Bugs Bugs
FORM FIELD ShortDescription ShortDescription a collection of unix tools to work with text files
FORM FIELD Citation Citation
FORM FIELD InstallationProcedure InstallationProcedure
FORM FIELD CurrentConfiguration CurrentConfiguration
FORM FIELD Discussion Discussion
FORM FIELD CscContacts CscContacts
FORM FIELD ProviderContacts ProviderContacts
FORM FIELD FirstFewLinks FirstFewLinks
FORM FIELD ResourceMaintainer ResourceMaintainer CSC
FORM FIELD DeprecatedUrls DeprecatedUrls
 
META TOPICMOVED by="AnssiYliJyra" date="1161860494" from="KitWiki.ToolResource_unixfiletools" to="KitWiki.ToolResource_textfilters"

Revision 6 - 2006-12-05 - AnssiYliJyra

Line: 1 to 1
 
META TOPICPARENT name="ToolResources"
<-- 
  • ALLOWTOPICCHANGE = CscGroup
  • Please edit only editing.
-->

textfilters


textfilters, a collection of programs for working with text files

Description

The software consists of several tools for text and document processing:

  • aspell
  • awk
  • cat (1) - concatenate files and print on the standard output
  • cmp (1) - compare two files
  • cut (1) - remove sections from each line of files
  • detect-encoding - analyses the characters of a file
  • diff (1) - find differences between two files
  • dos2unix (1) - DOS/MAC to UNIX text file format converter
  • dos2unix [mac2unix] (1) - DOS/MAC to UNIX text file format converter
  • dos2unix, unix2dos - can be used to convert between Unix and Windows/DOS text file formats (available on some CSC machines, and hopefully on corpus.csc.fi)
  • egrep [grep] (1) - print lines matching a pattern
  • examine
  • fgrep [grep] (1) - print lines matching a pattern
  • file (1) - determine file type
  • gawk (1) - pattern scanning and processing language
  • gawk [pgawk] (1) - pattern scanning and processing language
  • grep (1) - print lines matching a pattern
  • groff (1) - front-end for the groff document formatting system, see man 7 groff for a short reference for the GNU roff language
  • grotty (1) - groff driver for typewriter-like devices
  • gs (1) - Ghostscript (PostScript and PDF language interpreter and previewer)
  • gzgrep
  • gzmore
  • head (1) - output the first part of files
  • iconv (1) - Convert encoding of given files from one encoding to another
  • less (1) - opposite of more
  • lesspipe.sh
  • linefreq - a tool for counting line frequencies using a hash table rather than sorting
  • locale (1) - get locale-specific information
  • mac2unix
  • more (1) - file perusal filter for crt viewing
  • nroff (1) - emulate nroff command with groff
  • od - dumps file contents for inspection, e.g. od -c -tx1 shows characters together with hexadecimal byte values
  • perl (1) - Practical Extraction and Report Language
  • pico
  • quota (1) - display disk usage and limits
  • rev (1) - reverse lines of a file
  • sed (1) - manual page for sed version 4.1.2
  • sort (1) - sort lines of text files
  • sort (3pm) - perl pragma to control sort() behaviour
  • stty (1) - change and print terminal line settings
  • tac (1) - concatenate and print files in reverse
  • tail (1) - output the last part of files
  • tei2snt
  • tr (1) - translate or delete characters
  • troff (1) - the troff processor of the groff text formatting system
  • uniq (1) - remove duplicate lines from a sorted file
  • unix2mac
  • wc (1) - print the number of newlines, words, and bytes in files
  • zcat
  • xdvi
  • zmore
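
The linefreq entry above mentions counting with a hash table rather than sorting. The linefreq program itself is not documented on this page, so the following is only a minimal Python sketch of that general idea; the function name count_line_frequencies is hypothetical. It should give the same counts as a sort | uniq -c pipeline, but without sorting the input first.

```python
from collections import Counter
from typing import Iterable


def count_line_frequencies(lines: Iterable[str]) -> Counter:
    """Count how often each distinct line occurs, using a hash table.

    Hypothetical helper for illustration only; not the real linefreq tool.
    """
    counts: Counter = Counter()
    for line in lines:
        # Strip only the line terminator, so "a\n" and "a" count as the same line.
        counts[line.rstrip("\r\n")] += 1
    return counts


if __name__ == "__main__":
    sample = ["kissa\n", "koira\n", "kissa\n"]
    # most_common() lists the most frequent lines first.
    for line, n in count_line_frequencies(sample).most_common():
        print(n, line)
```

Each line costs one dictionary update, so the input can be read as a stream; only the distinct lines need to fit in memory.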

Version and Copyright Information

version:

copyright:

Usage

See description.
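
As one illustration of the tools in the description: dos2unix and mac2unix convert DOS and old Mac line endings to the Unix form. The sketch below shows the underlying transformation in Python. It is not how those programs are implemented, and the function name to_unix_newlines is made up for this example.

```python
def to_unix_newlines(data: bytes) -> bytes:
    """Convert DOS (CRLF) and old Mac (CR) line endings to Unix (LF)."""
    # Replace CRLF first so that its CR is not converted a second time.
    return data.replace(b"\r\n", b"\n").replace(b"\r", b"\n")


if __name__ == "__main__":
    print(to_unix_newlines(b"first\r\nsecond\rthird\n"))
```

Working on bytes rather than decoded text keeps the conversion independent of the file's character encoding, as long as the encoding is ASCII-compatible.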

Help, Manuals and Documentation

help commands:
man

further information:

Bugs

License Text

Other Information

Field of science: Linguistics

Available:
corpus

License: LicenseTypePGnu

To be copied to: https://wwwk.csc.fi/english/research/software/textfilters
To be seen at: http://www.csc.fi/english/research/software/textfilters
See also: KitWiki.SuomenKielipankki:Dev:Linguistics_Software, Old.ToolResources
The users may add their own comments to: ToolResource_textfilters_Comments

  • Set EDITBOXHEIGHT = 4

Revision 5 - 2006-12-05 - AnssiYliJyra

Line: 33 to 33
 
FORM FIELD CscContacts CscContacts
FORM FIELD ProviderContacts ProviderContacts
FORM FIELD FirstFewLinks FirstFewLinks
Added:
>
>
FORM FIELD ResourceMaintainer ResourceMaintainer CSC
 
META TOPICMOVED by="AnssiYliJyra" date="1161860494" from="KitWiki.ToolResource_unixfiletools" to="KitWiki.ToolResource_textfilters"

Revision 4 - 2006-11-08 - AnssiYliJyra

Line: 20 to 20
 
FORM FIELD ShortName ShortName textfilters
FORM FIELD HelpCommand HelpCommand man
FORM FIELD HelpText HelpText
Changed:
<
<
FORM FIELD FullName FullName
>
>
FORM FIELD FullName FullName textfilters
 |*FORM FIELD Description*|Description|The software consists of several tools for text and document processing:
  • aspell
  • awk
  • cat (1) - concatenate files and print on the standard output
  • cmp (1) - compare two files
  • cut (1) - remove sections from each line of files
  • detect-encoding - analyses the characters of a file
  • diff (1) - find differences between two files
  • dos2unix (1) - DOS/MAC to UNIX text file format converter
  • dos2unix [mac2unix] (1) - DOS/MAC to UNIX text file format converter
  • dos2unix, unix2dos - convert text files between Unix and Windows/DOS line-ending formats (available on some CSC machines, and hopefully on corpus.csc.fi)
  • egrep [grep] (1) - print lines matching a pattern
  • examine
  • fgrep [grep] (1) - print lines matching a pattern
  • file (1) - determine file type
  • gawk (1) - pattern scanning and processing language
  • gawk [pgawk] (1) - pattern scanning and processing language
  • grep (1) - print lines matching a pattern
  • groff (1) - front-end for the groff document formatting system, see man 7 groff for a short reference for the GNU roff language
  • grotty (1) - groff driver for typewriter-like devices
  • gs (1) - Ghostscript (PostScript and PDF language interpreter and previewer)
  • gzgrep
  • gzmore
  • head (1) - output the first part of files
  • iconv (1) - Convert encoding of given files from one encoding to another
  • less (1) - opposite of more
  • lesspipe.sh
  • linefreq - a tool for counting line frequencies using a hash table rather than sorting
  • locale (1) - get locale-specific information
  • mac2unix
  • more (1) - file perusal filter for crt viewing
  • nroff (1) - emulate nroff command with groff
  • od - dumps file contents for inspection, e.g. od -c -tx1 shows characters together with hexadecimal byte values
  • perl (1) - Practical Extraction and Report Language
  • pico
  • quota (1) - display disk usage and limits
  • rev (1) - reverse lines of a file
  • sed (1) - manual page for sed version 4.1.2
  • sort (1) - sort lines of text files
  • sort (3pm) - perl pragma to control sort() behaviour
  • stty (1) - change and print terminal line settings
  • tac (1) - concatenate and print files in reverse
  • tail (1) - output the last part of files
  • tei2snt
  • tr (1) - translate or delete characters
  • troff (1) - the troff processor of the groff text formatting system
  • uniq (1) - remove duplicate lines from a sorted file
  • unix2mac
  • wc (1) - print the number of newlines, words, and bytes in files
  • zcat
  • xdvi
  • zmore
|
FORM FIELD Version Version
FORM FIELD Usage Usage
FORM FIELD Bugs Bugs
Changed:
<
<
FORM FIELD ShortDescription ShortDescription text filters
>
>
FORM FIELD ShortDescription ShortDescription a local collection of tools and filters for text files
FORM FIELD Citation Citation
FORM FIELD InstallationProcedure InstallationProcedure
FORM FIELD CurrentConfiguration CurrentConfiguration
FORM FIELD Discussion Discussion
FORM FIELD CscContacts CscContacts
FORM FIELD ProviderContacts ProviderContacts
FORM FIELD FirstFewLinks FirstFewLinks
 
META TOPICMOVED by="AnssiYliJyra" date="1161860494" from="KitWiki.ToolResource_unixfiletools" to="KitWiki.ToolResource_textfilters"

Revision 3 - 2006-11-02 - AnssiYliJyra

Line: 11 to 11
 
FORM FIELD WebInterface WebInterface
FORM FIELD Group Group all
FORM FIELD Copyright Copyright
Changed:
<
<
FORM FIELD LicenseType LicenseType LicenseTypeGeneralPublic
>
>
FORM FIELD LicenseType LicenseType LicenseTypePGnu
 
FORM FIELD License License GPL
FORM FIELD LicenseText LicenseText
FORM FIELD ProviderName ProviderName GNU

Revision 2 - 2006-11-02 - AnssiYliJyra

Line: 11 to 11
 
FORM FIELD WebInterface WebInterface
FORM FIELD Group Group all
FORM FIELD Copyright Copyright
Changed:
<
<
FORM FIELD LicenseType LicenseType C
>
>
FORM FIELD LicenseType LicenseType LicenseTypeGeneralPublic
 
FORM FIELD License License GPL
FORM FIELD LicenseText LicenseText
FORM FIELD ProviderName ProviderName GNU
FORM FIELD LatestVersion LatestVersion
FORM FIELD HomePage HomePage
Changed:
<
<
FORM FIELD ShortName ShortName textfilters
>
>
FORM FIELD ShortName ShortName textfilters
 
FORM FIELD HelpCommand HelpCommand man
FORM FIELD HelpText HelpText
Changed:
<
<
FORM FIELD FullName FullName GNU Text Filters
>
>
FORM FIELD FullName FullName
 |*FORM FIELD Description*|Description|The software consists of several tools for text and document processing:
  • aspell
  • awk
  • cat (1) - concatenate files and print on the standard output
  • cmp (1) - compare two files
  • cut (1) - remove sections from each line of files
  • detect-encoding - analyses the characters of a file
  • diff (1) - find differences between two files
  • dos2unix (1) - DOS/MAC to UNIX text file format converter
  • dos2unix [mac2unix] (1) - DOS/MAC to UNIX text file format converter
  • dos2unix, unix2dos - convert text files between Unix and Windows/DOS line-ending formats (available on some CSC machines, and hopefully on corpus.csc.fi)
  • egrep [grep] (1) - print lines matching a pattern
  • examine
  • fgrep [grep] (1) - print lines matching a pattern
  • file (1) - determine file type
  • gawk (1) - pattern scanning and processing language
  • gawk [pgawk] (1) - pattern scanning and processing language
  • grep (1) - print lines matching a pattern
  • groff (1) - front-end for the groff document formatting system, see man 7 groff for a short reference for the GNU roff language
  • grotty (1) - groff driver for typewriter-like devices
  • gs (1) - Ghostscript (PostScript and PDF language interpreter and previewer)
  • gzgrep
  • gzmore
  • head (1) - output the first part of files
  • iconv (1) - Convert encoding of given files from one encoding to another
  • less (1) - opposite of more
  • lesspipe.sh
  • linefreq - a tool for counting line frequencies using a hash table rather than sorting
  • locale (1) - get locale-specific information
  • mac2unix
  • more (1) - file perusal filter for crt viewing
  • nroff (1) - emulate nroff command with groff
  • od - dumps file contents for inspection, e.g. od -c -tx1 shows characters together with hexadecimal byte values
  • perl (1) - Practical Extraction and Report Language
  • pico
  • quota (1) - display disk usage and limits
  • rev (1) - reverse lines of a file
  • sed (1) - manual page for sed version 4.1.2
  • sort (1) - sort lines of text files
  • sort (3pm) - perl pragma to control sort() behaviour
  • stty (1) - change and print terminal line settings
  • tac (1) - concatenate and print files in reverse
  • tail (1) - output the last part of files
  • tei2snt
  • tr (1) - translate or delete characters
  • troff (1) - the troff processor of the groff text formatting system
  • uniq (1) - remove duplicate lines from a sorted file
  • unix2mac
  • wc (1) - print the number of newlines, words, and bytes in files
  • zcat
  • xdvi
  • zmore
|
FORM FIELD Version Version
FORM FIELD Usage Usage
FORM FIELD Bugs Bugs
Added:
>
>
FORM FIELD ShortDescription ShortDescription text filters
 
META TOPICMOVED by="AnssiYliJyra" date="1161860494" from="KitWiki.ToolResource_unixfiletools" to="KitWiki.ToolResource_textfilters"

Revision 1 - 2006-10-26 - AnssiYliJyra

Line: 1 to 1
Added:
>
>
META TOPICPARENT name="ToolResources"
<-- 
  • ALLOWTOPICCHANGE = CscGroup
  • Please edit only editing.
-->

textfilters


textfilters, a collection of programs for working with text files

Description

The software consists of several tools for text and document processing:

  • aspell
  • awk
  • cat (1) - concatenate files and print on the standard output
  • cmp (1) - compare two files
  • cut (1) - remove sections from each line of files
  • detect-encoding - analyses the characters of a file
  • diff (1) - find differences between two files
  • dos2unix (1) - DOS/MAC to UNIX text file format converter
  • dos2unix [mac2unix] (1) - DOS/MAC to UNIX text file format converter
  • dos2unix, unix2dos - convert text files between Unix and Windows/DOS line-ending formats (available on some CSC machines, and hopefully on corpus.csc.fi)
  • egrep [grep] (1) - print lines matching a pattern
  • examine
  • fgrep [grep] (1) - print lines matching a pattern
  • file (1) - determine file type
  • gawk (1) - pattern scanning and processing language
  • gawk [pgawk] (1) - pattern scanning and processing language
  • grep (1) - print lines matching a pattern
  • groff (1) - front-end for the groff document formatting system, see man 7 groff for a short reference for the GNU roff language
  • grotty (1) - groff driver for typewriter-like devices
  • gs (1) - Ghostscript (PostScript and PDF language interpreter and previewer)
  • gzgrep
  • gzmore
  • head (1) - output the first part of files
  • iconv (1) - Convert encoding of given files from one encoding to another
  • less (1) - opposite of more
  • lesspipe.sh
  • linefreq - a tool for counting line frequencies using a hash table rather than sorting
  • locale (1) - get locale-specific information
  • mac2unix
  • more (1) - file perusal filter for crt viewing
  • nroff (1) - emulate nroff command with groff
  • od - dumps file contents for inspection, e.g. od -c -tx1 shows characters together with hexadecimal byte values
  • perl (1) - Practical Extraction and Report Language
  • pico
  • quota (1) - display disk usage and limits
  • rev (1) - reverse lines of a file
  • sed (1) - manual page for sed version 4.1.2
  • sort (1) - sort lines of text files
  • sort (3pm) - perl pragma to control sort() behaviour
  • stty (1) - change and print terminal line settings
  • tac (1) - concatenate and print files in reverse
  • tail (1) - output the last part of files
  • tei2snt
  • tr (1) - translate or delete characters
  • troff (1) - the troff processor of the groff text formatting system
  • uniq (1) - remove duplicate lines from a sorted file
  • unix2mac
  • wc (1) - print the number of newlines, words, and bytes in files
  • zcat
  • xdvi
  • zmore

Version and Copyright Information

version:

copyright:

Usage

See description.

Help, Manuals and Documentation

help commands:
man

further information:

Bugs

License Text

Other Information

Field of science: Linguistics

Available:
corpus

License: LicenseTypePGnu

To be copied to: https://wwwk.csc.fi/english/research/software/textfilters
To be seen at: http://www.csc.fi/english/research/software/textfilters
See also: KitWiki.SuomenKielipankki:Dev:Linguistics_Software, Old.ToolResources
The users may add their own comments to: ToolResource_textfilters_Comments

When editing, please move cursor to the form below. Do not add anything here.

META FORM name="ToolResourcesForm"
FORM FIELD UnixLocation UnixLocation
FORM FIELD WebInterface WebInterface
FORM FIELD Group Group all
FORM FIELD Copyright Copyright
FORM FIELD LicenseType LicenseType C
FORM FIELD License License GPL
FORM FIELD LicenseText LicenseText
FORM FIELD ProviderName ProviderName GNU
FORM FIELD LatestVersion LatestVersion
FORM FIELD HomePage HomePage
FORM FIELD ShortName ShortName textfilters
FORM FIELD HelpCommand HelpCommand man
FORM FIELD HelpText HelpText
FORM FIELD FullName FullName GNU Text Filters
|*FORM FIELD Description*|Description|The software consists of several tools for text and document processing:
  • aspell
  • awk
  • cat (1) - concatenate files and print on the standard output
  • cmp (1) - compare two files
  • cut (1) - remove sections from each line of files
  • detect-encoding - analyses the characters of a file
  • diff (1) - find differences between two files
  • dos2unix (1) - DOS/MAC to UNIX text file format converter
  • dos2unix [mac2unix] (1) - DOS/MAC to UNIX text file format converter
  • dos2unix, unix2dos - convert text files between Unix and Windows/DOS line-ending formats (available on some CSC machines, and hopefully on corpus.csc.fi)
  • egrep [grep] (1) - print lines matching a pattern
  • examine
  • fgrep [grep] (1) - print lines matching a pattern
  • file (1) - determine file type
  • gawk (1) - pattern scanning and processing language
  • gawk [pgawk] (1) - pattern scanning and processing language
  • grep (1) - print lines matching a pattern
  • groff (1) - front-end for the groff document formatting system, see man 7 groff for a short reference for the GNU roff language
  • grotty (1) - groff driver for typewriter-like devices
  • gs (1) - Ghostscript (PostScript and PDF language interpreter and previewer)
  • gzgrep
  • gzmore
  • head (1) - output the first part of files
  • iconv (1) - Convert encoding of given files from one encoding to another
  • less (1) - opposite of more
  • lesspipe.sh
  • linefreq - a tool for counting line frequencies using a hash table rather than sorting
  • locale (1) - get locale-specific information
  • mac2unix
  • more (1) - file perusal filter for crt viewing
  • nroff (1) - emulate nroff command with groff
  • od - dumps file contents for inspection, e.g. od -c -tx1 shows characters together with hexadecimal byte values
  • perl (1) - Practical Extraction and Report Language
  • pico
  • quota (1) - display disk usage and limits
  • rev (1) - reverse lines of a file
  • sed (1) - manual page for sed version 4.1.2
  • sort (1) - sort lines of text files
  • sort (3pm) - perl pragma to control sort() behaviour
  • stty (1) - change and print terminal line settings
  • tac (1) - concatenate and print files in reverse
  • tail (1) - output the last part of files
  • tei2snt
  • tr (1) - translate or delete characters
  • troff (1) - the troff processor of the groff text formatting system
  • uniq (1) - remove duplicate lines from a sorted file
  • unix2mac
  • wc (1) - print the number of newlines, words, and bytes in files
  • zcat
  • xdvi
  • zmore
|
FORM FIELD Version Version
FORM FIELD Usage Usage
FORM FIELD Bugs Bugs
META TOPICMOVED by="AnssiYliJyra" date="1161860494" from="KitWiki.ToolResource_unixfiletools" to="KitWiki.ToolResource_textfilters"
 