From bf598c02ba03e5f438c3cce321338694529126e5 Mon Sep 17 00:00:00 2001
From: Jim Meyering
Date: Thu, 7 May 1998 14:32:59 +0000
Subject: add tr `double' script example

---
 doc/textutils.texi | 18 ++++++++++++++++++
 1 file changed, 18 insertions(+)

(limited to 'doc')

diff --git a/doc/textutils.texi b/doc/textutils.texi
index 91329ec53..0eaf3bbc2 100644
--- a/doc/textutils.texi
+++ b/doc/textutils.texi
@@ -2696,6 +2696,24 @@ Convert each sequence of repeated newlines to a single newline:
 tr -s '\n'
 @end example
 
+@item
+Find doubled occurrences of words in a document.
+For example, people often write ``the the'' with the duplicated words
+separated by a newline. The bourne shell script below works first
+by converting each sequence of punctuation and blank characters to a
+single newline. That puts each ``word'' on a line by itself.
+Next it maps all uppercase characters to lower case, and finally it
+runs @code{uniq} with the @samp{-d} option to print out only the words
+that were adjacent duplicates.
+
+@example
+#!/bin/sh
+cat "$@@" \
+  | tr -s '[:punct:][:blank:]' '\n' \
+  | tr '[:upper:]' '[:lower:]' \
+  | uniq -d
+@end example
+
 @end itemize

-- 
cgit v1.2.3-54-g00ecf
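
Reviewer note: the pipeline added by this patch can be tried on a small sample, as in the sketch below. The function name `find_doubled` and the sample text are hypothetical and not part of the patch. The Texinfo source writes `$@@` only because `@` must be escaped there; in a real script it is `$@`. The sketch also assumes a `tr` that pads the shorter second set with its last character, as GNU `tr` does.

```shell
#!/bin/sh
# find_doubled is a hypothetical wrapper around the pipeline from the patch.
find_doubled() {
  # With no file arguments, cat reads standard input.
  cat "$@" \
    | tr -s '[:punct:][:blank:]' '\n' \
    | tr '[:upper:]' '[:lower:]' \
    | uniq -d
}

# Sample input (an assumption for illustration): "the" is doubled across
# a line break, and "is" is doubled on the same line.
printf 'Paris in the\nthe spring. It is is warm.\n' | find_doubled
```

Because `tr -s` squeezes repeats of characters in the last set given, the newline already present between the two occurrences of "the" is merged with the translated ones, so the words end up on adjacent lines and `uniq -d` reports them. On this sample the sketch should print `the` and `is`, one per line.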