Subject: book on sed and awk in language research

Combining the Bourne-shell, sed and awk in the UNIX environment
for language analysis
================================================================

Lothar M. Schmitt (1) and Kiel T. Christianson (2)

1. The University of Aizu
   School of Computer Science and Engineering,
   Aizu-Wakamatsu City, Fukushima Prefecture, 965-80, Japan.
   E-mail: lothar@u-aizu.ac.jp

2. Michigan State University
   Dept. of Linguistics and Germanic, Slavic, Asian and African Languages
   East Lansing, Michigan 48824, USA.

================================================================

Abstract
========

We show how to construct tools for language analysis in research and
teaching using the Bourne-shell, sed and awk under UNIX. Applications
include the following: searches for words, phrases, grammatical
patterns and phonemic patterns in text; statistical evaluation of
texts in regard to such searches; transformation of phonetic, phonemic
or typographic transcriptions; comparison of texts in various
respects; lexical-etymological analysis; concordance; assistance in
translating text; assistance in learning languages; assistance in
teaching languages; and text processing and formatting. The latter
includes the generation of on-line dictionaries for the Internet from
files that were generated with what-you-see-is-what-you-get editors
representing only the linear structure of the dictionary (i.e., the
book).

All of the above can be achieved with particularly simple and short
code. In that regard, we illustrate how sed and awk can be combined in
the pipe mechanism of UNIX to create very powerful processing devices.
Our notes include a short introduction to programming the Bourne-shell
and rather short, but complete descriptions of sed and awk customized
in regard to language analysis.
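
As a small illustration of the pipe mechanism mentioned in the
abstract, the following sketch counts word frequencies in a text. It
is not taken from the book: the function name word_freq and the sample
sentence are assumptions made for this example, and it handles only
unaccented Latin letters.

```shell
#!/bin/sh
# word_freq (hypothetical name): read text on standard input and
# print "count word" lines, most frequent word first.
word_freq() {
    # sed: fold upper case to lower case, then turn every character
    # that is not a lower-case letter into a blank.
    sed 'y/ABCDEFGHIJKLMNOPQRSTUVWXYZ/abcdefghijklmnopqrstuvwxyz/
         s/[^a-z]/ /g' |
    # awk: count each blank-separated word, print the totals at the end.
    awk '{ for (i = 1; i <= NF; i++) count[$i]++ }
         END { for (w in count) print count[w], w }' |
    # sort: by count (numeric, descending), ties broken alphabetically.
    sort -k1,1nr -k2,2
}
```

For example, `printf 'The cat saw the dog.\nThe dog ran.\n' | word_freq`
lists "3 the" first, then "2 dog", then the three words that occur
once. Each stage does one job (normalize, count, order), so a stage
can be swapped out, for instance to search for phrases instead of
single words, without touching the others.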
****************************************
Availability: 1) ERIC document service (soon), 2) e-mail lms, 3) possible posting.
