Archive for October, 2007

Survey of the State of the Art in Human Language Technology


From the foreword:

“This book surveys the state of the art of human language technology. The goal of the survey is to provide an interested reader with an overview of the field—the main areas of work, the capabilities and limitations of current technology, and the technical challenges that must be overcome to realize the vision of graceful human computer interaction using natural communication skills.”

Provided by the Center for Spoken Language Understanding at Oregon Graduate Institute of Science & Technology, published 1996

Practical Content Analysis Techniques for Text-Retrieval in Large, Un-tagged Text-bases


Paper by Raymond G. Siemens, Department of English, University of British Columbia, published in Proceedings of the 11th Annual International Conference on Systems Documentation


“This paper intends to provide a practical response to concerns regarding the retrieval of specific information from electronic text-bases which have no or minimal text markup. It proposes that standard techniques employed by social scientists and others involved in computer-assisted content analysis be adapted to assist in text-retrieval, and provides the context necessary to employ such techniques.”

Textual analysis of popular culture


Gives information on semiotics and content analysis of popular culture artifacts; categories for different genres and issues.


This site by Adam Schneider and G. William Domhoff, University of California, Santa Cruz, features the quantitative study of dreams.
Contains technical information about methods and findings, examples of coded dreams, hints for students, links, etc.
Calvin S. Hall’s (1909-1985) system of content analysis is used and explained.

A Content Analysis of Internet-Accessible Written Pornographic Depictions


Article written by Denna Harmon and Scot B. Boeringer, Appalachian State University; published in Electronic Journal of Sociology (1997)

Marketing Pornography on the Information Superhighway


Summary: For the Carnegie Mellon study, the research team downloaded all available pornographic images from five popular Usenet boards over a four-month period. This narrowed the list to about 500 boards that focused primarily on marketing “adult” pornographic images, or general boards with large “adult” sections in addition to other general interest material.
Carnegie Mellon study on pornography on the Internet, featured in Time Magazine and on ABC-TV’s “Nightline”.

University of Ulm, Germany, Computer sciences in psychotherapy


Former section of the Clinic for Psychosomatic Medicine and Psychotherapy. Content analysis of psychotherapy dialogs and reports, maintenance of text database (“Ulmer Textbank”), headed by Erhard Mergenthaler.

Association for Computational Linguistics


“The Association for Computational Linguistics is THE international scientific and professional society for people working on problems involving natural language and computation.”

Contains information about the ACL quarterly journal Computational Linguistics, conferences, and membership, plus an extensive collection of links to computational linguistics and natural language processing resources, the ACL NLP/CL Universe.

International Computer Archive of Modern and Medieval English


“ICAME is an international organization of linguists and information scientists working with English machine-readable texts. The aim of the organization is to collect and distribute information on English language material available for computer processing and on linguistic research completed or in progress on the material, to compile an archive of English text corpora in machine-readable form, and to make material available to research institutions.”

Contains links to:

Association for Computers and the Humanities


“The Association for Computers and the Humanities is an international professional organization. Since its establishment, it has been the major professional society for people working in computer-aided research in literature and language studies, history, philosophy, and other humanities disciplines, and especially research involving the manipulation and analysis of textual materials.”


  • administrative issues
  • information on events
  • an extensive archive