This book constitutes the thoroughly refereed proceedings of the 7th International Workshop on Computational Processing of the Portuguese Language, PROPOR 2006. The 20 revised full papers and 17 revised short papers presented here are organized in topical sections on automatic summarization, resources, translation, named entity recognition, tools and frameworks, systems and models, information extraction, speech processing, lexicon, morpho-syntactic studies, and Web, corpus and evaluation.
Summarization.- Modeling and Evaluating Summaries Using Complex Networks.- SABio: An Automatic Portuguese Text Summarizer Through Artificial Neural Networks in a More Biologically Plausible Model.
Resources.- Building a Dictionary of Anthroponyms.- REPENTINO – A Wide-Scope Gazetteer for Entity Recognition in Portuguese.
Translation.- Using Natural Alignment to Extract Translation Equivalents.- Open-Source Portuguese–Spanish Machine Translation.- Weighted Finite-State Transducer Inference for Limited-Domain Speech-to-Speech Translation.
Named Entity Recognition.- A Golden Resource for Named Entity Recognition in Portuguese.- Functional Aspects in Portuguese NER.- SIEMÊS – A Named-Entity Recognizer for Portuguese Relying on Similarity Rules.
Tools and Frameworks.- Tools for Nominalization: An Alternative for Lexical Normalization.- A Framework for Integrating Natural Language Tools.- Methods and Tools for Encoding the WordNet.Br Sentences, Concept Glosses, and Conceptual-Semantic Relations.- A Multi-agent Approach to Question Answering.
Systems and Models.- Adaptation of Data and Models for Probabilistic Parsing of Portuguese.- A Set of NP-Extraction Rules for Portuguese: Defining, Learning and Pruning.- Resolving Portuguese Nominal Anaphora.- Design of a Multimodal Input Interface for a Dialogue System.- Review and Evaluation of DiZer – An Automatic Discourse Analyzer for Brazilian Portuguese.- Classroom Lecture Recognition.
Information Extraction.- Semi-supervised Learning for Portuguese Noun Phrase Extraction.- Automatic Extraction of Keywords for the Portuguese Language.- Semi-automatically Building Ontological Structures from Portuguese Written Texts.
Speech Processing.- On the Use of Machine Learning and Syllable Information in European Portuguese Grapheme-Phone Conversion.- A Model to Computational Speech Understanding.- Phonetic Sequence to Graphemes Conversion Based on DTW and One-Stage Algorithms.
Lexicon.- Very Strict Selectional Restrictions: A Comparison Between Portuguese and French.- Towards a Formalization of Tense and Aspect for the Generation of Portuguese Sentences.- The Need for Application-Dependent WSD Strategies: A Case Study in MT.- A Lexical Database of Portuguese Multiword Expressions.
Morpho-syntactic Studies.- Dedicated Nominal Featurization of Portuguese.- Part-of-Speech Tagging of Portuguese Based on Variable Length Markov Chains.- From Syntactical Analysis to Textual Segmentation.- Syntactical Annotation of COMPARA: Workflow and First Results.
Web, Corpus and Evaluation.- A Complex Evaluation Architecture for HAREM.- What Kinds of Geographical Information Are There in the Portuguese Web?.- Corpus-Based Compositionality.