lunduniversity.lu.se

Computer Science

Faculty of Engineering, LTH

Denna sida på svenska This page in English

EDAN20 – Language Technology

EDAN20 – Language Technology

The schedule for August/September/October 2016 is available here:https://se.timeedit.net/web/lu/db1/lth1/. Use the course schedule or lässchema links and enter EDAN20 to view the course schedule.

The first lecture will be held on August 29, 2016, 15-17. The lecture room is M:E

The course takes place in the HT1 “study period.”

Objectives

The course introduces theories and techniques of natural language processing and language technology. It attempts to cover the whole field from speech recognition and synthesis to semantics and dialogue.

It focuses on industrial or laboratory applications, such as document retrieval on the Internet, information extraction, conversational agents, and verbal interaction in virtual worlds. Fundamental algorithms will be described using Python, Prolog, or regular expressions.

Course contents

  • An overview of language processing: presentation of language processing, applications, disciplines of linguistic, examples
  • Corpus and word processing: regular expressions, automata, an introduction to Python, concordances, tokenization, counting words, collocations
  • Morphology and part-of-speech tagging: morphology, transducers, part-of-speech tagging,
  • Prolog to write phrase-structure grammars: constituents, trees, using Prolog to do natural language analysis, DCG rules, variables, getting the syntactic structure, compositional analysis to get the semantic structure.
  • Syntactic formalisms: constituency and dependency, chart parsing, statistical parsing, functions, dependency parsing.
  • Semantics: formal semantics, lambda-calculus, compositionality: nouns, verbs, determiners, words and meaning, lexical semantics, case grammars, semantic grammars
  • Discourse and dialogue: discourse and rhetoric, anaphora, structure, RST, dialogue: automata, pairs, speech acts, multimodality.
  • Overview of speech synthesis and speech recognition

Textbook

As textbook, I will use:Language processing with Perl and Prolog, 2nd edition, 2014, Springer. It is available from Springer link: [html] [pdf], or in a paper version [html].

Students can also use the first edition from 2006,An Introduction to Language Processing with Perl and Prolog. The electronic version is available for free: [pdf]. You need to be logged from Lund University accounts to have a free copy. The paper version of the first edition costs 25 euros: [html].

Ges: Läsperiod HT1

Kontaktperson: Pierre Nugues

Förkunskapskrav: Se kursplanen.

OBS! Kursen ges på engelska

Kurswebb: http://cs.lth.se/edan20

Page Manager:

Facts about the course

EDAN20: Language Technology

Higher education credits: 7.5

Grading scale: UG

Level: A

Language of instruction: The course might be given in English

Course coordinator: Pierre Nugues

E-mail: Pierre.Nugues@cs.lth.se

Prerequisites: EDAA01 Programming — Second Course or EDA027 Algorithms and Data Structures

Admission specifics: The number of participants is limited to 58

Assessment: Compulsory course items: Six assignments. Optional examination.

Home page: cs.lth.se/edan20

Further information/Transitional rules: Limited number of participants