Natural Language Processing (COMP90042)
Graduate courseworkPoints: 12.5Dual-Delivery (Parkville)
From 2023 most subjects will be taught on campus only with flexible options limited to a select number of postgraduate programs and individual subjects.
To learn more, visit COVID-19 course and subject delivery.
Overview
Availability | Semester 1 - Dual-Delivery |
---|---|
Fees | Look up fees |
AIMS
Much of the world's knowledge is stored in the form of text, and accordingly, understanding and harnessing knowledge from text are key challenges. In this subject, students will learn computational methods for working with text, in the form of natural language understanding, and language generation. Students will develop an understanding of the main algorithms used in natural language processing, for use in a diverse range of applications including machine translation, text mining, sentiment analysis, and question answering. The programming language used is Python.
INDICATIVE CONTENT
Topics covered may include:
- Text classification and unsupervised topic discovery
- Vector space models for natural language semantics
- Structured prediction for tagging
- Syntax models for parsing of sentences and documents
- N-gram language modelling
- Automatic translation, and multilingual methods
- Relation extraction and coreference resolution
Intended learning outcomes
On completion of this subject the student is expected to:
- Identify basic challenges associated with the computational modelling of natural language
- Understand and articulate the mathematical and/or algorithmic basis of common techniques used in natural language processing
- Implement relevant techniques and/or interface with existing libraries
- Carry out end-to-end research experiments, including evaluation with text corpora as well as presentation and interpretation of results
Generic skills
On completing this subject, students should have the following skills:
- Formulate and implement algorithmic solutions to computational problems, with reference to the research literature
- Apply a systems approach to complex problems, and design for operational efficiency
- Design, implement and test programs for small and medium size problems in the Python programming language
Last updated: 31 January 2024
Eligibility and requirements
Prerequisites
One of
Code | Name | Teaching period | Credit Points |
---|---|---|---|
COMP30027 | Machine Learning | Semester 1 (Dual-Delivery - Parkville) |
12.5 |
COMP90049 | Introduction to Machine Learning |
Semester 2 (Dual-Delivery - Parkville)
Semester 1 (Dual-Delivery - Parkville)
|
12.5 |
COMP90051 | Statistical Machine Learning |
Semester 1 (Dual-Delivery - Parkville)
Semester 2 (Dual-Delivery - Parkville)
|
12.5 |
COMP30018 Knowledge Technologies
OR
Admission into one of the following: 100pt Program course entry point in the MC-IT Master of Information Technology, 150pt Program course entry point in the MC-IT Master of Information Technology
AND
Admission into or selection of one of the following:
- Artificial Intelligence specialisation (formal) in the MC-IT Master of Information Technology
- Distributed Computing specialisation (formal) in the MC-IT Master of Information Technology
- Computing specialisation (formal) in the MC-IT Master of Information Technology
Corequisites
None
Non-allowed subjects
433-460 Human Language Technology
433-467 Text and Document Management
433-660 Human Language Technology
433-667 Text and Document Management
433-476 Text and Document Management
Inherent requirements (core participation requirements)
The University of Melbourne is committed to providing students with reasonable adjustments to assessment and participation under the Disability Standards for Education (2005), and the Assessment and Results Policy (MPF1326). Students are expected to meet the core participation requirements for their course. These can be viewed under Entry and Participation Requirements for the course outlines in the Handbook.
Further details on how to seek academic adjustments can be found on the Student Equity and Disability Support website: http://services.unimelb.edu.au/student-equity/home
Last updated: 31 January 2024
Assessment
Description | Timing | Percentage |
---|---|---|
3 short programming assignments.
| From Week 3 to Week 10 | 25% |
A final, open-ended (in terms of methodology) research project, due at the end of the semester.
| From Week 11 to Week 12 | 35% |
One open-book online examination.
| End of semester | 40% |
Additional details
Hurdle Requirement:
Hurdle: 50% overall 30/60 for continuous assessment 20/40 for final exam
Last updated: 31 January 2024
Dates & times
- Semester 1
Principal coordinator Jey Han Lau Mode of delivery Dual-Delivery (Parkville) Contact hours 36 hours, comprising of one 2-hour lecture and one 1-hour workshop per week Total time commitment 200 hours Teaching period 28 February 2022 to 29 May 2022 Last self-enrol date 11 March 2022 Census date 31 March 2022 Last date to withdraw without fail 6 May 2022 Assessment period ends 24 June 2022 Semester 1 contact information
Jey Han Lau
Time commitment details
200 hours
Last updated: 31 January 2024
Further information
- Texts
- Subject notes
LEARNING AND TEACHING METHOD
The subject comprises a weekly 2 hour lecture followed by a 1 hour laboratory exercise. Weekly readings are assigned from relevant textbooks and the research literature, and weekly laboratory exercises are assigned. Additionally, a significant amount of project work is assigned.
INDICATIVE KEY LEARNING RESOURCES
At the beginning of the semester, the coordinator will post a list of readings taken from relevant textbooks as well as research literature and research monographs. An indicative source of relevant material is the textbook Speech and Language Processing by Dan Jurafsky and James H. Martin (2008).
CAREERS / INDUSTRY LINKS
A growing sector of the IT industry is concerned with leveraging the information that is locked up in semi-structured text data on the web. Large scale analysis and exploitation of this information depends on graduates with a solid grounding in natural language processing and text retrieval algorithms, and experience with implementing systems that are informed by the research literature.
- Related Handbook entries
This subject contributes to the following:
Type Name Course Master of Data Science Course Master of Science (Computer Science) Course Ph.D.- Engineering Course Doctor of Philosophy - Engineering Course Master of Philosophy - Engineering Specialisation (formal) Computing Specialisation (formal) Distributed Computing Specialisation (formal) Software Major Computer Science - Available through the Community Access Program
About the Community Access Program (CAP)
This subject is available through the Community Access Program (also called Single Subject Studies) which allows you to enrol in single subjects offered by the University of Melbourne, without the commitment required to complete a whole degree.
Please note Single Subject Studies via Community Access Program is not available to student visa holders or applicants
Entry requirements including prerequisites may apply. Please refer to the CAP applications page for further information.
Additional information for this subject
Subject coordinator approval required
- Available to Study Abroad and/or Study Exchange Students
Last updated: 31 January 2024