Course Project

The course project is an open-ended, hands-on opportunity to explore an aspect of statistical NLP. Examples include ...

  • Summarize your participation in a Kaggle competition or shared task with an upcoming deadline
  • Compare the capabilities of several open source generative decoders-only transformers (ex. GPT-style models) on a particular task
  • Design and create a synthetic dataset, and demonstrate of how it can help to address a particular gap in capabilities
  • Fine-tune a multimodal LLM for a new task (or language) and report its performance

Rubric

Your course project proposal will be assessed on a Superior/Pass/Fail basis.

Superior (3 pts)

Pass criteria + all of the following:

  • project is appropriately scoped to the course (i.e., not overly ambitious)
  • rough timeline (with dates)

Pass (2 pts)

All of the following:

  • pull request submitted to course blog repository by the deadline
  • clear outline of the goals of your final project
  • project proposal investigates some aspect of statistical NLP
  • post includes a link to a course project code repository
  • the code repository uses the required assignment template (i.e., the repository is owned by the appropriate GitHub organization)

NOTE: Missing one or more of the above will result in a 1 point deduction.

Fail (0 pts)

Submission is empty or altogether absent

cd ~/👾 Bug?Creative Commons License