
This introduction to Natural Language Processing (NLP) covers the management and analysis of text using the core Python programming language and the open-source libraries NLTK (Natural Language Toolkit) and spaCy. Some prior experience with Python programming will be useful but is not assumed. The three one-hour workshops will cover the following topics:

Friday May 5, 12:00-1:15pm: Text processing in Python

  • strings and their properties
  • strings as iterables, lists
  • comparing and searching strings
  • regular expressions

Friday May 19, 12:00-1:15pm: NLTK

  • text preprocessing (spellchecking, stemming and lemmatization)
  • word contexts, frequency distributions
  • parts-of-speech tagging
  • named entity recognition
  • sentiment analysis

Wednesday May 24, 12:00-1:15pm: spaCy

  • statistical modeling of text
  • word vectors and similarity
  • processing pipelines

Workshops will be held online via Zoom.

Daily Schedule:
11:45am-12:00pm: Setup, discussion, troubleshooting (optional, as needed)
12:00-1:15pm: Instruction (approximately the last 15 minutes are reserved for optional review, discussion, and troubleshooting)

Registration: This event is open to all members of the Caltech community. Please register using the link below. 

Dates & Times:
11:45am - 1:15pm, Friday, May 5, 2023
11:45am - 1:15pm, Friday, May 19, 2023
11:45am - 1:15pm, Wednesday, May 24, 2023
Zoom Session (Online)
Registration has closed. (This event has to be booked as part of a series)

Event Organizer

Stephen Davison