Course Outline
Introduction
- Spark NLP vs NLTK vs spaCy
- Overview of Spark NLP features and architecture
Getting Started
- Setup requirements
- Installing Spark NLP
- General concepts
Using Pre-trained Pipelines
- Importing required modules
- Default annotators
- Loading a pipeline model
- Transforming texts
Building NLP Pipelines
- Understanding the pipeline API
- Implementing NER models
- Choosing embeddings
- Using word, sentence, and universal embeddings
Classification and Inference
- Document classification use cases
- Sentiment analysis models
- Training a document classifier
- Using other machine learning frameworks
- Managing NLP models
- Optimizing models for low-latency inference
Troubleshooting
Summary and Next Steps
Requirements
- Familiarity with Apache Spark
- Python programming experience
Audience
- Data scientists
- Developers
Delivery Options
Private Group Training
Our identity is rooted in delivering exactly what our clients need.
- Pre-course call with your trainer
- Customisation of the learning experience to achieve your goals -
- Bespoke outlines
- Practical hands-on exercises containing data / scenarios recognisable to the learners
- Training scheduled on a date of your choice
- Delivered online, onsite/classroom or hybrid by experts sharing real world experience
Private Group Prices RRP from £3800 online delivery, based on a group of 2 delegates, £1200 per additional delegate (excludes any certification / exam costs). We recommend a maximum group size of 12 for most learning events.
Contact us for an exact quote and to hear our latest promotions
Public Training
Please see our public courses
Testimonials (5)
A lot of practical examples, different ways to approach the same problem, and sometimes not so obvious tricks how to improve the current solution
Rafal - Nordea
Course - Apache Spark MLlib
The live examples
Ahmet Bolat - Accenture Industrial SS
Course - Python, Spark, and Hadoop for Big Data
very interactive...
Richard Langford
Course - SMACK Stack for Data Science
Sufficient hands on, trainer is knowledgable
Chris Tan
Course - A Practical Introduction to Stream Processing
Get to learn spark streaming , databricks and aws redshift