StreamSets Administrator Training

HANDS ON TRAINING

BUILD PROVEN SKILLS

3 DAY COURSE

StreamSets Administrator Course Overview

High-performance deployments deserve high-caliber training. The fastest, most reliable way to build proven skills in StreamSets is via expert instructor-led hands-on classroom training in a structured learning environment.

This 3-day hands-on training course provides comprehensive coverage for administrators of StreamSets Control Hub (SCH), StreamSets Data Collector (SDC), and StreamSets Transformer. Students will learn how to install, configure, and manage SCH, SDC, and Transformer. In this course, students will learn about StreamSets architecture, deployment design, monitoring and tuning, security, and upgrades. Throughout the course, hands-on exercises reinforce the concepts being discussed.

Requirements

Students preferably should have a general knowledge of operating systems, networking, programming concepts, and databases.

Audience

The course is designed for system administrators, data flow engineers, and systems architects who will be managing, monitoring, and administering StreamSets environments. No prior knowledge of StreamSets is required.

Objectives

Introduction
Lab environment
Course Resources

Overview
Overview of the StreamSets Platform
Users and use cases
Architecture Overview
Installation and licensing options

Development Models, Execution Models, and Architecture
ETL as a common data processing pattern
Deployment architecture and reference architectures
Development models
Execution models

Prerequisites & Installation
Infrastructure requirements
Software requirements
Installation options

SCH Administration
Cloud Management
On-premises nuances
Creating users, groups, roles, organizations
Connectivity
Configuration

Monitoring, Alerts, Logs, and Troubleshooting
Identifying pipeline throughput
Identifying and isolating bottlenecks
Data rules and alerting
Logs and log levels
Jobs and topology monitoring
Garbage collection

Connectivity
Pipeline Origin types
Pipeline Destinations
Connection Catalog

Security
SCH Security Components
Users, roles, organizations, permissions
SAML Authentication
LDAP Authentication
Best Practices

Pipeline Deployment Options
The Pipeline Repository
Pipeline development lifecycle
Testing pipelines
Pipeline CI/CD

Upgrading SCH, SDC, and Transformer
Upgrading pipelines
Upgrading StreamSets components

High Availability and Disaster Recovery
HA and DR options
Scaling and performance

SDC Installation and Configuration
SDC installation requirements
Installation and registration process
SDC configuration and management

Transformer Installation and Configuration
Transformer installation requirements
Installation and registration process
Transformer configuration and management

Roadmap & Wrap-up
Roadmap and planning for the future
Additional resources and support