Introducing StreamSets Data Collector 5.7

September 21, 2023

Anonymous

We’re thrilled to introduce two game-changing enhancements to StreamSets Data Collector (SDC):

Support for Parquet Data: Unlock the Power of Efficiency

In SDC 5.7, we’re unveiling Parquet as a data format across multiple destinations. Parquet is a columnar file format, so it typically stores data more compactly and lets analytical queries read only the columns they need. The feature is currently in “Technical Preview.”

Benefits:

Optimizes data storage for enhanced performance
Sets the stage for advanced data analysis
Allows you to use Parquet in destinations like Local FS, Hadoop File System (HDFS), Databricks, AWS S3, ADLS2, Azure Blob Storage, Google Cloud Storage (GCS), BigQuery, and Snowflake

While Parquet is in “Technical Preview,” we encourage you to explore its potential and share your feedback. Your insights will help us fine-tune this feature for seamless production use.

MongoDB Atlas Lookup Processor: Simplifying Data Access

If you are a MongoDB Atlas user, you can now look up data directly from within your pipeline. Say goodbye to complex data retrieval scripts.

Benefits:

Simplifies MongoDB Atlas data lookup
Also compatible with older, on-premises MongoDB versions
Eliminates the need for intricate data retrieval processes
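
For readers new to lookups, here is a minimal Python sketch of the general idea: each incoming record is enriched with the document that matches one of its fields. This is not the processor's implementation or configuration. The in-memory dictionary standing in for a MongoDB collection, the function name lookup_enrich, and all field names are hypothetical and chosen only for illustration.

```python
# Hypothetical stand-in for a MongoDB collection: documents keyed by customer_id.
CUSTOMERS = {
    101: {"name": "Ada", "tier": "gold"},
    102: {"name": "Lin", "tier": "silver"},
}


def lookup_enrich(records, collection, key_field, target_field):
    """Return copies of records with the matching document attached under target_field."""
    enriched = []
    for record in records:
        # Find the document whose key equals the record's lookup field.
        match = collection.get(record.get(key_field))
        out = dict(record)          # copy so the input record is left untouched
        out[target_field] = match   # None when no document matches
        enriched.append(out)
    return enriched


orders = [
    {"order_id": 1, "customer_id": 101},
    {"order_id": 2, "customer_id": 999},  # no matching customer
]
result = lookup_enrich(orders, CUSTOMERS, "customer_id", "customer")
```

In a pipeline, a lookup processor does this kind of enrichment for you, so there is no need to maintain a script like the one above.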

At StreamSets, we’re committed to providing solutions that empower you to make the most of your data. Log in to StreamSets now to explore these new features!
