Blog

Introducing StreamSets Data Collector 5.7

Related products: StreamSets Data Collector Engine

We’re thrilled to introduce two game-changing enhancements to StreamSets Data Collector (SDC):

  1. Support for Parquet Data: Unlock the Power of Efficiency 

    In SDC 5.7, we’re unveiling Parquet as a data type across multiple destinations. While it’s in “Technical Preview,” it brings optimization to data storage, enabling future data analysis improvements. 

    Benefits
  • Optimizes data storage for enhanced performance
  • Sets the stage for advanced data analysis 
  • Allows you to use Parquet in destinations like Local FS, Hadoop File System (HDFS), Databricks, AWS S3, ADLS2, Azure Blob Storage, Google Cloud Storage (GCS), BigQuery, and Snowflake

    While Parquet is in “Technical Preview,” we encourage you to explore its potential and share your feedback. Your insights will help us fine-tune this feature for seamless production use. n

 

  1. MongoDB Atlas Lookup Processor: Simplifying Data Access 

    If you are a MongoDB Atlas user, now you will be able to perform streamlined data lookup. Say goodbye to complex data retrieval scripts. 

    Benefits
  • Simplifies MongoDB Atlas data lookup
  • Compatible with on-premise MongoDB (older versions) 
  • Eliminates the need for intricate data retrieval processes

At StreamSets, we’re committed to providing solutions that empower you to make the most of your data. Login to StreamSets now to explore these new features!

 

Be the first to reply!