Data Ingestion

Share on:

Top > Intelligence > Big Data and Analytics > Data Ingestion

  • AWS Data Pipeline - AWS Data Pipeline is a web service to process and move data between different AWS compute and storage services, as well as on-premises data sources, at specified intervals.   🌐
  • AWS Glue - AWS Glue is a managed extract, transform, and load (ETL) service based on Apache Spark.  🌐
  • Azkaban - Azkaban is a batch workflow shceduler created at LinkedIn to run Hadoop Jobs.   🌐
  • Trifacta - Trifacta offers a data ingestion and pipeline solution both for on-prem and cloud settings; it also has a SaaS flavour. Ingestion pipelines and transformation may be written using a visual editor, SQL, or Python.  🌐

Before You Leave

🤘 Subscribe to my 100% spam-free newsletter!

website counters