Skip to content
#

sqoop

Here are 34 public repositories matching this topic...

MapReduce Job Development, RDDs Programming, Medical Data Management, Sales Analysis, And Efficient Data Integration For Big Data Analysis. Spark: Big Data Processing, SQOOP Integration, And Spark Structured Streaming For Real-Time Data.

  • Updated Jun 7, 2023
  • Java

A CRISP-DM–based big data pipeline for predicting NYC ride-sharing trip fares: ingesting 2024 TLC data via Sqoop into HDFS/Hive, performing ETL and feature engineering with Spark & PySpark, training and tuning Linear Regression & Gradient Boosted Tree models, and outlining end-to-end deployment.

  • Updated May 29, 2025
  • Java

Improve this page

Add a description, image, and links to the sqoop topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the sqoop topic, visit your repo's landing page and select "manage topics."

Learn more