TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Updated Sep 29, 2023 - Scala
This code is used to build & run a Docker container for performing predictions against a Spark ML Pipeline.
JPMML-SparkML plugin for converting LightGBM-Spark models to PMML
Recommendation engine in Java. Based on an ALS algorithm (Apache Spark). Train a new model after N seconds.
A collection of “cookbook-style” scripts for simplifying data engineering and machine learning in Apache Spark.
Free High-Quality Financial Data in Azure
Simulation of job offers and CVs with real-time processing, classification, and analytics using Kafka, Ray, Spark, and Databricks. Includes a Flask-based recommendation system and Tableau visualizations.
Big data engineering capstone project. Tech stack: Linux, MySQL, Sqoop, HDFS, Hive, Impala, SparkSQL, SparkML, Git.
A machine learning at scale demo on flight delay prediction. The project includes an exploration of a series of data transformation and ML pipelines in Apache Spark (via Databricks).
Transformation of Akamai logs with Spark ETL, and discovery of values and similarities in the logs using SparkML and H2O ML
Online latent state estimation with Spark
Twitter Sentiment Analysis using Spark, MongoDB, and Google Cloud
Predicting the arrival delay time of commercial flights