Wookie - building data products
- Reuse components using Sparkles - data processing monads
- Map over generic lists of functions
- Use some Latitude/Longitude goodies
- Provides base classes for writing:
- CLI applications
- Spark / Spark Streaming jobs
- REST APIs
- Provides collector API and sample collectors
- Spark SQL Server - automatically register tables given directory
- Data ingestion - REST API for pushing data to Kafka queue
- Prediction server - exposes Spark MLLib models as REST APIs
Modules
-
app-api - base classes/objects that helps writing basic commandline applications
-
web-api - base classes/objects that helps writing REST APIs
-
spark-api - utility classes/objects for writing Spark / Spark Streaming applications
-
spark-api-kafka - utility classes/objects for writing Spark Streaming applications using Kafka input streams
-
spark-api-twitter - utility classes/objects for writing Spark Streaming applications using Twitter input streams
-
oracle - REST API server that predicts new data points based on the model (WIP)
-
pumper - REST API server that push data to kafka queue (WIP)
-
sqlserver - Spark SQL server that automatically register and refresh tables given root directory, supports local file formats like json, csv, parquet as well as remote ones like Cassandra or Elasticsearch
-
examples:
Dependencies
Authors and Contributors
@elyast