21 Repositories
Java analytics Libraries
Implementation of Enhancing cubes with models to describe multidimensional data.
Implementation of Enhancing cubes with models to describe multidimensional data.
DatasetCreator is a lightweight RESTFul client implementation of the Salesforce CRM Analytics External Data API.
DatasetCreator is a lightweight RESTFul client implementation of the Salesforce CRM Analytics External Data API. It has been deliberately developed with no 3rd party jars with the goal of being a lean, reliable and scalable solution.
A Time Series Data Browser
Contents What is binjr? Features Getting started Trying it out Getting help Contributing How is it licensed? What is binjr? binjr is a time series bro
An exploration for running analytics on JDK Flight Recorder recordings
JFR Analytics An exploration for running analytics on JDK Flight Recorder recordings. There's two areas of interest: Pull-based SQL queries on JFR rec
Elide is a Java library that lets you stand up a GraphQL/JSON-API web service with minimal effort.
Elide Opinionated APIs for web & mobile applications. Read this in other languages: 中文. Table of Contents Background Documentation Install Usage Secur
A Spigot plugin that tracks statistics with InfluxDB for timescale analytics.
RiceStats I made this plugin for my SMP server. It's able to track statistics and send it to InfluxDB where it can be processed by an analytics progra
Scalable Time Series Data Analytics
Time Series Data Analytics Working with time series is difficult due to the high dimensionality of the data, erroneous or extraneous data, and large d
Dagger is an easy-to-use, configuration over code, cloud-native framework built on top of Apache Flink for stateful processing of real-time streaming data.
Dagger Dagger or Data Aggregator is an easy-to-use, configuration over code, cloud-native framework built on top of Apache Flink for stateful processi
Dremio - the missing link in modern data
Dremio Dremio enables organizations to unlock the value of their data. Documentation Documentation is available at https://docs.dremio.com. Quickstart
FLiP: StreamNative: Cloud-Native: Streaming Analytics Using Apache Flink SQL on Apache Pulsar
StreamingAnalyticsUsingFlinkSQL FLiP: StreamNative: Cloud-Native: Streaming Analytics Using Apache Flink SQL on Apache Pulsar Running on NVIDIA XAVIER
OpenSearch is an open source distributed and RESTful search engine.
OpenSearch is an open source search and analytics engine derived from Elasticsearch
:elephant: Elasticsearch real-time search and analytics natively integrated with Hadoop
Elasticsearch Hadoop Elasticsearch real-time search and analytics natively integrated with Hadoop. Supports Map/Reduce, Apache Hive, Apache Pig, Apach
Apache Druid: a high performance real-time analytics database.
Website | Documentation | Developer Mailing List | User Mailing List | Slack | Twitter | Download Apache Druid Druid is a high performance real-time a
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
Apache Zeppelin Documentation: User Guide Mailing Lists: User and Dev mailing list Continuous Integration: Contributing: Contribution Guide Issue Trac
Apache Druid: a high performance real-time analytics database.
Website | Documentation | Developer Mailing List | User Mailing List | Slack | Twitter | Download Apache Druid Druid is a high performance real-time a
CrateDB is a distributed SQL database that makes it simple to store and analyze massive amounts of machine data in real-time.
About CrateDB is a distributed SQL database that makes it simple to store and analyze massive amounts of machine data in real-time. CrateDB offers the
Gephi - The Open Graph Viz Platform
Gephi - The Open Graph Viz Platform Gephi is an award-winning open-source platform for visualizing and manipulating large graphs. It runs on Windows,
The foundational library of the Morpheus data science framework
Introduction The Morpheus library is designed to facilitate the development of high performance analytical software involving large datasets for both
Apache Spark - A unified analytics engine for large-scale data processing
Apache Spark Spark is a unified analytics engine for large-scale data processing. It provides high-level APIs in Scala, Java, Python, and R, and an op
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Trino is a fast distributed SQL query engine for big data analytics. See the User Manual for deployment instructions and end user documentation. Devel
Apache Druid: a high performance real-time analytics database.
Website | Documentation | Developer Mailing List | User Mailing List | Slack | Twitter | Download Apache Druid Druid is a high performance real-time a