• Adobe Spindle – next-generation web analytics processing with Scala, Spark, and Parquet.
  • Apache Kiji – framework to collect and analyze data in real time, based on HBase.
  • Apache Nutch – open source web crawler.
  • Apache OODT – capturing, processing and sharing of data for NASA’s scientific archives.
  • Apache Tika – content analysis toolkit.
  • Countly – open source mobile and web analytics platform, based on Node.js & MongoDB.
  • Domino – run, scale, share, and deploy models without any infrastructure.
  • Eclipse BIRT – Eclipse-based reporting system.
  • Eventhub – open source event analytics platform.
  • Hermes – asynchronous message broker built on top of Kafka.
  • HIPI Library – API for performing image processing tasks on Hadoop’s MapReduce.
  • Hunk – Splunk analytics for Hadoop.
  • Imhotep – large-scale analytics platform by Indeed.
  • MADlib – data-processing library that runs inside an RDBMS to analyze data.
  • Kylin – open source distributed analytics engine from eBay.
  • PivotalR – R on Pivotal HD / HAWQ and PostgreSQL.
  • Qubole – auto-scaling Hadoop cluster, built-in data connectors.
  • Sense – Cloud Platform for Data Science and Big Data Analytics.
  • Snowplow – enterprise-strength web and event analytics, powered by Hadoop, Kinesis, Redshift and Postgres.
  • SparkR – R frontend for Spark.
  • Splunk – analyzer for machine-generated data.
  • Sumo Logic – cloud-based analyzer for machine-generated data.
  • Talend – unified open source environment for YARN, Hadoop, HBase, Hive, HCatalog & Pig.
  • Warp – query-by-example tool for big data (OS X app).
