Scalable streaming analytics using a single-server
For many organizations, real-time analytics entails complex event processing (CEP) systems or newer distributed stream processing frameworks like Storm, S4, or Spark Streaming. The latter have become...
Pattern-detection and Twitter’s Streaming API
Researchers and companies who need social media data frequently turn to Twitter’s API to access a random sample of tweets. Those who can afford to pay (or have been granted access) use the more...
Near realtime, streaming, and perpetual analytics
Simple example of a near realtime app built with Hadoop and HBase. Over the past year Hadoop emerged from its batch processing roots and began to take on interactive and near realtime applications....
Stream Processing and Mining just got more interesting
Largely unknown outside data engineering circles, Apache Kafka is one of the more popular open source, distributed computing projects. Many data engineers I speak with either already use it or are...
Stream Mining essentials
A series of open source, distributed stream processing frameworks have become essential components in many big data technology stacks. Apache Storm remains the most popular, but promising new tools...
Expanding options for mining streaming data
Stream processing was on the minds of a few people that I ran into over the past week. A combination of new systems, deployment tools, and enhancements to existing frameworks is behind the recent...
Four short links: 30 October 2013
Offline.js — Javascript library so web app developers can gracefully deal with users going offline. Android Guides — lots of info on coding for Android. Statistics Done Wrong — learn from these...
Four short links: 13 March 2014
Is Parallel Programming Hard? And, If So, What Can You Do About It? — book by Paul E. McKenney, on single-machine multi-CPU parallel programming. Malignant Computation — The bitcoin mining network...
Java 8 streams API and parallelism
In the last post in this series, we learned about functional interfaces and lambdas. In particular, we looked at the ITrade functional interface, and filtered a collection of trades to generate a...
4 things to make your Java 8 code more dynamic
Java 8 has a few new features that should help you write more dynamic code. Of course, one of the big features was the addition of a lambda syntax. But what about some of the other features that were...