Real-time event analysis

I just finished a workshop covering the use of Data Stream Analysis. Its necessity is driven by the need to analyze massive volumes of data (e.g. system and network events) in near real time – essential given how fast you will hit your head on the insertion rate ceiling using standard relational databases.

Off the shelf DBs (PostgreSQL, MySQL, Oracle, etc.) are unable to simultaneously commit thousands of events per second while performing complex queries. To have a chance of analyzing events in reasonable amounts of time you must analyze the incoming streams of data before inserting the data into a database.

I ran into this scenario last year building a central log server using off the shelf components. Even a few dozen servers can stream events fast enough where you realize pretty quickly all the typical open source based how-to’s on building a system that can store, correlate, and alert are inadequate. Data stream processing is required when things get big.

Posted by tate 18/03/2007 at 11h17


Comments

  1. Squarepuller 18/03/2007 at 20h16

    Perhaps I’m missing it, but where are the papers for download/reading? What’d you learn that’d be operationally useful for us security geeks? Any tools/software that you found as a good solution to your problem from last year?

    (Also, what’s up with this comment system? Very finicky. No non-ajax posts allowed?)