Skip to content

Archive for

11
Jun

Data/BI/Analytics Evolution @ NetFlix


More data + Better models + More accurate metrics + Better approaches & architectures = Lots of room for improvement!

netflixIt’s amazing to watch how quickly the data engineering / analytics/ reporting/ modeling/ visualization toolset is evolving in the BI ecosytem.

There are clearly massive foundational shifts taking place around big data. I am not sure how large conventional Fortune 500 firms can innovate and keep up with what’s going on.  I have run into CIOs who have not heard of Hadoop in some cases.

It’s also fascinating to see how data-driven “bleeding” edge firms like NetFlix are pushing the envelope.  Netflix stats are amazing:  1/3+ Internet traffic (NA / peak);  100+ Million hours per day; 65+ Million members / 50+ countries; 500 Billion Events / Day.

NetFlix is clearly reinventing Television and targeting 90 million potential subs in the US market alone.  Binge-watching, cord-cutting are now part of our everyday lingo. What most people don’t realize is how data-driven Netflix is…. from “giving viewers what they want” to “leveraging data mining to boost subscriber base”.

Viewing -> Improved Personalization -> Better Experience is the virtuous circle.

Here is a glimpse at how their BI landscape has evolved in the past five years as they integrate 5 million to 6 million net adds for several years now.  The figures are from a presentation by Blake Irvine, Manager Data Science and Engineering.

BI tools @ NetFlix pre-Hadoop

Read more »

2
Jun

The NoSQL and Spark Ecoystem: A C-Level Guide


EvolutionofDBMS

New Technologies | New Possibilities

As a C-level executive, it’s becoming clear to me that NoSQL databases and Machine Learning toolsets like Spark are going to play an increasingly big role in data-driven business models, low-latency architecture & rapid application development (projects that can be done in 8-12 weeks not years).

The best practice firms are making this technology shift as decreasing storage costs have led to an explosion of big data. Commodity cluster software, like Hadoop, has made it 10-20x cheaper to store large datasets.

After spending two days at the leading NoSQL provider  MongoDB World event in NYC, I was pleasantly surprised to see the amount of innovation and size of user community around document centric databases like MongoDB.

Data Driven Insight Economy

It doesn’t take genius to realize that data driven business models, high volume data feeds, mobile first customer engagement, and cloud are creating new distributed database requirements. Today’s modern online and mobile applications need continuous availability, cost effective scalability and high-speed analytics to deliver an engaging customer experience.

We know instinctively that there is value in all the data being captured in the world around out…no question is no longer “if there is value” but “how to extract that value and apply it to the business to make a difference”.

Legacy relational databases fail to meet the requirements of digital and online applications for the following reasons:

Read more »

%d bloggers like this: