In-Memory Computing Summit 2016

In-Memory Computing Summit 2016

The best minds of the In-Memory Computing industry will gather in San Francisco on May 23-24 for IMC Summit 2016 to network, learn and exchange ideas that will power the future of ...

Big Data Analytics at FaceBook

Big Data Analytics at FaceBook

This meetup will be an “unconference” style one and will have various presentations to choose from. Please review the topics below and upon registering, select your 2 preferred top ...

Data Warehousing With Google BigQuery

Data Warehousing With Google BigQuery

Data warehousing and the resulting business intelligence are the basic necessities of business today. And today’s technologies makes it possible to have a sophisticated data wareho ...

Innovative Big Data Application Optimizes Lead Conversions, built on the Google Cloud Platform – CASE STUDY

Innovative Big Data Application Optimizes Lead Conversions, built on the Google Cloud Platform – CASE STUDY

In the era of Big Data, many enterprise executives are struggling with the sheer volume of available data and how to transform all that information into intelligence they can use t ...

Predictive policing: The future of law enforcement

Predictive policing: The future of law enforcement

As Dj Das, founder and CEO of Third Eye Consulting Services, sums it up, “For fighting crime and keeping every citizen safe, Microsoft has the most sophisticated cloud-based big da ...

Big Data Analytics at FaceBook

This meetup will be an “unconference” style one and will have various presentations to choose from. Please review the topics below and upon registering, select your 2 preferred topics that you’d like to hear discussed the night of the event! Agenda 6:30pm Event Doors open 6:30pm - 7:30pm Happy Hour & Networking 7:30pm - 7:45pm Keynote 7:45pm - 8:30pm 2 Presentations and Q&A 8:30pm-9:30pm Happy Hour ...

Read more

Data Warehousing With Google BigQuery

Data warehousing and the resulting business intelligence are the basic necessities of business today. And today’s technologies makes it possible to have a sophisticated data warehouse up and running in the clouds at a price and scale that was never possible before.     This webinar showcases the reasons, ways and means of developing such modern day data warehouses using Google BigQuery.   ...

Read more

Analyze Realtime Data from Amazon Kinesis Streams Using Zeppelin and Spark Streaming – AWS Big Data Blog

There is streaming data everywhere. This includes clickstream data, data from sensors, data emitted from billions of IoT devices, and more. Not suprisingly, data scientists want to analyze and explore these data streams in real time. This post shows you how you can use Spark Streaming to process data coming fromAmazon Kinesis streams, build some graphs using Zeppelin, and then store the Zeppelin notebook in ...

Read more

Distributed, Real-time Joins and Aggregations on User Activity Events using Kafka Streams

In previous blog posts we introduced Kafka Streams and demonstrated an end-to-end Hello World streaming application that analyzes Wikipedia real-time updates through a combination of Kafka Streams and Kafka Connect. In this blog post we want to continue the introduction series on Kafka Streams by implementing a very common and very important use case in stream processing: to enrich an incoming stream of eve ...

Read more

How-to: Analyze Fantasy Sports with Apache Spark and SQL (Part 2: Data Exploration) – Cloudera Engineering Blog

Learn how analyzing stats from professional sports leagues is an instructive use case for data analytics using Apache Spark with SQL. Covered in this installment: data exploration with Apache Impala (incubating) and Hue. In Part 1 of this series, I introduced the topic of using fantasy sports analytics as an instructive use case for exploring the Apache Hadoop ecosystem. In that installment, we focused on d ...

Read more

How-to: Detect and Report Web-Traffic Anomalies in Near Real-Time – Cloudera Engineering Blog

This framework based on Apache Flume, Apache Spark Streaming, and Apache Impala (incubating) can detect and report on abnormal bad HTTP requests within seconds.    Website performance and availability are mission-critical for companies of all types and sizes, not just those with a revenue stream directly tied to the web. Web pages can become unavailable for many reasons, including overburdened backing data ...

Read more

AI, Deep Learning, and Machine Learning: A Primer – Andreessen Horowitz

“One person, in a literal garage, building a self-driving car.” That happened in 2015. Now to put that fact in context, compare this to 2004, when DARPA sponsored the very first driverless car Grand Challenge. Of the 20 entries they received then, the winning entry went 7.2 miles; in 2007, in the Urban Challenge, the winning entries went 60 miles under city-like constraints.Things are clearly progressing ra ...

Read more

Achieving End-to-end Security for Apache Spark with Databricks

Holistic Security for the Big Data Lifecycle Traditionally, enterprise organizations only had security solutions that addressed parts of their big data infrastructure. Today, enterprises demand holistic security that covers the full spectrum of their big data lifecycle: from file processing, big data clusters, code management, job workflows, application deployments, dashboards, to reports. The Databricks ju ...

Read more

C3 IoT and AWS to Seize Global Enterprise IoT Opportunity

Full-Stack IoT Development Platform Provider Joins Forces With Leading Cloud Infrastructure Provider to Accelerate Delivery of IoT Applications and Business Results REDWOOD CITY, CA--(Marketwired - June 16, 2016) - C3 IoT™, a global leader in enterprise IoT platform and applications software, today announced a new level of cooperation with Amazon Web Services (AWS) to deliver a tightly integrated, end-to-en ...

Read more

CERN Just Released 300 Terabytes Worth Of Data To The Public | IFLScience

If you’ve ever dreamed of working on the largest experiment in the world, you can now make that dream a reality from the comfort of your own home. CERN has just released more than 300 terabytes (TB) of high-quality open data from its CMS collaboration. The data includes 100 TB collected at the Large Hadron Collider (LHC) by the CMS detector in 2011. This includes raw datasets used by the scientists, as well ...

Read more

Open Sourcing Photon ML | LinkedIn Engineering

Machine learning has the best chance of achieving meaningful return on investment when companies model previous success. At last week’s Applied Artificial Intelligence conference in San Francisco, Uber’s Head of Machine Learning Danny Lange laid out his four principles for simplifying the process of applying machine learning in business. Lange has witnessed firsthand the evolution of machine learning techno ...

Read more

Apache Spark for Azure HDInsight now generally available | Blog | Microsoft Azure

Today, we are pleased to announce that Apache Spark v1.6.1 for Azure HDInsight is generally available. Since we announced the public preview, Spark for HDInsight has gained rapid adoption and is now 50% of all new HDInsight clusters deployed. With GA, we are revealing improvements we’ve made to the service to make Spark hardened for the enterprise and easy for your users. This includes improvements to the a ...

Read more

2015 © Big Data Cloud Inc. All Rights Reserved.

Hadoop and the Hadoop elephant logo, Sprark are trademarks of the Apache Software Foundation.

Scroll to top