In-Memory Computing Summit 2016

The best minds of the In-Memory Computing industry will gather in San Francisco on May 23-24 for IMC Summit 2016 to network, learn and exchange ideas that will power the future of ...

Big Data Analytics at Facebook

This meetup will be an “unconference” style one and will have various presentations to choose from. Please review the topics below and upon registering, select your 2 preferred top ...

Data Warehousing With Google BigQuery

Data warehousing and the resulting business intelligence are the basic necessities of business today. And today’s technologies make it possible to have a sophisticated data wareho ...

Innovative Big Data Application Optimizes Lead Conversions, built on the Google Cloud Platform – CASE STUDY

In the era of Big Data, many enterprise executives are struggling with the sheer volume of available data and how to transform all that information into intelligence they can use t ...

Predictive policing: The future of law enforcement

As Dj Das, founder and CEO of Third Eye Consulting Services, sums it up, “For fighting crime and keeping every citizen safe, Microsoft has the most sophisticated cloud-based big da ...

Big Data Analytics at Facebook

This meetup will be an “unconference” style one and will have various presentations to choose from. Please review the topics below and upon registering, select your 2 preferred topics that you’d like to hear discussed the night of the event!

Agenda

6:30pm Event Doors open
6:30pm - 7:30pm Happy Hour & Networking
7:30pm - 7:45pm Keynote
7:45pm - 8:30pm 2 Presentations and Q&A
8:30pm - 9:30pm Happy Hour ...

Data Warehousing With Google BigQuery

Data warehousing and the resulting business intelligence are basic necessities of business today. Today’s technologies make it possible to have a sophisticated data warehouse up and running in the cloud at a price and scale that was never possible before. This webinar showcases the reasons, ways and means of developing such modern-day data warehouses using Google BigQuery. ...

In Japan, Priuses can talk to other Priuses | TechCrunch

While the US waits to get Super Cruise in Cadillacs next year, Toyota has already rolled out a pretty robust V2V, or vehicle-to-vehicle, system in three models available in Japan. The latest version of the Prius, the Lexus RX, and the Toyota Crown, a luxury sedan sold in Japan, all have the ITS Connect system available as an option. The cars communicate using a channel abandoned by analog TV transmissions w ...

Why some Data Lakes are built to last

Hadoop-based Data Lakes can be game-changers, but too many are underperforming. Here's a checklist to make your data lake a wild success. Hadoop-based data lakes can be game changers: better, cheaper and faster integrated enterprise information. Knowledge workers can access data directly, where project cycles are measured in days rather than months, and business users can leverage a shared data source rath ...

Rapid Big Data Prototyping with Microsoft R Server on Apache Spark: Context Switching & Spark Tuning – Azure Data Lake Blog

During the big data application development stage, it’s common to downsize the problem to the local machine for rapid code prototyping iterations. Developing machine learning models with Microsoft R Server (MRS) allows the user to quickly switch between code execution on the local machine and remote big data clusters such as Apache Spark on Azure HDInsight. In this blog, we will demonstrate how to develop a pre ...

Uber’s case for incremental processing on Hadoop – O’Reilly Media

Uber’s mission is to provide “transportation as reliable as running water, everywhere, for everyone.” To fulfill this promise, Uber relies on making data-driven decisions at every level, and most of these decisions can benefit from faster data processing. Examples include using data to understand areas for growth, or giving the city operations team access to fresh data to debug each city. Needless to say, the cho ...

Securing Apache Spark Shuffle using Apache Commons Crypto – Cloudera Engineering Blog

Learn how the performance advantages of the Crypto cryptographic library will provide an upgrade for Spark shuffle encryption over the current approach. When running a big data computing job, the data being processed may contain sensitive information that users don’t want anyone else to access. Encrypting that sensitive data is becoming more and more important, especially for enterprise users. For Apache Sp ...

How startups can compete with enterprises in artificial intelligence and machine learning | TechCrunch

When I woke up this morning, I asked my assistant a simple question: “Siri, is it going to rain today?” Siri understood my intent, pulled the local weather data via an API and answered me in less than two seconds: “There’s no rain in the forecast for today.” In the not-too-distant past, this kind of human-computer interaction would have blown away technologists and delighted consumers — but in 2016, it’s no ...

Analyze Realtime Data from Amazon Kinesis Streams Using Zeppelin and Spark Streaming – AWS Big Data Blog

There is streaming data everywhere. This includes clickstream data, data from sensors, data emitted from billions of IoT devices, and more. Not surprisingly, data scientists want to analyze and explore these data streams in real time. This post shows you how you can use Spark Streaming to process data coming from Amazon Kinesis streams, build some graphs using Zeppelin, and then store the Zeppelin notebook in ...

Distributed, Real-time Joins and Aggregations on User Activity Events using Kafka Streams

In previous blog posts we introduced Kafka Streams and demonstrated an end-to-end Hello World streaming application that analyzes Wikipedia real-time updates through a combination of Kafka Streams and Kafka Connect. In this blog post we want to continue the introduction series on Kafka Streams by implementing a very common and very important use case in stream processing: to enrich an incoming stream of eve ...

How-to: Analyze Fantasy Sports with Apache Spark and SQL (Part 2: Data Exploration) – Cloudera Engineering Blog

Learn how analyzing stats from professional sports leagues is an instructive use case for data analytics using Apache Spark with SQL. Covered in this installment: data exploration with Apache Impala (incubating) and Hue. In Part 1 of this series, I introduced the topic of using fantasy sports analytics as an instructive use case for exploring the Apache Hadoop ecosystem. In that installment, we focused on d ...

How-to: Detect and Report Web-Traffic Anomalies in Near Real-Time – Cloudera Engineering Blog

This framework based on Apache Flume, Apache Spark Streaming, and Apache Impala (incubating) can detect and report on abnormal bad HTTP requests within seconds. Website performance and availability are mission-critical for companies of all types and sizes, not just those with a revenue stream directly tied to the web. Web pages can become unavailable for many reasons, including overburdened backing data ...

2015 © Big Data Cloud Inc. All Rights Reserved.

Hadoop, the Hadoop elephant logo, and Spark are trademarks of the Apache Software Foundation.
