In-Memory Computing Summit 2016

In-Memory Computing Summit 2016

The best minds of the In-Memory Computing industry will gather in San Francisco on May 23-24 for IMC Summit 2016 to network, learn and exchange ideas that will power the future of ...

Big Data Analytics at FaceBook

Big Data Analytics at FaceBook

This meetup will be an “unconference” style one and will have various presentations to choose from. Please review the topics below and upon registering, select your 2 preferred top ...

Data Warehousing With Google BigQuery

Data Warehousing With Google BigQuery

Data warehousing and the resulting business intelligence are the basic necessities of business today. And today’s technologies makes it possible to have a sophisticated data wareho ...

Innovative Big Data Application Optimizes Lead Conversions, built on the Google Cloud Platform – CASE STUDY

Innovative Big Data Application Optimizes Lead Conversions, built on the Google Cloud Platform – CASE STUDY

In the era of Big Data, many enterprise executives are struggling with the sheer volume of available data and how to transform all that information into intelligence they can use t ...

Predictive policing: The future of law enforcement

Predictive policing: The future of law enforcement

As Dj Das, founder and CEO of Third Eye Consulting Services, sums it up, “For fighting crime and keeping every citizen safe, Microsoft has the most sophisticated cloud-based big da ...

Playing with 80 Million Amazon Product Review Ratings Using Apache Spark

Amazon product reviews and ratings are a very important business. Customers on Amazon often make purchasing decisions based on those reviews, and a single bad review can cause a potential purchaser to reconsider. A couple years ago, I wrote a blog post titled A Statistical Analysis of 1.2 Million Amazon Reviews, which was well-received. Back then, I was only limited to 1.2M reviews because attempting to pro ...

Read more

Analysis of the USA election of 2016 with Apache Spark GraphX and Neo4j | articles about programming on mkdev

Fig. 1 - The most popular tweets by Hillary Clinton and Donald Trump after the election has ended Almost right before the election started, I decided that it might have been interesting to analyse what people think and, more importantly, say on this topic. Because, as you know, this election had promised to be an extraordinary one. This is when I came up with an idea to utilise Twitter's streaming API to co ...

Read more

Overview

Myria Big Data as a Service Myria is a distributed, shared-nothing Big Data management system and Cloud service from the University of Washington. We derive requirements from real users and complex workflows, especially in science. Extracting knowledge out of Big Data today is a high-touch business, requiring a human expert who deeply understands the application domain as well as a growing ecosystem of comp ...

Read more

How-to: Fuzzy Name Indexing in Apache Hadoop with Rosette and Cloudera Search – Cloudera Engineering Blog

In this guide, learn how to use Cloudera Search with Basis Technology’s Rosette®  to perform fuzzy name searches in multiple languages and scripts. Our thanks to Basis Technology team (Jeanne Le Garrec, Hannah MacKenzie-Margulies and Brian Sawyer) for supporting writing this how-to blog. Cloudera Search, powered by Apache Solr brings full-text, interactive search, and scalable indexing to Apache Hadoop by m ...

Read more

Apache Beam and Spark: New coopetition for squashing the Lambda Architecture? | ZDNet

The nice thing about open source projects and standards is that there are so many of them to choose from. And on January 10, the Apache community welcomed Beam as its latest "top level" project (getting top level means your project has made it to prime time in Apache). Google traditionally kept its technology to itself, typically publishing research papers that the open source community would then reinvent ...

Read more

This company is using Amazon Snowmobile to transfer petabytes of data to the cloud

One of the most dramatic announcements from Amazon Web Services at its 2016 re:Invent conference was the announcement of Snowmobile: It’s a 45’ semi truck that trailers a data center on wheels. Customers can load it up with up to 100 petabytes of data per Snowmobile, which is then driven to an AWS data center and loaded into the company’s cloud. It begs the question: Who’s actually using this? DigitalGlobe ...

Read more

BMW’s vision for connected cars includes Cortana in your dash | Windows Central

BMW is at CES showing off a range of tech it envisions as part of a connected car experience. While the entire vision is a neat look into where BMW expects the near-future of connected, automated cars to go, one particular inclusion will stick out to Microsoft fans: Cortana. AS shown off as part of CES 2017 demo, BMW sees Cortana living in the car's dash to offer the same voice-activated assistance with whi ...

Read more

Microsoft launches a new cloud platform for connected cars | TechCrunch

Microsoft isn’t building its own connected car — but it is launching a new Azure-based cloud platform for car manufacturers that want to use the cloud to power their own connected-car services. The new Microsoft Connected Vehicle Platform will go live as a public preview later this year. “This is not an in-car operating system or a ‘finished product’,” Microsoft’s EVP for business development Peggy Johnson ...

Read more

Deep Learning on Databricks – The Databricks Blog

We are excited to announce the general availability of Graphic Processing Unit (GPU) and deep learning support on Databricks! This blog post will help users get started via a tutorial with helpful tips and resources, aimed at data scientists and engineers who need to run deep learning applications at scale. What’s new? Databricks now offers a simple way to leverage GPUs to power image processing, text analy ...

Read more

Nebula as a Storage Platform to Build Airbnb’s Search Backends – Airbnb Engineering & Data Science – Medium

Last year Airbnb grew to a point that a scalable and distributed storage system was required to store data for some applications. For example, personalization data for search grew larger than what a single machine can hold. While we could rebuild just the personalization service to scale up, we foresaw other services to have similar requirements and decided to build a common platform to simplify such tasks ...

Read more

2015 © Big Data Cloud Inc. All Rights Reserved.

Hadoop and the Hadoop elephant logo, Sprark are trademarks of the Apache Software Foundation.

Scroll to top