Performance Tuning

Research Blog: Using Machine Learning to predict parking difficulty

"When Solomon said there was a time and a place for everything he had not encountered the problem of parking his automobile." -Bob Edwards, Broadcast Journalist

Much of driving is spent either stuck in traffic or looking for parking. With products like Google Maps and Waze, it is our long-standing goal to help people navigate the roads easily and efficiently. But until now, there wasn’t a tool to address th ...

Ethics — the next frontier for artificial intelligence | TechCrunch

AI’s next frontier requires ethics built through policy. Will Donald Trump deliver? With one foot in its science fiction past and the other in the new frontier of science and tech innovations, AI occupies a unique place in our cultural imagination. Will we live into a future where machines are as intelligent — or frighteningly, more so — than humans? We have already witnessed AI predict the outcome of the l ...

Use Neural Networks to Find the Best Words to Title Your eBook – Data Science Central

Introduction: The eBook business is thriving. The likes of Amazon Kindle, Apple iBookstore, and Google eBookstore all provide a robust variety of channels by which to publish any eBook on any subject you could think of. Amazon generates an average of 1.07MM in eBook paid sales volume, which translates to about $5.8MM in revenue, every day. A huge community of eBook followers exists due to its proven model to ...

Monitoring Real-Time Uber Data Using Spark Machine Learning, Streaming, and the Kafka API (Part 1) | MapR

According to Gartner, by 2020, a quarter of a billion connected cars will form a major element of the Internet of Things. Connected vehicles are projected to generate 25GB of data per hour, which can be analyzed to provide real-time monitoring and apps, and will lead to new concepts of mobility and vehicle usage. One of the 10 major areas in which big data is currently being used to excellent advantage is i ...

Building AI Is Hard—So Facebook Is Building AI That Builds AI | WIRED

DEEP NEURAL NETWORKS are remaking the Internet. Able to learn very human tasks by analyzing vast amounts of digital data, these artificially intelligent systems are injecting online services with a power that just wasn’t viable in years past. They’re identifying faces in photos and recognizing commands spoken into smartphones and translating conversations from one language to another. They’re even helping G ...

The next AI is no AI | TechCrunch

Artificial Intelligence is starting to turn invisible from the outside in — and vice versa. The exact effects and workings of AI technologies are becoming more challenging to perceive and comprehend for humans. Even the experts themselves don’t always fully understand how an AI system operates. Effectively, as the impact of AI technologies increases, the more limited becomes our ability to understand their ...

How-to: Improve Apache HBase Performance via Data Serialization with Apache Avro – Cloudera Engineering Blog

Taking a thoughtful approach to data serialization can achieve significant performance improvements for HBase deployments. The question of using tall versus wide tables in Apache HBase is a commonly discussed design pattern (see reference here and here). However, there are more considerations here than making that simple choice. Because HBase stores each column of a table as an independent row in the underl ...

Hadoop performance troubleshooting with stack tracing, an introduction. | Databases at CERN

This post is about profiling and performance tuning of distributed workloads and in particular Hadoop applications. You will learn of a profiler application we have developed and how it has successfully been applied to tuning Sqoop to improve the throughput of data transfer from Oracle to Hadoop.

Where is my Sqoop job spending CPU time?

One of the data feeds into our Hadoop service is from Oracle databases. ...

Open Sourcing Dr. Elephant: Self-Serve Performance Tuning for Hadoop and Spark | LinkedIn Engineering

We are proud to announce today that we are open sourcing Dr. Elephant, a powerful tool that helps users of Hadoop and Spark understand, analyze, and improve the performance of their flows. We first presented Dr. Elephant to the community last year during the eighth annual Hadoop Summit, a leading conference for the Apache Hadoop community.

Our Motivation

Hadoop is a framework that facilitates the distribute ...

2015 © Big Data Cloud Inc. All Rights Reserved.

Hadoop, the Hadoop elephant logo, and Spark are trademarks of the Apache Software Foundation.
