performance – Alluxio

Channel: performance – Alluxio

Building fast and scalable big data and ML platforms at Pinterest and JD.com

June 5, 2019, 2:47 pm

This Alluxio Meetup features a chance to interact with other Alluxio users and developers, as well as three talks. Thanks to our joint host Data Council! The post Building fast and scalable big data...

View Article

Hybrid Environments for Data Analytics is a Possibility

June 21, 2019, 1:41 pm

As the data ecosystem becomes massively complex and more and more disaggregated, data analysts and end users have trouble adapting and working with hybrid environments. The proliferation of compute...

View Article

Building fast and scalable big data and ML platforms at Pinterest and JD.com

June 21, 2019, 2:47 pm

This talk shares our design, implementation and optimization of Alluxio metadata service to address the scalability challenges, focusing on how to apply and combine techniques including tiered metadata...

View Article

Getting Started with the Alluxio-Presto Sandbox

July 11, 2019, 3:00 pm

The Alluxio-Presto sandbox is a docker application featuring installations of MySQL, Hadoop, Hive, Presto, and Alluxio. The sandbox lets you easily dive into an interactive environment where you can...

View Article

Scalable Filesystem Metadata Services with RocksDB

July 22, 2019, 3:39 pm

Alluxio maintainer and founding engineer Calvin Jia presents on Scalable Filesystem Metadata Services with RocksDB at the RocksDB meetup at Twitter. The post Scalable Filesystem Metadata Services with...

View Article

Alluxio New York Meetup: Accelerating Analytical Workloads for Public &...

July 22, 2019, 4:25 pm

Joint hosted Alluxio New York meetup with talks to include: Embracing hybrid cloud for data-intensive analytic workloads and Alluxio on AWS EMR (fast storage access and sharing for Spark). The post...

View Article

NetEase and Alluxio joint meetup

July 23, 2019, 12:50 pm

Joint meetup in Hangzhou discusses: An introduction to new features of big data storage system Alluxio and optimization of cache performance, Practice & exploration of Spark & Alluxio, and the...

View Article

Accelerating Write-intensive Data Workloads on AWS S3

August 7, 2019, 3:56 pm

Alluxio is an open-source data orchestration system widely used to speed up data-intensive workloads in the cloud. Alluxio v2.0 introduced Replicated Async Write to allow users to complete writes to...

View Article

Community Office Hour: Building a Cloud Native Stack with EMR Spark, Alluxio,...

August 27, 2019, 5:25 pm

Learn how to set up EMR Spark with Alluxio so Spark jobs can seamlessly read from and write to S3. See the performance comparison between Spark on S3 with Spark, and Alluxio on S3. The post Community...

View Article

Why Data Orchestration?

September 18, 2019, 11:13 am

Today’s current pace of innovation is hindered by the necessity of reinventing the wheel in order for applications to efficiently access data. When an engineer or scientist wants to write an...

View Article

Online Meetup: Powering Data Science and AI with Apache Spark, Alluxio, and IBM

October 29, 2019, 6:26 pm

Learn why leading companies are moving towards a decoupled compute and storage architecture, and the associated challenges and requirements. Hear about how Spark and Alluxio together can solve the...

View Article

Apache Iceberg – A Table Format for Huge Analytic Datasets

November 12, 2019, 6:04 pm

This talk includes why Netflix needed to build Iceberg, the project’s high-level design, and will highlight the details that unblock better query performance. The post Apache Iceberg – A Table Format...

View Article

How to Develop and Operate Cloud Native Data Platforms and Applications

November 12, 2019, 6:42 pm

In this talk, we share our lessons in building and rebuilding our monitoring systems and data platforms at Electronic Arts (EA). The post How to Develop and Operate Cloud Native Data Platforms and...

View Article

Enterprise Distributed Query Service Powered by Presto & Alluxio Across...

November 12, 2019, 6:49 pm

This session talks about challenges associated with querying diverse data sources at Walmart and how those are tackled using Presto & Alluxio. The post Enterprise Distributed Query Service Powered...

View Article

The Practice of Presto & Alluxio in E-Commerce Big Data Platform

November 15, 2019, 9:17 am

JD.com is China’s largest online retailer. It uses Alluxio to provide support for ad hoc and real-time stream computing, using Alluxio-compatible HDFS URLs and Alluxio as a pluggable optimization...

View Article

Integrating Google Cloud Dataproc with Alluxio for faster performance in the...

November 18, 2019, 1:35 pm

Learn how to set up Google Cloud Dataproc with Alluxio so jobs can seamlessly read from and write to Cloud Storage. See how to run Dataproc Spark against a remote HDFS cluster. The post Integrating...

View Article

Tech Talk: Integrating Google Cloud Dataproc with Alluxio for faster...

December 10, 2019, 2:17 pm

Chris Crosbie and Roderick Yao from the Google Dataproc team and Dipti Borkar of Alluxio demo how to set up Google Cloud Dataproc with Alluxio so jobs can seamlessly read from and write to Cloud...

View Article

NetEase and Alluxio joint meetup

July 23, 2019, 12:50 pm

Joint meetup in Hangzhou discusses: An introduction to new features of big data storage system Alluxio and optimization of cache performance, Practice & exploration of Spark & Alluxio, and the...

View Article

What’s new in Alluxio 2.2

March 11, 2020, 4:50 am

With this release comes the General Availability (GA) of Alluxio Structured Data Services (SDS), the subsystem of Alluxio responsible for managing and transforming structured data, such as databases,...

View Article

Optimizing Query Performance by Decoupling Presto and Hive Data Warehouse

March 24, 2020, 12:06 pm

Ideally, Presto would access data independently from how the data was originally stored or managed. Alluxio, as a data orchestration layer provides the physical data independence, for Presto to...

View Article

More Pages to Explore .....

Latest Images