alluxio data orchestration storage open source big data presto cloud spark distributed computing hybrid cloud machine learning file system data management summit memory tachyon project analytics aws cloud computing alluxio day performance hadoop hdfs hive separation of compute and storage data architecture caching data platform cloud storage s3 distributed systems apache spark ai multi cloud model training aws s3 data analytics data engineering kubernetes sql distributed storage compute alluxio engineering infrastructure emr data deep learning data loading data lake object store meetup data locality tachyon artificial intelligence gpu object stores tech talk release gpu utilization llm intel fuse rocksdb cache architecture data lakes facebook local cache cloud bursting google dataproc unified namespace uber posix orchestration tensorflow pytorch trino cloud migration database ml gpu analytics use case apache hudi apache ozone raft office hour object storage scale distributed data caching query engine hybrid cloud bursting overview compute storage separation computer metadata community tencent ceph memory centric ray software engineering software development product school zookeeper apache iceberg presto caching microsoft fluid alibaba datasapiens under file system zero copy bursting on-prem analytics zoo amazon emr nfs structured data management rakuten data stores conference baidu data warehouse grpc data stack demo amazon web services data ecosystem jd kyligence olap memory-centric python computing generative ai cv api model traiing devops transparent uri product release analytics and ai cloud architecture twitter virtual file system apache ranger hybrid big data netapp bilibili data tagging open data platform metadata management shadow cache tiktok cache layer prometheus metrics grafana optane persistent memory raptorx disaggregated storage rapids accelerator data lake analytics dask aspect analytics webinar terraform eks t3go walkme unisound atlas starburst 
robinhood data catalog paypal gimel sql workloads jd.com distributed applications ing tech dataproc google cloud hybrid data lake helixa comcast china unicom aunalytics hub hybrid shannondb storagequery s3 api analytic workloads public cloud deep learning applications high performance high-performance scalable metadata services structured data services catalog service spark workloads remote data software testing unified data zero copy hybrid bursting mapr cloud workloads dc/os object store analytics on-premise compute e-commerce datasets pipeline api usability concurrency iceberg netflix alibaba cloud gene computing structured data search queries ryte zero-copy burst distributed query walmartlabs global namespace multi-tiering 2.0 preview unified bigdata tutorial storage system security parquet amazon amplab pingo tachyon nexus elastic mapreduce developers developer datawarehouse etl financial services decoupling compute and storage data unification virtualization distributed system in-memory storage qiniu sogou business intelligence ctrip momo talking data nvidia mesosphere qunar strata