sumeet singh yahoo hadoop hbase storm big data cloud mapreduce spark platform tco yarn hive security też deep learning omid hadoop summit hdfs caffeonspark apache hcatalog data discovery debunking myths metering data lifecycle management governance public vs private cloud architecture private cloud costing data apache software foundation data platform batch analytics machine learning warehousing storage real time compute gpu realtime processing ease of use sketches data sketches keynote metastore innovation open source opentsdb realtime systems kafka monitoring strata dataout capacity planning integration audits network hardware confuguration software stack bcp hardware configuration audit hadoop stack big data networks top 10 resource management preemption reservations sla management fair scheduler sla queues scheduling queue best practices speculative execution node labels capacity scheduler deadline constrained capacity management lz4 lzo deflate compression gzip zlib decompression pig snappy bzip2 sap internet namespace region groups multi-tenancy cost models public cloud tco models
Ver más