Se ha denunciado esta presentación.
Utilizamos tu perfil de LinkedIn y tus datos de actividad para personalizar los anuncios y mostrarte publicidad más relevante. Puedes cambiar tus preferencias de publicidad en cualquier momento.

Best Practices • Again, there Getting Started on Hadoop

20.494 visualizaciones

Publicado el

Best Practices

• Again, there are much more efficient ways to handle Hadoop Streaming
and Text Analytics…
• Unit Tests, Continuous Integration, etc., – all great stuff, but “Big Data”
software engineering requires additional steps
• Sample data, measure data ratios and cluster behaviors, analyze in R,
visualize everything you can, calibrate any necessary “magic numbers”
• Develop and test code on a personal computer in IDE, cmd line, etc., using
a minimal data sets
• Deploy to staging cluster with larger data sets for integration tests and QA
• Run in production with A/B testing were feasible to evaluate changes
• Learn from others at meetups, unconfs, forums, etc.

Publicado en: Tecnología