Se ha denunciado esta presentación.
Se está descargando tu SlideShare. ×

Introduction to data science club

Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Próximo SlideShare
Data science presentation
Data science presentation
Cargando en…3
×

Eche un vistazo a continuación

1 de 40 Anuncio

Introduction to data science club

Descargar para leer sin conexión

Being able to make data driven decisions is a crucial skill for any company. The requirements are growing tougher - the volume of collected data keeps increasing in orders of magnitude and the insights must be smarter and faster. Come learn more about why data science is important and what challenges the data teams need to face.

Being able to make data driven decisions is a crucial skill for any company. The requirements are growing tougher - the volume of collected data keeps increasing in orders of magnitude and the insights must be smarter and faster. Come learn more about why data science is important and what challenges the data teams need to face.

Anuncio
Anuncio

Más Contenido Relacionado

Presentaciones para usted (20)

Similares a Introduction to data science club (20)

Anuncio

Más reciente (20)

Anuncio

Introduction to data science club

  1. 1. INTRO INTO DATA SCIENCE CLUB 16th OCTOBER 2017
  2. 2. TABLE OF CONTENT INTRO WHY DATA SCIENCE CLUB AND WHAT ARE THE GOALS? INTRO INTO DATA SCIENCE
  3. 3. AI research isn't just in Silicon Valley - Alan Turing, born in London - DeepMind, founded in London - Amazon AI labs, Berlin - Yandex, Russia - Baido, China - ...
  4. 4. Why Data Science Club? - Network business and academia for mutual good - Business - Dedicated to adding a value - Needs an access to research, talent and innovation - Academia - Dedicated to research and teaching - Can't do all the research and teaching alone
  5. 5. Exponea is a company with ... - Team of Software Engineers that work on applied AI - 80% from FIIT and Matfyz - Data and interesting clients from all around the world - Lots of interesting problems to solve - Lots of relevant knowledge to share
  6. 6. STU FIIT is an education institution with ... - A study programme Intelligent Software Systems - Software Engineering - Artificial Intelligence - A relevant research and team of experienced researchers - Lots of graduates with careers related to data science - Great facilities where all of us can meet
  7. 7. What is Data Science Club? - Community of like minded people with interest in data, analytics, software engineering and AI - Regular meetups where we share knowledge and work on our challenges together
  8. 8. When? - 6 times per semester in both summer and winter - Monday starting at 16:00 with agenda for 3 hours: - Expert talk(s) - Practical workshop - Networking, consulting, discussions
  9. 9. Draft of Agenda Winter 1. Intro into Data Science 2. Data storage 3. SQL for data analytics 4. Map-Reduce 5. Stream data processing 6. Data visualisation Summer 1. Applications of machine learning 2. Process of machine learning 3. Classification 4. Recommendation engines 5. Bandit algorithms 6. Reinforcement learning Final agenda depends on your interest and availability of expert speakers
  10. 10. Goals - Build an active community - 3+ universities, 10+ companies, 100+ members - Create and share content - Reach 2000+ viewers - Contribute to research and applications - 5+ publications by B&A authors in 2017/18 - 5+ new features in software products
  11. 11. People - Jozo Kovac, Cofounder, CTO - Matus Cimerman, Head of AI - Peter Kovacs, AISW Engineer - Ondrej Brichta, AISW Engineer - Jakub Macina, AISW Engineer - Robert Lacok, AISW Engineer - Dalibor Meszaros, AISW Engineer - Lucia Siebestichova, Event Manager - Martina Kolibasova, QA Engineer
  12. 12. Contacts - FB Page: https://www.facebook.com/ExponeaSocietyBratislava/ - Slack: https://community.exponea.com/ - Wiki: https://github.com/exponea/data-science-club/wiki - Git: https://github.com/exponea/data-science-club - Email: matus.cimerman@exponea.com
  13. 13. MEET OUR TEAM WRITE HERE SOMETHING MY SEARCH FOR A FAMILY VACATION LETS START WITH A REAL WORLD PROBLEM
  14. 14. Challenges - We pay for traffic, how to increase customer value? - Can customer history improve the quality of search? - Search must be fast, render everything under 100ms
  15. 15. A quick analysis - A learn to rank problem
  16. 16. https://www.slideshare.net/MrChrisJohnson/interactive- recommender-systems-with-netflix-and-spotify
  17. 17. Designing Data-Intensive Applications
  18. 18. WORKSHOP 16th OCTOBER 2017
  19. 19. Challenges - step by step • Collect data • Preprocess • Train a model • Evaluate • Deploy • Improve the model Data Model Data* InsightModel
  20. 20. Collect data • If you have it, you’re done • If not, find a way to collect
  21. 21. Preprocess • What: • Select features • Normalize • Fill • Aggregate... • How: • Store • Scale out processing
  22. 22. Create a model • Learn about your domain • Go really simple first • Don’t reinvent the wheel
  23. 23. Evaluate • What metrics make sense? • Test sets vs live data
  24. 24. Deploy • Online API vs batch • Latency matters • Reliable • Monitor & evaluate • Retrain on new data
  25. 25. Improve the model • Read more papers and blogs. Is anyone doing it better? • Lead, innovate and be the best
  26. 26. Workshop 1. Set up your environment: dependencies, template & data 2. Preprocess: Clean, normalize, feature select (pandas/scikit-learn) 3. Train: Choose a model of your liking (scikit-learn) 4. Deploy: Create a REST API (flask)

×