Big and Open data.
Challenges for Smartcity
Victoria López
Grupo G-TeC
www.tecnologiaUCM.es
Universidad Complutense de Madrid
http://grasia.fdi.ucm.es
ICIST 2014
Valencia
Index
• Introduction
• Fighting with Big Data: genome data
• What is Big Data?
• Technology transfer: Open Data opportunities
• Developing projects for Smartcity
• RMap, a real example in Madrid
• Conclusions
Introduction
– Mobile technologies
– Intelligent agents
– Optimization and forecasting
– Bioinformatics, Biostatistics
– …
– www.tecnologiaUCM.es
Fighting with Big Data
• Every day we need to deal with more and more data.
• For many years, new computers with more memory and higher
speed seemed to be the solution to data growth.
• Many research areas were already fighting with Big Data: in
bioinformatics, genome data, DNA, RNA, proteins and, in general, all
biological data have had to be monitored by computers and
stored in large databases in laboratories and research
centers around the world.
The future of genomics rests on the foundation of the Human Genome Project
Fighting with Big Data
• Each time an organization or an individual is not
able to deal with its data, it is facing a big data
problem.
• Same philosophy as modern Big Data: large
databases distributed around the world, with
parallel processing when available and suitable
• (Sequence alignment and Dynamic Programming)
• The whole body of biological data is one big database.
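The slide mentions sequence alignment and dynamic programming without showing the technique. A minimal sketch, assuming the classic Needleman-Wunsch global alignment with an illustrative scoring scheme (match +1, mismatch -1, gap -1; these values and the function name are assumptions, not taken from the talk):

```python
from typing import List


def global_alignment_score(a: str, b: str,
                           match: int = 1,
                           mismatch: int = -1,
                           gap: int = -1) -> int:
    """Return the best global alignment score of sequences a and b."""
    rows, cols = len(a) + 1, len(b) + 1
    # score[i][j] = best score for aligning a[:i] with b[:j]
    score: List[List[int]] = [[0] * cols for _ in range(rows)]

    # Aligning a prefix against the empty string costs one gap per symbol.
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap

    for i in range(1, rows):
        for j in range(1, cols):
            pair = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + pair,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,       # gap inserted in b
                score[i][j - 1] + gap,       # gap inserted in a
            )
    return score[-1][-1]
```

The table has (len(a)+1) x (len(b)+1) cells, so time and memory grow with the product of the sequence lengths. That growth is one reason genome-scale comparison is usually distributed or run in parallel, as the slide suggests.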
Big Data
From Data Warehouse to Big Data
• 1970: relational model invented
• RDBMS declared mainstream until the 90s
• One size fits all: "elephant" vendors, heavily
encoded, even indexing by B-trees.
Alex 'Sandy' Pentland,
director of 'Media Lab' at
Massachusetts Institute of
Technology (MIT)
Nowadays business needs
high availability of data, so
new techniques must be
developed: complex analytics,
graph databases
Unstructured data
Who generates Big Data?
Progress and innovation are no longer hampered by the ability to collect data,
but by the ability to manage, analyze, synthesize, visualize, and discover
knowledge from the collected data in a timely manner and in a scalable way
Big Data: the 3+1+1 V's
Big Data
1. High availability is now a requirement
2. Host and cloud computing
3. Running in parallel
1. Data aggregation process
2. Analytics on data
3. Graph DBMS similarities
4. Not only SQL: Cassandra* and MongoDB**
5. Moving toward ACID: people from Google acknowledge ACID as a
good idea for working with databases.
*The Apache Cassandra database is the right choice when you need
scalability and high availability without compromising performance.
**Document-oriented storage
Big Data: MapReduce
• Main feature: scalability to many nodes
– Scan of 100 TB on 1 node @ 50 MB/s = 23 days
– Scan on a cluster of 1000 nodes = 33 minutes
• MapReduce
– Parallel programming model
– Simple concept, smart, suitable for multiple applications
– Big datasets → multi-node, multiprocessor execution
– Sets of nodes: clusters or grids (distributed programming)
• By Google (2004)
– Able to process 20 PB per day
– Based on Map and Reduce, classical methods in functional programming
related to the classic divide and conquer
– Comes from numerical analysis (big matrix products)
• Friendly for non-technical users
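The scan-time figures on the MapReduce slide can be checked with a few lines of arithmetic. This sketch assumes decimal units (1 TB = 10^12 bytes, 1 MB = 10^6 bytes) and perfectly linear scaling across nodes with no coordination overhead. Neither assumption is stated on the slide, and the function name is illustrative:

```python
TB = 10**12  # bytes, decimal units (assumption)
MB = 10**6   # bytes, decimal units (assumption)


def scan_time_seconds(data_bytes: float,
                      throughput_bytes_per_s: float,
                      nodes: int = 1) -> float:
    """Time to scan the data once, assuming ideal linear speed-up over nodes."""
    return data_bytes / (throughput_bytes_per_s * nodes)


single_node = scan_time_seconds(100 * TB, 50 * MB)
cluster = scan_time_seconds(100 * TB, 50 * MB, nodes=1000)

print(f"1 node:     {single_node / 86400:.1f} days")  # about 23 days
print(f"1000 nodes: {cluster / 60:.1f} minutes")       # about 33 minutes
```

Real clusters fall short of the ideal figure because of scheduling, data movement and stragglers, so the 33 minutes is best read as a lower bound.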
Big Data: Hadoop
– Used by Yahoo!, Facebook, Twitter,
Amazon, eBay…
– Can be used in different architectures:
both clusters (in-house) and grids
(cloud computing)
http://hadoop.apache.org/
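The MapReduce programming model described in these slides can be illustrated in a single process. The following is a sketch of the map, shuffle and reduce phases for a word count, not Hadoop's actual API; all function names are hypothetical and the two input strings are made-up sample documents:

```python
from collections import defaultdict
from functools import reduce
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(document: str) -> Iterator[Tuple[str, int]]:
    """Map: emit a (word, 1) pair for every word in one document."""
    for word in document.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle: group all emitted values by key."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Reduce: fold the values of one key into a single count."""
    return key, reduce(lambda x, y: x + y, values)


def word_count(documents: List[str]) -> Dict[str, int]:
    mapped = (pair for doc in documents for pair in map_phase(doc))
    grouped = shuffle(mapped)
    return dict(reduce_phase(k, v) for k, v in grouped.items())


counts = word_count(["big data open data", "open data for smart city"])
print(counts["data"])  # 3
```

In a distributed framework each call to `map_phase` and `reduce_phase` could run on a different node, because neither depends on shared state. That independence is what makes the model scale.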
Big Data: Data Mining & Scalability
• Data mining techniques (machine learning, data clustering,
predictive models, etc.) are compatible with big data through complex
analytics
• Modeling prices in electricity Spanish markets under uncertainty,
G. Miñana, H. Marrao, R. Caro, J. Gil, V. Lopez, B. González. In: F. Sun et al. (eds.), Knowledge Engineering
and Management, Advances in Intelligent Systems and Computing 214, DOI: 10.1007/978-3-642-37832-4_46,
Springer-Verlag Berlin Heidelberg 2014
• To get a scalable system
– Aggregation
– Generalization
– (Formal specification)
• Not only many cores, many nodes and out-of-memory data
– Host and cloud computing
– Not all problems can be solved with the same techniques; Hadoop is
not enough
Technology transfer
• A great opportunity for researchers working on
technology transfer, who can increase their
efforts in developing new techniques for
– Monitoring data (sensors, smartphones, …)
– Storing data (cloud computing, Amazon S3, EC2,
Google BigQuery, Tableau …)
– Cleaning, integrating & processing data
(Data Curation at Scale: The Data Tamer System,
M. Stonebraker et al., CIDR 2013)
– Analysing data (R, SAS… but also Google, Amazon,
eBay…)
– Fully homomorphic encryption & searching on
encrypted data
Open Data
“Open data is data that can be freely used, reused and redistributed by anyone – subject only, at most, to the requirement to attribute and share alike.” OpenDefinition.org
Availability and Access: the data must be available as a whole and at no more than a reasonable reproduction cost, preferably by downloading over the internet. The data must also be available in a convenient and modifiable form.
Reuse and Redistribution: the data must be provided under terms that permit reuse and redistribution including the intermixing with other datasets. The data must be machine-readable.
Universal Participation: everyone must be able to use, reuse and redistribute – there should be no discrimination against fields of endeavour or against persons or groups. For example, ‘non-commercial’ restrictions that would prevent ‘commercial’ use, or restrictions of use for certain purposes (e.g. only in education), are not allowed.
Why Open Data, by the Open Knowledge Foundation
Open Data for Smartcity
• What can a citizen expect when living in a
city?
• Internet of Things
– Libraries
– Public transportation, traffic monitoring
– Pets, devices, cars, even people
• Intelligent agents
– Interacting without our control
– Credit card control (BBVA use case)
Basic structure
Client/Server pattern
[Diagram. Labels: public data, web service, server, client, web server, query, data transfer. New data is collected; a service is given.]
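The client/server structure in the diagram can be sketched without any network code. In the example below the "web service" is an ordinary function that answers a query over public data with a JSON string and records the query as newly collected data. The records, field names and function names are all hypothetical placeholders, not the talk's actual data or API:

```python
import json
from typing import Dict, List

# Hypothetical public records; not real Madrid open data.
PUBLIC_DATA: List[Dict] = [
    {"id": 1, "category": "recycling_point", "name": "Point A"},
    {"id": 2, "category": "bike_parking", "name": "Parking B"},
    {"id": 3, "category": "recycling_point", "name": "Point C"},
]

# Queries received by the service: the "new data is collected" step.
QUERY_LOG: List[Dict] = []


def web_service(query: Dict) -> str:
    """Server side: log the query, filter the public data, reply as JSON."""
    QUERY_LOG.append(query)
    matches = [r for r in PUBLIC_DATA if r["category"] == query.get("category")]
    return json.dumps({"results": matches})


def client_request(category: str) -> List[Dict]:
    """Client side: send a query and decode the transferred data."""
    response = web_service({"category": category})
    return json.loads(response)["results"]


names = [r["name"] for r in client_request("recycling_point")]
print(names)  # ['Point A', 'Point C']
```

In a deployed system the call to `web_service` would be an HTTP request to a web server, but the division of roles stays the same: the client sends a query, the server transfers data back, and each request leaves new data behind.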
Recycla.me
Data Analytics
From (unstructured) data to value
Mariam Saucedo
Pilar Torralbo
Daniel Sanz
Recycla.me
Ana Alfaro
Sergio Ballesteros
Lidia Sesma
Héctor Martos
Álvaro Bustillo
Arturo Callejo
Belén Abellanas
Jaime Ramos
Ignacio P. de Ziriza
Victor Torres
Alberto Segovia
Miguel Bueno
Mar Octavio de Toledo
Antonio Sanmartín
Carlos Fernández
Resource Map (Mapa de Recursos)
RECYCLA.TE
Madrid – Smart City: RMap
• Parks and gardens
• Parking for
– Cars
– Motorbikes
– Bikes
• Recycling points
– Fixed
– Mobile
– Clothes
• Stations
– Bioethanol
– Gas
– Oil
– Electric
• Routes for bikes
– Cycle lanes
– Safe streets
– Residential Priority Areas
