This presentation is a supplementary material for the following article -> Nikiforova, A. (2020). Assessment of the usability of Latvia’s open data portal or how close are we to gaining benefits from open data. In In IADIS 14th International Conference on Interfaces and Human Computer Interaction (pp. 51-28).
Nowadays, more and more countries are launching their own open data portals, seeking to provide their citizens with open data in a form that is useful and suitable for the original purpose of the open data, and Latvia is not an exception. Despite the fact that the Latvian open data portal was launched only in 2017, it is considered to be a fast-tracker. However, despite the overall high evaluations, critical voices,and many discussions about whether the Latvia’s open data portal is of sufficient quality to be appeared. Therefore, while previous studies deal with quality of open data, this study focuses on the analysis of the Latvian open data portal and aims to find the key challenges that may have a negative impact on user experience.The paper assesses the current situation and recommends corrective actions,highlighting the aspects to be considered when developing and improving open data portals.
Comparative analysis of national open data portals or whether your portal is ...Anastasija Nikiforova
This file is a supplementary material for the following article -> Nikiforova, A. (2020). Comparative analysis of national open data portals or whether your portal is ready to bring benefits from open data. In IADIS International Conference on ICT, Society and Human Beings (pp. 21-23).
This paper focuses on the analysis of usability of the national open data portals. Open [government] data are considered as one of the most influenceable tool for preventing and reducing corruption and reaching innovative solutions that create added value for society.Thus, it is important to ensure that they are provided in a form that are useful and suitable for the original purpose of the open data. Critical voices and many discussions on whether open government data and national open data portals are of sufficient quality appear more frequently. Therefore, this study deals with this topic and aims to find the main challenges that can negatively impact users’ experience through an analysis of usability of 42 open data portals by applying a unified methodology on them allowing their comparative analysis to be carried out.This study highlights the weakest aspects for 42 national open data portals, pointing on both, the most common weakest points, and individual. The analysis carried out also identifies portals that can be considered as leaders and as an example for the less successful open data portals.
TIMELINESS OF OPEN DATA IN OPEN GOVERNMENT DATA PORTALS THROUGH PANDEMIC-RELA...Anastasija Nikiforova
This presentation is a supplementary material for the following article -> Nikiforova, A. (2020, October). Timeliness of open data in open government data portals through pandemic-related data: a long data way from the publisher to the user. In 2020 Fourth International Conference on Multimedia Computing, Networking and Applications (MCNA) (pp. 131-138). IEEE.
The paper addresses the “timeliness” of data in open government data (OGD) portals. It is one of the primary principles of open data, which is considered to be a success factor, while at the same time it is one of the biggest barriers that can disrupt users trust in data and even the desire to use the entire open data portal. However, assessing this aspect is a very difficult task that, in most cases, becomes an impossible for open data users. There is therefore a lack of comparative studies on the timeliness of data of different national open data portals. Unfortunately, 2020 gave the opportunity to find out this. It became easy enough to compare how long is the data path from the data holder to the OGD portal by analysing the timeliness of Covid-19-related data sets in relation to the first case observed in a country. The study thus fills the gap of comparative studies by addressing 60 countries and their OGD portals concerning the timeliness of the data, providing a report on how much and what countries provide the open data as quickly as possible. It makes it possible to understand how quickly OGD portals react to emergencies by opening and updating data for their further potential reuse, which is essential in the digital data-driven world.
Read paper here -> Nikiforova, A. (2020, October). Timeliness of open data in open government data portals through pandemic-related data: a long data way from the publisher to the user. In 2020 Fourth International Conference on Multimedia Computing, Networking and Applications (MCNA) (pp. 131-138). IEEE.https://ieeexplore.ieee.org/abstract/document/9264298?casa_token=FtfC_6bqZnsAAAAA:TaSnKrE7ZCxLyq5hvxX-X8O2sK_vZYcodTBtxoWOvaOAIFmMmy65f5dIK-kKYxFAMiC5jyl7Eeg
OPEN DATA: ECOSYSTEM, CURRENT AND FUTURE TRENDS, SUCCESS STORIES AND BARRIERSAnastasija Nikiforova
"OPEN DATA: ECOSYSTEM, CURRENT AND FUTURE TRENDS, SUCCESS STORIES AND BARRIERS" set of slides was prepared for the Guest Lecture, which I has delivered to the students of the University of South-Eastern Norway (USN), October 2021
Towards enrichment of the open government data: a stakeholder-centered determ...Anastasija Nikiforova
This set of slides is a part of the presentation prepared and delivered in the scope of the 14th International Conference on Theory and Practice of Electronic Governance (ICEGOV 2021), 6-8 October, 2021, Smart Digital Governance for Global Sustainability
It is based on the paper -> Nikiforova, A. (2021, October). Towards enrichment of the open government data: a stakeholder-centered determination of High-Value Data sets for Latvia. In 14th International Conference on Theory and Practice of Electronic Governance (pp. 367-372) -> https://dl.acm.org/doi/abs/10.1145/3494193.3494243?casa_token=bPeuwmFWwQwAAAAA:ls-xXIPK5uXDHyxtBxqsMJOCuV6ud_ip59BX8n78uJnqvql6e8H9urlDG9zzeNklRmGFwI4sCXU06w
Analysis of open health data quality using data object-driven approach to dat...Anastasija Nikiforova
This presentation is a supplementary material for the following article -> Nikiforova, A. (2019). Analysis of open health data quality using data object-driven approach to data quality evaluation: insights from a Latvian context. In IADIS International Conference e-Health (pp. 119-126).
This research focuses on the analysis of the quality of open health data that are freely available and can be used by everyone for their own purposes. The quality of open data is crucial as it can lead to unreliable decision-making and financial losses, however, the quality of open health data has even more critical role.Despite its importance, this topic is rarely discussed.Therefore, the previously proposed data object-driven approach to data quality evaluation is applied to open health data in Latvia in order to (a) evaluate their quality, highlighting common quality issues that should be considered by both, users and data publishers, (b) demonstrate that the used approach is suitable for given purpose as it is simple enough,and ensures the involvement of users even without IT and data quality knowledge (domain experts) in the data quality analysis examining data for their own purposes. The proposed solution seems to be useful in establishing communication between data users and publishers,improving the overall quality of data.
IoTSE-based Open Database Vulnerability inspection in three Baltic Countries:...Anastasija Nikiforova
This presentation is devoted to the "IoTSE-based Open Database Vulnerability inspection in three Baltic Countries: ShoBEVODSDT sees you" research paper developed by Artjoms Daskevics and Anastasija Nikiforova and presented during the The International conference on Internet of Things, Systems, Management and Security (IOTSMS2021) co-located with The 8th International Conference on Social Networks Analysis, Management and Security (SNAMS2021), December 6-9, 2021, Valencia, Spain (online)
Read paper here -> Daskevics, A., & Nikiforova, A. (2021, December). IoTSE-based open database vulnerability inspection in three Baltic countries: ShoBEVODSDT sees you. In 2021 8th International Conference on Internet of Things: Systems, Management and Security (IOTSMS) (pp. 1-8). IEEE -> https://ieeexplore.ieee.org/abstract/document/9704952?casa_token=NfEjYuud0wEAAAAA:6QxucVPuY762I3qzD6D_oWqa0B9eMUFRNMG-E7dyHKohSYIzI0bH1V9bLaAcly_Lp-Ll52ghO5Y
Stakeholder-centred Identification of Data Quality Issues: Knowledge that Can...Anastasija Nikiforova
This presentations is a supplementary material for presenting the "Stakeholder-centred Identification of Data Quality Issues: Knowledge that Can Save Your Business" (authored by Anastasija Nikiforova and Natalija Kozmina) research paper during the The International Conference on Intelligent Data Science Technologies and Applications (IDSTA2021), November 15-16, 2021. Tartu, Estonia (web-based)
Read paper here -> Nikiforova, A., & Kozmina, N. (2021, November). Stakeholder-centred Identification of Data Quality Issues: Knowledge that Can Save Your Business. In 2021 Second International Conference on Intelligent Data Science Technologies and Applications (IDSTA) (pp. 66-73). IEEE -> https://ieeexplore.ieee.org/abstract/document/9660802?casa_token=LFJa20LrXAwAAAAA:wVwhTcCPWqxdloAvDQ3-l98KkkLx70xzG3zNvIIkJbC6wvJ4VxwX_VGc3mmW_7c1T-QJlOtTiao
Invited talk "Open Data as a driver of Society 5.0: how you and your scientif...Anastasija Nikiforova
This presentation is prepared as a part of my talk on the openness (open data and open science) in the context of Society 5.0 during the International Conference and Expo on Nanotechnology and Nanomaterials. It was very pleasant to receive an invitation to deliver the talk on my recently published article Smarter Open Government Data for Society 5.0: Are Your Open Data Smart Enough? (Sensors 2021, 21(15), 5204), which I have entitled as “Open Data as a driver of Society 5.0: how you and your scientific outputs can contribute to the development of the Super Smart Society and transformation into Smart Living?“. The paper has been briefly discussed in my previous post, thus, just a few words on this talk and overall experience.
Comparative analysis of national open data portals or whether your portal is ...Anastasija Nikiforova
This file is a supplementary material for the following article -> Nikiforova, A. (2020). Comparative analysis of national open data portals or whether your portal is ready to bring benefits from open data. In IADIS International Conference on ICT, Society and Human Beings (pp. 21-23).
This paper focuses on the analysis of usability of the national open data portals. Open [government] data are considered as one of the most influenceable tool for preventing and reducing corruption and reaching innovative solutions that create added value for society.Thus, it is important to ensure that they are provided in a form that are useful and suitable for the original purpose of the open data. Critical voices and many discussions on whether open government data and national open data portals are of sufficient quality appear more frequently. Therefore, this study deals with this topic and aims to find the main challenges that can negatively impact users’ experience through an analysis of usability of 42 open data portals by applying a unified methodology on them allowing their comparative analysis to be carried out.This study highlights the weakest aspects for 42 national open data portals, pointing on both, the most common weakest points, and individual. The analysis carried out also identifies portals that can be considered as leaders and as an example for the less successful open data portals.
TIMELINESS OF OPEN DATA IN OPEN GOVERNMENT DATA PORTALS THROUGH PANDEMIC-RELA...Anastasija Nikiforova
This presentation is a supplementary material for the following article -> Nikiforova, A. (2020, October). Timeliness of open data in open government data portals through pandemic-related data: a long data way from the publisher to the user. In 2020 Fourth International Conference on Multimedia Computing, Networking and Applications (MCNA) (pp. 131-138). IEEE.
The paper addresses the “timeliness” of data in open government data (OGD) portals. It is one of the primary principles of open data, which is considered to be a success factor, while at the same time it is one of the biggest barriers that can disrupt users trust in data and even the desire to use the entire open data portal. However, assessing this aspect is a very difficult task that, in most cases, becomes an impossible for open data users. There is therefore a lack of comparative studies on the timeliness of data of different national open data portals. Unfortunately, 2020 gave the opportunity to find out this. It became easy enough to compare how long is the data path from the data holder to the OGD portal by analysing the timeliness of Covid-19-related data sets in relation to the first case observed in a country. The study thus fills the gap of comparative studies by addressing 60 countries and their OGD portals concerning the timeliness of the data, providing a report on how much and what countries provide the open data as quickly as possible. It makes it possible to understand how quickly OGD portals react to emergencies by opening and updating data for their further potential reuse, which is essential in the digital data-driven world.
Read paper here -> Nikiforova, A. (2020, October). Timeliness of open data in open government data portals through pandemic-related data: a long data way from the publisher to the user. In 2020 Fourth International Conference on Multimedia Computing, Networking and Applications (MCNA) (pp. 131-138). IEEE.https://ieeexplore.ieee.org/abstract/document/9264298?casa_token=FtfC_6bqZnsAAAAA:TaSnKrE7ZCxLyq5hvxX-X8O2sK_vZYcodTBtxoWOvaOAIFmMmy65f5dIK-kKYxFAMiC5jyl7Eeg
OPEN DATA: ECOSYSTEM, CURRENT AND FUTURE TRENDS, SUCCESS STORIES AND BARRIERSAnastasija Nikiforova
"OPEN DATA: ECOSYSTEM, CURRENT AND FUTURE TRENDS, SUCCESS STORIES AND BARRIERS" set of slides was prepared for the Guest Lecture, which I has delivered to the students of the University of South-Eastern Norway (USN), October 2021
Towards enrichment of the open government data: a stakeholder-centered determ...Anastasija Nikiforova
This set of slides is a part of the presentation prepared and delivered in the scope of the 14th International Conference on Theory and Practice of Electronic Governance (ICEGOV 2021), 6-8 October, 2021, Smart Digital Governance for Global Sustainability
It is based on the paper -> Nikiforova, A. (2021, October). Towards enrichment of the open government data: a stakeholder-centered determination of High-Value Data sets for Latvia. In 14th International Conference on Theory and Practice of Electronic Governance (pp. 367-372) -> https://dl.acm.org/doi/abs/10.1145/3494193.3494243?casa_token=bPeuwmFWwQwAAAAA:ls-xXIPK5uXDHyxtBxqsMJOCuV6ud_ip59BX8n78uJnqvql6e8H9urlDG9zzeNklRmGFwI4sCXU06w
Analysis of open health data quality using data object-driven approach to dat...Anastasija Nikiforova
This presentation is a supplementary material for the following article -> Nikiforova, A. (2019). Analysis of open health data quality using data object-driven approach to data quality evaluation: insights from a Latvian context. In IADIS International Conference e-Health (pp. 119-126).
This research focuses on the analysis of the quality of open health data that are freely available and can be used by everyone for their own purposes. The quality of open data is crucial as it can lead to unreliable decision-making and financial losses, however, the quality of open health data has even more critical role.Despite its importance, this topic is rarely discussed.Therefore, the previously proposed data object-driven approach to data quality evaluation is applied to open health data in Latvia in order to (a) evaluate their quality, highlighting common quality issues that should be considered by both, users and data publishers, (b) demonstrate that the used approach is suitable for given purpose as it is simple enough,and ensures the involvement of users even without IT and data quality knowledge (domain experts) in the data quality analysis examining data for their own purposes. The proposed solution seems to be useful in establishing communication between data users and publishers,improving the overall quality of data.
IoTSE-based Open Database Vulnerability inspection in three Baltic Countries:...Anastasija Nikiforova
This presentation is devoted to the "IoTSE-based Open Database Vulnerability inspection in three Baltic Countries: ShoBEVODSDT sees you" research paper developed by Artjoms Daskevics and Anastasija Nikiforova and presented during the The International conference on Internet of Things, Systems, Management and Security (IOTSMS2021) co-located with The 8th International Conference on Social Networks Analysis, Management and Security (SNAMS2021), December 6-9, 2021, Valencia, Spain (online)
Read paper here -> Daskevics, A., & Nikiforova, A. (2021, December). IoTSE-based open database vulnerability inspection in three Baltic countries: ShoBEVODSDT sees you. In 2021 8th International Conference on Internet of Things: Systems, Management and Security (IOTSMS) (pp. 1-8). IEEE -> https://ieeexplore.ieee.org/abstract/document/9704952?casa_token=NfEjYuud0wEAAAAA:6QxucVPuY762I3qzD6D_oWqa0B9eMUFRNMG-E7dyHKohSYIzI0bH1V9bLaAcly_Lp-Ll52ghO5Y
Stakeholder-centred Identification of Data Quality Issues: Knowledge that Can...Anastasija Nikiforova
This presentations is a supplementary material for presenting the "Stakeholder-centred Identification of Data Quality Issues: Knowledge that Can Save Your Business" (authored by Anastasija Nikiforova and Natalija Kozmina) research paper during the The International Conference on Intelligent Data Science Technologies and Applications (IDSTA2021), November 15-16, 2021. Tartu, Estonia (web-based)
Read paper here -> Nikiforova, A., & Kozmina, N. (2021, November). Stakeholder-centred Identification of Data Quality Issues: Knowledge that Can Save Your Business. In 2021 Second International Conference on Intelligent Data Science Technologies and Applications (IDSTA) (pp. 66-73). IEEE -> https://ieeexplore.ieee.org/abstract/document/9660802?casa_token=LFJa20LrXAwAAAAA:wVwhTcCPWqxdloAvDQ3-l98KkkLx70xzG3zNvIIkJbC6wvJ4VxwX_VGc3mmW_7c1T-QJlOtTiao
Invited talk "Open Data as a driver of Society 5.0: how you and your scientif...Anastasija Nikiforova
This presentation is prepared as a part of my talk on the openness (open data and open science) in the context of Society 5.0 during the International Conference and Expo on Nanotechnology and Nanomaterials. It was very pleasant to receive an invitation to deliver the talk on my recently published article Smarter Open Government Data for Society 5.0: Are Your Open Data Smart Enough? (Sensors 2021, 21(15), 5204), which I have entitled as “Open Data as a driver of Society 5.0: how you and your scientific outputs can contribute to the development of the Super Smart Society and transformation into Smart Living?“. The paper has been briefly discussed in my previous post, thus, just a few words on this talk and overall experience.
AN EXTENDED DATA OBJECT-DRIVEN APPROACH TO DATA QUALITY EVALUATION: CONTEXTUA...Anastasija Nikiforova
This presentation is a supplementary material for the following article -> Nikiforova, A., & Bicevskis, J. (2019). An Extended Data Object-driven Approach to Data Quality Evaluation: Contextual Data Quality Analysis. In ICEIS (1) (pp. 274-281).
The research is an extension of a data object-driven approach to data quality evaluation allowing to analyse data object quality in scope of multiple data objects. Previously presented approach was used to analyse one particular data object, mainly focusing on syntactic analysis. It means that the primary data object quality can be analysed against secondary data objects of unlimited number. This opportunity allows making more comprehensive, in-depth contextual data object analysis. The given analysis was applied to open data, making comparison between previously obtained results and results of application of the extended approach, underlying importance and benefits of the given extension.
Slides of the presentation by Michael Martin (ULEI, INFAI) and Martin Kaltenböck (Semantic Web Company) at the OKCon2011 in Berlin on 30th of June 2011: The LOD2 Open Government Data Stakeholder Survey
In present societies, knowledge is known as the main source of Economic prosperity and Societies that derive their economical power from the production and diffusion of information and knowledge are referred to as knowledge-based societies or economies. This paper aimed to measure Triple Helix for studying the innovation infrastructure in Iran in compare with Netherlands, Russia, and Turkey. This research is based on Webometrics methods and we performed this research in two ways: first, we used the number of hits and co-occurrence of
“university”, “industry” and “government”.
Second, we
confined our search to Rich Files. In first way; the results show that in selected countries, “University”, “Industry” And “Government” are
more integrated in Netherlands following by Russia, Turkey and Iran in recent years. Iran in compare with other countries has no a good situation. In second way; the results show a different situation. Netherlands has higher value in this indicator, following by Turkey, Iran and Russia.
This document discusses augmenting open government data with social media data. It presents a research agenda to integrate open government data and social media data. The agenda involves understanding both data sources, integrating them based on common elements, and using a proof of concept involving UK election data from 2010 to demonstrate objective and subjective views as well as an integrated view. The goal is to provide infrastructure to better exploit the potential of open data and social media data.
1. The document discusses open data in Canada and argues that open civic data and information are important for informed citizen participation in decision-making and generating innovative solutions.
2. It provides examples of open data projects in various Canadian cities like child poverty mapping and Inuit land use atlases.
3. Key principles for opening government data are outlined, and challenges are noted around issues like interoperability, licensing, and cultural change needed for more open data.
Open data barometer global report - 2nd edition yann le gigan
This document provides an introduction and overview of the Open Data Barometer report. The report analyzes global trends in open data by assessing countries' readiness, implementation, and impact of open data initiatives. It finds that while open data initiatives have spread rapidly, more work is needed to support data-enabled democracy worldwide and ensure data access, skills, and freedoms are distributed equitably. The report evaluates 86 countries across different clusters and provides recommendations for tailoring open data strategies based on countries' varying capacities and needs. It aims to contribute to understanding challenges and opportunities in realizing open data's potential to increase transparency, empower citizens, and inspire innovation.
The linked open government data and metadata lifecycleOpen Data Support
This document discusses the lifecycle of linked open government data and metadata. It begins by examining existing data and metadata lifecycles, noting that they primarily focus on the supply side. It then presents a hybrid lifecycle model that includes both supply and demand sides. The supply side covers the selection, modeling, publishing and linking of data and metadata by governments. The demand side involves finding, integrating, reusing and providing feedback on open data by consumers. The document also provides best practices for publishing data and metadata at various stages of the lifecycle.
This document discusses the current state and issues surrounding public sector data in Korea. It defines open government data as data produced by government entities that can be freely used, reused and redistributed. While Korea has built various data through e-government initiatives, most data is not truly "open" as it cannot be freely reused or used for commercial purposes without restrictions. The document also outlines some international and domestic open data portals that have been established to increase access to government data.
This presentation was provided by Jim Hahn of The University of Pennsylvania, during the NISO event "Transforming Search: What the Information Community Can and Should Build." The virtual conference was held on August 26, 2020.
Risks, Harms and Benefits Assessment Tool (Updated as of Jan 2019)UN Global Pulse
The Data Innovation Risk Assessment Tool is an initial assessment of potential risks for data use that includes seven guiding checkpoints to understand: the "Data Type" involved in the data analytics process, the "Risks and Harms" of data use, the mode and legitimacy of "Data Access", the "Data Use", the adequacy of "Data Security", the adequate level of "Communication and Transparency" and the due diligence on engagement of "Third Parties". The Assessment contains guiding comments for each checkpoint and its questions are grounded in the key international data privacy and data protection principles and concepts such as Purpose Specification, Purpose Compatibility, Data Minimization, Consent Legitimacy, Lawfulness and Fairness of data access and use.
This presentation was provided by Sebastian Kohlmeier of The Allen Institute for AI (AI2), during the NISO event "Transforming Search: What the Information Community Can and Should Build." The virtual conference was held on August 26, 2020.
The document summarizes an ENGAGE workshop discussing barriers to open data reuse in Europe and findings from the ENGAGE project. Key points include:
1) Metadata was identified as a major barrier, with needs for a rich format to facilitate discovery and addressing issues like multilinguality, data ownership, and formats.
2) ENGAGE aims to provide a single access point and tools for researchers and citizens to discover, browse, download, visualize, and submit diverse public sector data sources.
3) ENGAGE 2.0 adds additional functionalities like dataset extension, conversion, cleansing, and crowdsourcing derived datasets while maintaining provenance information.
The document introduces linked data and describes how applying linked data principles such as using URIs and HTTP to identify and link pieces of data can improve the research and commercial utility of information. Examples are given of how linked data has been applied to clinical trial metadata and government spending data. The benefits of a top-down approach to standardization using shared vocabularies and ontologies are also discussed.
Acclerating biomedical discovery with an internet of FAIR data and services -...Michel Dumontier
With its focus on improving the health and well being of people, biomedicine has always been a fertile, if not challenging domain for computational discovery science. Indeed, the existence of millions of scientific articles, thousands of databases, and hundreds of ontologies, offer exciting opportunities to reuse our collective knowledge, were we not stymied by incompatible formats, overlapping and incomplete vocabularies, unclear licensing, and heterogeneous access points. In this talk, I will discuss our work to create computational standards, platforms, and methods to wrangle knowledge into simple, but effective representations based on semantic web technologies that are maximally FAIR - Findable, Accessible, Interoperable, and Reuseable - and to further use these for biomedical knowledge discovery. But only with additional crucial developments will this emerging Internet of FAIR data and services, which is built on Semantic Web technologies, be well positioned to support automated scientific discovery on a global scale.
Inferring Cuisine – Drug Interactions Using the Linked Data ApproachMilos Jovanovik
A talk I gave at the NASA Data Bootcamp 2015, to the participants of the NASA Space Apps Challenge 2015 in Skopje, Macedonia. I presented the latest research of our team, about how the negative effect of food on prescription drugs is spread out in different cuisines / parts of the world. I also talked about the basics of RDF and Linked Data, as well as some other R&D projects based on Linked Open Data, so that the participants of the hackathon can have a better overview of the possibilities for app development over such data.
Open Data Day 2016, Km4City, L’universita’ come aggregatore di Open Data del ...Paolo Nesi
Open Data Day, UNIMORE, Modena, 5 Marzo 2016.
Aggregazione dati, experienza di Firenze,
Smart City, Km4City,
Smart Decision Support,
Data Ingestion manager,
Data aggregation,
User profiling on demand.
Mobilità: inter-modalità, bigliettazione integrata, sostenibile, scambiatori, sfruttamento stazioni, etc.,
Servizi: gov ..SUAP, edu, turismo, beni culturali, salute, etc.,
Energia: risparmio energetico, riduzione amissioni, inquinamento, etc.,
Ambiente: qualità dell’aria, fiumi, meteo, rifiuti, etc.,
… commercio, industria, etc.
... Infrastrutture critiche. resilienza
Collezionamento dati statici, quasi statici e real time, stream
Dati open: geo localizzati, servizi, statistiche, censimenti, etc.
Dati privati degli operatori: con licenze limitate per non permettere di fare profitto ad altri operatori sulla base dei loro dati
Dati personali delle persone: profili, comportamenti tramite APP, IOT, sensori, web, etc.
Integrazione dati per renderli semanticamente interoperabili, ed operare deduzioni (time, space… )
I tradizionali collettori di open data danno visioni statistiche ma non sono adatti a produrre servizi integrati
Integrazione con modelli semantici unificanti come Km4City
Control Room delle Città Metropolitane devono:
arrivare a supervisionare domini multipli e le interdipendenze fra mobilità, energia, comunicazione, servizi, flussi traffico, flussi pedonali, turismo, etc.
Migliorare la loro Resilienza, capacità di reazione ed assorbimento
ridurre i costi sociali della mobilità per le persone
consentendo minori disagi, maggiore efficienza,
maggiore sensibilità verso le necessità del cittadino,
minori emissioni, migliori condizioni ambientali;
percorsi info-formativi in modo che il cittadino cambi le abitudini non virtuose;
ridurre i costi di trasporto ed i tempi di percorrenza per gli utenti, per i gestori e le amministrazioni, tramite soluzioni di ottimizzazione.
Putting the L in front: from Open Data to Linked Open DataMartin Kaltenböck
Keynote presentation of Martin Kaltenböck (LOD2 project, Semantic Web Company) at the Government Linked Data Workshop in the course of the OGD Camp 2011 in Warsaw, Poland: Putting the L in front: from Open Data to Linked Open Data
Dimitris Skoutas presents the OpenDataMonitor
Workshop title: Open Science Monitor
Workshop overview:
Which are the measurable components of Open Science? How do we build a trustworthy, global open science monitor? This workshop will discuss a potential framework to measure Open Science, including the path from the publishing of an open policy (registries of policies and how these are represented or machine read), to the use of open methodologies, and the opening up of research results, their recording and measurement.
DAY 2 - PARALLEL SESSION 5
Presentation describing the purpose of the European Data Portal project. The launching of the European Data Portal is one of the key steps the European Commission is taking in supporting the access to public data.
AN EXTENDED DATA OBJECT-DRIVEN APPROACH TO DATA QUALITY EVALUATION: CONTEXTUA...Anastasija Nikiforova
This presentation is a supplementary material for the following article -> Nikiforova, A., & Bicevskis, J. (2019). An Extended Data Object-driven Approach to Data Quality Evaluation: Contextual Data Quality Analysis. In ICEIS (1) (pp. 274-281).
The research is an extension of a data object-driven approach to data quality evaluation allowing to analyse data object quality in scope of multiple data objects. Previously presented approach was used to analyse one particular data object, mainly focusing on syntactic analysis. It means that the primary data object quality can be analysed against secondary data objects of unlimited number. This opportunity allows making more comprehensive, in-depth contextual data object analysis. The given analysis was applied to open data, making comparison between previously obtained results and results of application of the extended approach, underlying importance and benefits of the given extension.
Slides of the presentation by Michael Martin (ULEI, INFAI) and Martin Kaltenböck (Semantic Web Company) at the OKCon2011 in Berlin on 30th of June 2011: The LOD2 Open Government Data Stakeholder Survey
In present societies, knowledge is known as the main source of Economic prosperity and Societies that derive their economical power from the production and diffusion of information and knowledge are referred to as knowledge-based societies or economies. This paper aimed to measure Triple Helix for studying the innovation infrastructure in Iran in compare with Netherlands, Russia, and Turkey. This research is based on Webometrics methods and we performed this research in two ways: first, we used the number of hits and co-occurrence of
“university”, “industry” and “government”.
Second, we
confined our search to Rich Files. In first way; the results show that in selected countries, “University”, “Industry” And “Government” are
more integrated in Netherlands following by Russia, Turkey and Iran in recent years. Iran in compare with other countries has no a good situation. In second way; the results show a different situation. Netherlands has higher value in this indicator, following by Turkey, Iran and Russia.
This document discusses augmenting open government data with social media data. It presents a research agenda to integrate open government data and social media data. The agenda involves understanding both data sources, integrating them based on common elements, and using a proof of concept involving UK election data from 2010 to demonstrate objective and subjective views as well as an integrated view. The goal is to provide infrastructure to better exploit the potential of open data and social media data.
1. The document discusses open data in Canada and argues that open civic data and information are important for informed citizen participation in decision-making and generating innovative solutions.
2. It provides examples of open data projects in various Canadian cities like child poverty mapping and Inuit land use atlases.
3. Key principles for opening government data are outlined, and challenges are noted around issues like interoperability, licensing, and cultural change needed for more open data.
Open data barometer global report - 2nd edition yann le gigan
This document provides an introduction and overview of the Open Data Barometer report. The report analyzes global trends in open data by assessing countries' readiness, implementation, and impact of open data initiatives. It finds that while open data initiatives have spread rapidly, more work is needed to support data-enabled democracy worldwide and ensure data access, skills, and freedoms are distributed equitably. The report evaluates 86 countries across different clusters and provides recommendations for tailoring open data strategies based on countries' varying capacities and needs. It aims to contribute to understanding challenges and opportunities in realizing open data's potential to increase transparency, empower citizens, and inspire innovation.
The linked open government data and metadata lifecycleOpen Data Support
This document discusses the lifecycle of linked open government data and metadata. It begins by examining existing data and metadata lifecycles, noting that they primarily focus on the supply side. It then presents a hybrid lifecycle model that includes both supply and demand sides. The supply side covers the selection, modeling, publishing and linking of data and metadata by governments. The demand side involves finding, integrating, reusing and providing feedback on open data by consumers. The document also provides best practices for publishing data and metadata at various stages of the lifecycle.
This document discusses the current state and issues surrounding public sector data in Korea. It defines open government data as data produced by government entities that can be freely used, reused and redistributed. While Korea has built various data through e-government initiatives, most data is not truly "open" as it cannot be freely reused or used for commercial purposes without restrictions. The document also outlines some international and domestic open data portals that have been established to increase access to government data.
This presentation was provided by Jim Hahn of The University of Pennsylvania, during the NISO event "Transforming Search: What the Information Community Can and Should Build." The virtual conference was held on August 26, 2020.
Risks, Harms and Benefits Assessment Tool (Updated as of Jan 2019)UN Global Pulse
The Data Innovation Risk Assessment Tool is an initial assessment of potential risks for data use that includes seven guiding checkpoints to understand: the "Data Type" involved in the data analytics process, the "Risks and Harms" of data use, the mode and legitimacy of "Data Access", the "Data Use", the adequacy of "Data Security", the adequate level of "Communication and Transparency" and the due diligence on engagement of "Third Parties". The Assessment contains guiding comments for each checkpoint and its questions are grounded in the key international data privacy and data protection principles and concepts such as Purpose Specification, Purpose Compatibility, Data Minimization, Consent Legitimacy, Lawfulness and Fairness of data access and use.
This presentation was provided by Sebastian Kohlmeier of The Allen Institute for AI (AI2), during the NISO event "Transforming Search: What the Information Community Can and Should Build." The virtual conference was held on August 26, 2020.
The document summarizes an ENGAGE workshop discussing barriers to open data reuse in Europe and findings from the ENGAGE project. Key points include:
1) Metadata was identified as a major barrier, with needs for a rich format to facilitate discovery and addressing issues like multilinguality, data ownership, and formats.
2) ENGAGE aims to provide a single access point and tools for researchers and citizens to discover, browse, download, visualize, and submit diverse public sector data sources.
3) ENGAGE 2.0 adds additional functionalities like dataset extension, conversion, cleansing, and crowdsourcing derived datasets while maintaining provenance information.
The document introduces linked data and describes how applying linked data principles such as using URIs and HTTP to identify and link pieces of data can improve the research and commercial utility of information. Examples are given of how linked data has been applied to clinical trial metadata and government spending data. The benefits of a top-down approach to standardization using shared vocabularies and ontologies are also discussed.
Acclerating biomedical discovery with an internet of FAIR data and services -...Michel Dumontier
With its focus on improving the health and well being of people, biomedicine has always been a fertile, if not challenging domain for computational discovery science. Indeed, the existence of millions of scientific articles, thousands of databases, and hundreds of ontologies, offer exciting opportunities to reuse our collective knowledge, were we not stymied by incompatible formats, overlapping and incomplete vocabularies, unclear licensing, and heterogeneous access points. In this talk, I will discuss our work to create computational standards, platforms, and methods to wrangle knowledge into simple, but effective representations based on semantic web technologies that are maximally FAIR - Findable, Accessible, Interoperable, and Reuseable - and to further use these for biomedical knowledge discovery. But only with additional crucial developments will this emerging Internet of FAIR data and services, which is built on Semantic Web technologies, be well positioned to support automated scientific discovery on a global scale.
Inferring Cuisine – Drug Interactions Using the Linked Data ApproachMilos Jovanovik
A talk I gave at the NASA Data Bootcamp 2015, to the participants of the NASA Space Apps Challenge 2015 in Skopje, Macedonia. I presented the latest research of our team, about how the negative effect of food on prescription drugs is spread out in different cuisines / parts of the world. I also talked about the basics of RDF and Linked Data, as well as some other R&D projects based on Linked Open Data, so that the participants of the hackathon can have a better overview of the possibilities for app development over such data.
Open Data Day 2016, Km4City, L’universita’ come aggregatore di Open Data del ...Paolo Nesi
Open Data Day, UNIMORE, Modena, 5 Marzo 2016.
Aggregazione dati, experienza di Firenze,
Smart City, Km4City,
Smart Decision Support,
Data Ingestion manager,
Data aggregation,
User profiling on demand.
Mobilità: inter-modalità, bigliettazione integrata, sostenibile, scambiatori, sfruttamento stazioni, etc.,
Servizi: gov ..SUAP, edu, turismo, beni culturali, salute, etc.,
Energia: risparmio energetico, riduzione amissioni, inquinamento, etc.,
Ambiente: qualità dell’aria, fiumi, meteo, rifiuti, etc.,
… commercio, industria, etc.
... Infrastrutture critiche. resilienza
Collezionamento dati statici, quasi statici e real time, stream
Dati open: geo localizzati, servizi, statistiche, censimenti, etc.
Dati privati degli operatori: con licenze limitate per non permettere di fare profitto ad altri operatori sulla base dei loro dati
Dati personali delle persone: profili, comportamenti tramite APP, IOT, sensori, web, etc.
Integrazione dati per renderli semanticamente interoperabili, ed operare deduzioni (time, space… )
I tradizionali collettori di open data danno visioni statistiche ma non sono adatti a produrre servizi integrati
Integrazione con modelli semantici unificanti come Km4City
Control Room delle Città Metropolitane devono:
arrivare a supervisionare domini multipli e le interdipendenze fra mobilità, energia, comunicazione, servizi, flussi traffico, flussi pedonali, turismo, etc.
Migliorare la loro Resilienza, capacità di reazione ed assorbimento
ridurre i costi sociali della mobilità per le persone
consentendo minori disagi, maggiore efficienza,
maggiore sensibilità verso le necessità del cittadino,
minori emissioni, migliori condizioni ambientali;
percorsi info-formativi in modo che il cittadino cambi le abitudini non virtuose;
ridurre i costi di trasporto ed i tempi di percorrenza per gli utenti, per i gestori e le amministrazioni, tramite soluzioni di ottimizzazione.
Putting the L in front: from Open Data to Linked Open DataMartin Kaltenböck
Keynote presentation of Martin Kaltenböck (LOD2 project, Semantic Web Company) at the Government Linked Data Workshop in the course of the OGD Camp 2011 in Warsaw, Poland: Putting the L in front: from Open Data to Linked Open Data
Dimitris Skoutas presents the OpenDataMonitor
Workshop title: Open Science Monitor
Workshop overview:
Which are the measurable components of Open Science? How do we build a trustworthy, global open science monitor? This workshop will discuss a potential framework to measure Open Science, including the path from the publishing of an open policy (registries of policies and how these are represented or machine read), to the use of open methodologies, and the opening up of research results, their recording and measurement.
DAY 2 - PARALLEL SESSION 5
Presentation describing the purpose of the European Data Portal project. The launching of the European Data Portal is one of the key steps the European Commission is taking in supporting the access to public data.
Where Next for Open Data in the Russian FederationAndrew Stott
This document summarizes a World Bank report on open data in Russia. It shows that Russia has made progress in opening government data but more remains to be done. The report recommends focusing on three areas: using open data to spur economic growth and business innovation, building an open data ecosystem, and ensuring technical excellence. It provides details on actions under each area, such as releasing high-value business data, supporting startups, developing skills, and adopting open standards. The document ends by calling for sustained leadership and realistic targets to take open data forward in Russia.
This document provides an overview of open data and open science. It defines open data as data that can be freely used, modified, and shared by anyone for any purpose. Open data must be openly licensed, openly accessible, and in an open format. There are many types of open data including culture, science, finance, statistics, weather, and environment data. Open data benefits include increased access, transparency, and economic value. The document also discusses open research data repositories, preprints, data visualization tools, and recommendations for increasing open data in India based on its rankings in the Open Data Inventory.
The Open Data Economy Unlocking Economic Value by Opening Government and Publ...Capgemini
Few governments are leveraging open data for economic benefits
Governments and public authorities across the world are launching Open Data initiatives. Public administration officials are now beginning to realize the value that opening up data can have. For instance, the direct impact of Open Data on the EU27 economy was estimated at €32 Billion in 2010, with an estimated annual growth rate of 7%.
However, very few governments are taking the right measures in realizing the economic benefits of Open Data. This report examines the reasons behind this tendency, drawing on an analysis of 23 select countries across the world.
This document summarizes a workshop agenda for validating indicators for an Open Science Monitor. The workshop objectives are to validate the methodology for determining indicators on open access, open research data, and open collaboration. The methodology will be refined based on feedback to provide an evidence-based view of open science trends. While the community provides feedback, the consortium leading the project is ultimately responsible for the indicators. Unpaywall is proposed as an additional data source to help identify open access publications beyond what is currently found in Scopus and Web of Science.
Rank de países do G8 que melhor aplicam Open DataVinícius Barreto
The document discusses open data efforts in G8 countries based on their progress in meeting commitments under the 2013 G8 Open Data Charter. It finds that countries varied widely in their progress, with the UK scoring highest and Russia lowest. While all countries have established open data portals, there is substantial variation in factors like data quality and quantity, use of metadata standards, and efforts to engage civil society and promote innovation. Based on their commitments and progress, countries are categorized as strong, intermediate, or weak in their open data efforts. The benefits of open data are also summarized.
This deck of slides outlines the key aspects of the Open Data Readiness Assessment or ODRA and was presented in the consultative workshop on Rwanda Open Data Policy organized by the Ministry of Youth & ICT (GoR) and the World Bank.
This document provides an overview of open data policies and practices in 13 Asian countries. It analyzes each country's knowledge economy indicators, internet penetration rates, and open data readiness based on dimensions like leadership, political frameworks, data availability, and ecosystem development. Overall development levels vary widely in the region, from highly developed countries like Japan to less developed countries such as Cambodia and Myanmar.
Open Government Data: What it is, Where it is Going, and the Opportunities fo...OECD Governance
Keynote presentation given by Ryan Androsoff (Digital Government Policy Analyst, OECD) at the 2015 EUROSAI-OLACEFS conference in Quito, Ecuador on 25 June 2015. Focus of the presentation is on Open Government Data and the opportunities for Supreme Audit Institutions presented by open data. Video of the presentation is available at: https://youtu.be/SlBfxmecJhI?t=1h50m19s
For more information on OECD's work relating to Open Government Data please see: http://www.oecd.org/gov/public-innovation/open-government-data.htm
This presentation was made by Liliana Suchodolska , at the 3rd Experts Meeting on Gender Budgeting held at the OECD Conference Centre, Paris, on 19-20 September 2019
The document discusses open data and its benefits. It outlines 5 levels or "stars" of open data, with 5 stars being the most open. Open government data can include transportation and financial data, helping cities and giving citizens visibility. A pilot open data project is proposed, starting with one UNDP dataset to understand features and stakeholder needs before a larger launch. The pilot would test an API or open data platform over 2-3 months to inform a full open data service.
Connecting citizens with public data to drive policy changeMelissa Moody
UVA Data Science Institute Master of Science in Data Science researchers Lucas Beane and Elena Gillis undertook a capstone project to investigate possible reasons for the stagnation of the Charlottesville Open Data Portal.
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)OpenAIRE
The Research Data Alliance (RDA) is an international organization focused on data sharing across disciplines. It has over 8,600 members from 137 countries working to reduce barriers to data sharing through developing infrastructure and community activities. RDA has numerous active interest groups and working groups focused on issues like specific scientific domains, data reference and sharing, community needs, data stewardship, and basic infrastructure. One recent focus is guidelines for data sharing during the COVID-19 pandemic.
EDF2014: Taru Rastas, Senior Advisor, Ministry of Communications of Finland: ...European Data Forum
Selected Talk of Taru Rastas, Senior Advisor, Ministry of Communications of Finland at the European Data Forum 2014, 19 March 2014 in Athens, Greece: Open data for transport and communications
The Open Data Barometer aims to uncover the true prevalence and impact of open data initiatives around the world. It analyses global trends, and provides comparative data on countries and regions via an in-depth methodology combining contextual data, technical assessments and secondary indicators to explore multiple dimensions of open data readiness, implementation and impact.
This is the second edition of the Open Data Barometer, completing a two-year pilot of the Barometer methodology and providing data for comparative research. This report is just one expression of the Barometer, for which full data is also available, supporting secondary research into the progression of open data policies and practices across the world.
The Open Data Barometer forms part of the World Wide Web Foundation’s work on common assessment methods for open data.
A Tale of Open Data Innovations in Five Smart CitiesAdegboyega Ojo
Open Data initiatives are increasingly considered as defining elements of emerging smart cities. However, few studies have attempted to provide a better understanding of the nature of this convergence and the impact on both domains. This paper presents findings from a detailed study of 18 open data initiatives across five smart cities – Barcelona, Chicago, Manchester, Amsterdam and Helsinki. Specifically, the study sought to understand how open data programs are shaped by the different smart cities contexts and concomitantly what kinds of innovations are enabled by open data in these cities. The findings highlight the specific impacts of open data innovation on the different smart cities domains, governance of the cities, and the nature of datasets available in the open data ecosystem.
LIBER is a network of research libraries in Europe that aims to enable world-class research. The document discusses the benefits of open data and making research data FAIR (Findable, Accessible, Interoperable, Reusable). It notes that collaboration has increased in research and is important for addressing challenges. The vision is for open access and open data to be the default in research by 2022. Barriers to open data include cultural issues and lack of skills and infrastructure. The European Union supports open science and mandates open access and open data pilot programs. Research libraries provide services to support open data and FAIR principles.
Similar a Assessment of the usability of Latvia’s open data portal or how close are we to gaining benefits from open data (20)
Data Quality for AI or AI for Data quality: advances in Data Quality Manageme...Anastasija Nikiforova
“Data is the new oil” is only partly true, since according to Forbes, data is more than oil, while according to Ataccama, “Manual Data Quality Doesn’t Cut It in 2023” – this was the main driver behind of my guest lecture entitled “Data Quality for AI or AI for Data quality: advances in Data Quality Management for the success and sustainability of emerging technologies, business and society”, as part of which we discussed what is the role of artificial intelligence in data quality management and what is the role of data quality for AI, concluding that it is not about “data quality for AI” OR “AI for data quality” but rather about AND.
We also looked at what is the current market offer regarding AI-driven data quality management, what are the pros and cons of these solutions and what are the prerequisites that we have to take into account when using them (e.g., metadata and their quality for those, which derive DQ rules based on metadata analysis), and how possibly more promising solution could be built.
We also looked at what are those data quality specificities we should consider depending on the artifact – a data object (dataset), whose owner is known / is unknown (open data), Information System, Data Warehouse, Data Lake, Data Lakehouse, Data Mesh – where, when and how DQ takes place in them? What are the current trends? And are these indeed trends or rather hype?
Towards High-Value Datasets determination for data-driven development: a syst...Anastasija Nikiforova
Slides for the talk delivered as part of EGOV-CeDEM-ePart 2023 (EGOV2023) conference, aimed at examining how HVD determination has been reflected in the literature over the years and what has been found by these studies to date, incl. the indicators used in them, involved stakeholders, data-related aspects, and frameworks, which was done by conducting a Systematic Literature Review.
Read the paper here -> https://link.springer.com/chapter/10.1007/978-3-031-41138-0_14
Public data ecosystems in and for smart cities: how to make open / Big / smar...Anastasija Nikiforova
This is a set of slides used as part of my keynote "Public data ecosystems in and for smart cities: how to make open / Big / smart / geo data ecosystems value-adding for SDG-compliant Smart Living and Society 5.0" delivered at the 5th International Conference on Advanced Research Methods and Analytics (CARMA 2023) -> https://carmaconf2023.wordpress.com/keynote-speakers/. read more here -> https://anastasijanikiforova.com/2023/06/30/keynote-at-the-5th-international-conference-on-advanced-research-methods-and-analytics-carma-2023/
Artificial Intelligence for open data or open data for artificial intelligence?Anastasija Nikiforova
This is a presentation used to deliver an invited talk for Babu Banarasi Das University (BBDU, Department of Computer Science and Engineering) Development Program «Artificial Intelligence for Sustainable Development» organized by AI Research Centre, Department of Computer Science & Engineering, ShodhGuru Research Labs, Soft Computing Research Society, IEEE UP Section, Computational Intelligence Society Chapter in 2022. Read more here -> https://anastasijanikiforova.com/2022/09/24/ai-for-open-data-or-open-data-for-ai-an-invited-talk-for-bbdu-development-program-artificial-intelligence-for-sustainable-development%f0%9f%8e%a4/
Overlooked aspects of data governance: workflow framework for enterprise data...Anastasija Nikiforova
This presentation is a supplementary material for the article "Overlooked aspects of data governance: workflow framework for enterprise data deduplication" (Azeroual, Nikiforova, Shei) presented at The International Conference on Intelligent Computing, Communication, Networking and Services (ICCNS2023).
Abstract of the paper: Data quality in companies is decisive and critical to the benefits their products and services can provide. However, in heterogeneous IT infrastructures where, e.g., different applications for Enterprise Resource Planning (ERP), Customer Relationship Management (CRM), product management, manufacturing, and marketing are used, duplicates, e.g., multiple entries for the same customer or product in a database or information system, occur. There can be several reasons for this, but the result of non-unique or duplicate records is a degraded data quality. This ultimately leads to poorer, inefficient, and inaccurate data-driven decisions. For this reason, in this paper, we develop a conceptual data governance framework for effective and efficient management of duplicate data, and improvement of data accuracy and consistency in large data ecosystems. We present methods and recommendations for companies to deal with duplicate data in a meaningful way.
Data Quality as a prerequisite for you business success: when should I start ...Anastasija Nikiforova
These are slides for my talk "Data Quality as a prerequisite for you business success: when should I start taking care of it?" I delivered as an invited keynote for HackCodeX Forum that gathered international experts to share their experience and knowledge on the emerging technologies and areas such as Artificial Intelligence, Security, Data Quality, Quantum Computing, Sustainability, Open Data, Privacy etc.
Framework for understanding quantum computing use cases from a multidisciplin...Anastasija Nikiforova
This presentation is a supplementary material for the article "Framework for understanding quantum computing use cases from a multidisciplinary perspective and future research directions" (Ukpabi, D.C., Karjaluoto, H., Botticher, A., Nikiforova, A., Petrescu, D.I., Schindler, P., Valtenbergs, V., Lehmann, L., & Yakaryılmaz, A) available at https://arxiv.org/ftp/arxiv/papers/2212/2212.13909.pdf. THe presentation, however, was delivered for QWorld Quantum Science Days 2023 | May 29-31.
Data Lake or Data Warehouse? Data Cleaning or Data Wrangling? How to Ensure t...Anastasija Nikiforova
This presentation was delivered as part of the Data Science Seminar titled “When, Why and How? The Importance of Business Intelligence“ organized by the Institute of Computer Science (University of Tartu) in cooperation with Swedbank.
In this presentation I talked about:
*“Data warehouse vs. data lake – what are they and what is the difference between them?” (structured vs unstructured, static vs dynamic (real-time data), schema-on-write vs schema on-read, ETL vs ELT) with further elaboration on What are their goals and purposes? What is their target audience? What are their pros and cons?
*“Is the Data warehouse the only data repository suitable for BI?” – no, (today) data lakes can also be suitable. And even more, both are considered the key to “a single version of the truth”. Although, if descriptive BI is the only purpose, it might still be better to stay within data warehouse. But, if you want to either have predictive BI or use your data for ML (or do not have a specific idea on how you want to use the data, but want to be able to explore your data effectively and efficiently), you know that a data warehouse might not be the best option.
*“So, the data lake will save my resources a lot, because I do not have to worry about how to store /allocate the data – just put it in one storage and voila?!” – no, in this case your data lake will turn into a data swamp! And you are forgetting about the data quality you should (must!) be thinking of!
*“But how do you prevent the data lake from becoming a data swamp?” – in short and simple terms – proper data governance & metadata management is the answer (but not as easy as it sounds – do not forget about your data engineer and be friendly with him [always… literally always :D) and also think about the culture in your organization.
*“So, the use of a data warehouse is the key to high quality data?” – no, it is not! Having ETL do not guarantee the quality of your data (transform&load is not data quality management). Think about data quality regardless of the repository!
*“Are data warehouses and data lakes the only options to consider or are we missing something?“– true! Data lakehouse!
*“If a data lakehouse is a combination of benefits of a data warehouse and data lake, is it a silver bullet?“– no, it is not! This is another option (relatively immature) to consider that may be the best bit for you, but not a panacea. Dealing with data is not easy (still)…
In addition, in this talk I also briefly introduced the ongoing research into the integration of the data lake as a data repository and data wrangling seeking for an increased data quality in IS. In short, this is somewhat like an improved data lakehouse, where we emphasize the need of data governance and data wrangling to be integrated to really get the benefits that the data lakehouses promise (although we still call it a data lake, since a data lakehouse is nut sufficiently mature concept with different definitions of it).
Putting FAIR Principles in the Context of Research Information: FAIRness for ...Anastasija Nikiforova
This presentation is a supplementary material for "Putting FAIR Principles in the Context of Research Information: FAIRness for CRIS and CRIS for FAIRness" (Otmane Azeroual, Joachim Schopfel, Janne Polonen, and Anastasija Nikiforova) paper presented at 14th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K) conference, and also received the Best Paper Award. In this presentation we raise a discussion on this topic showing that the improvement of FAIRness is a dual or bidirectional process, where CRIS promotes and contributes to the FAIRness of data and infrastructures, and FAIR principles push for further improvement in the underlying CRIS data model and format, positively affecting the sustainability of these systems and underlying artifacts. CRIS are beneficial for FAIR, and FAIR is beneficial for CRIS.
See the text here -> https://www.scitepress.org/Link.aspx?doi=10.5220/0011548700003335
Cite as -> Azeroual, O.; Schöpfel, J.; Pölönen, J. and Nikiforova, A. (2022). Putting FAIR Principles in the Context of Research Information: FAIRness for CRIS and CRIS for FAIRness. In Proceedings of the 14th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KMIS, ISBN 978-989-758-614-9; ISSN 2184-3228, pages 63-71. DOI: 10.5220/0011548700003335
Open data hackathon as a tool for increased engagement of Generation Z: to h...Anastasija Nikiforova
This is presentation for the paper "Open data hackathon as a tool for increased engagement of Generation Z: to hack or not to hack?" presented at EGETC2022.
A hackathon is known as a form of civic innovation in which participants representing citizens can point out existing problems or social needs and propose a solution. Given the high social, technical, and economic potential of open government data (OGD), the concept of open data hackathons is becoming popular around the world. This concept has become popular in Latvia with the annual hackathons organised for a specific cluster of citizens – Generation Z. This study presents the latest findings on the role of open data hackathons and the benefits that they can bring to both the society, participants, and government. First, a systematic literature review is carried out to establish a knowledge base. Then, empirical research of 4 case studies of open data hackathons for Generation Z participants held between 2018 and 2021 in Latvia is conducted to understand which ideas dominated and what were the main results of these events for the OGD initiative. It demonstrates that, despite the widespread belief that young people are indifferent to current
societal and natural problems, the ideas developed correspond to current situation and are aimed at solving them, revealing aspects for improvement in both the
provision of data, infrastructure, culture, and government- related areas.
Barriers to Openly Sharing Government Data: Towards an Open Data-adapted Inno...Anastasija Nikiforova
This is the presentation for our ongoing study "Barriers to Openly Sharing Government Data: Towards an Open Data-adapted Innovation Resistance Theory" (Anastasija Nikiforova, Anneke Zuiderwijk) presented at ICEGOV2022 conference – 15th International Conference on Theory and Practice of Electronic Governance (nominated to the Best Paper Awards).
In short, the study aims to develop an Open Government Data-adapted Innovation Resistance Theory model to empirically identify predictors affecting public agencies’ resistance to openly sharing government data. Here we want to understand:
💡what are functional and behavioural factors that facilitate or hamper opening government data by public organizations?
💡does IRT provide a new and more complete insight compared to more traditional UTAUT and TAM? – IRT has not been applied in this domain, yet, so we are checking whether it should be considered, or rather those models we are familiar so much are the best ones?
💡and additionally – does the COVID-19 pandemic had an [obvious/significant] effect on the public agencies in terms of their readiness or resistance to openly share government data?
Based on a review of the literature on both IRT research and barriers associated with open data sharing by public agencies, we developed an initial version of the model. Once the model is refined in a qualitative study (interviews with public agencies), we will validate it to study the resistance of public authorities to openly sharing government data in a quantitative study.
Read the paper and cite as -> Nikiforova A., Zuiderwijk A. (2022) Barriers to openly sharing government data: towards an open data-adapted innovation resistance theory, In 15th International Conference on Theory and Practice of Electronic Governance (ICEGOV 2022). Association for Computing Machinery, New York, NY, USA, 215–220, https://doi.org/10.1145/3560107.3560143 – best paper award nominee
Combining Data Lake and Data Wrangling for Ensuring Data Quality in CRISAnastasija Nikiforova
This presentation is a supplementary material for the "Combining Data Lake and Data Wrangling for Ensuring Data Quality in CRIS" presented at 15th International Conference on Current Research Information Systems (CRIS2022) - Linking Research Information across data spaces. It provides an insight on the ongoing study of combining data lake as a data repository and data wrangling seeking for an increased data quality in CRIS systems, although the proposed approach is domain-agnostic and can be used not only within CRIS.
Read the article here -> Azeroual, O., Schöpfel, J., Ivanovic, D., & Nikiforova, A. (2022, May). Combining Data Lake and Data Wrangling for Ensuring Data Quality in CRIS. In CRIS2022: 15th International Conference on Current Research Information Systems --> https://hal.archives-ouvertes.fr/hal-03694519/
The role of open data in the development of sustainable smart cities and smar...Anastasija Nikiforova
This presentation is a supplementary material for the guest lecture "The role of open data in the development of sustainable smart cities and smart society" I delivered for the Federal University of Technology – Paraná (Universidade Tecnológica Federal do Paraná (UTFPR)) (Brazil, May 2022).
Data security as a top priority in the digital world: preserve data value by ...Anastasija Nikiforova
Today, in the age of information and Industry 4.0, billions of data sources, including but not limited to interconnected devices (sensors, monitoring devices) forming Cyber-Physical Systems (CPS) and the Internet of Things (IoT) ecosystem, continuously generate, collect, process, and exchange data. With the rapid increase in the number of devices and information systems in use, the amount of data is increasing. Moreover, due to the digitization and variety of data being continuously produced and processed with a reference to Big Data, their value, is also growing. As a result, the risk of security breaches and data leaks. The value of data, however, is dependent on several factors, where data quality and data security that can affect the data quality if the data are accessed and corrupted, are the most vital. Data serve as the basis for decision-making, input for models, forecasts, simulations etc., which can be of high strategical and commercial / business value. This has become even more relevant in terms of COVID-19 pandemic, when in addition to affecting the health, lives, and lifestyle of billions of citizens globally, making it even more digitized, it has had a significant impact on business. This is especially the case because of challenges companies have faced in maintaining business continuity in this so-called “new normal”. However, in addition to those cybersecurity threats that are caused by changes directly related to the pandemic and its consequences, many previously known threats have become even more desirable targets for intruders, hackers. Every year millions of personal records become available online. Moreover, the popularity of IoTSE decreased a level of complexity of searching for connected devices on the internet and easy access even for novices due to the widespread popularity of step-by-step guides on how to use IoT search engine to find and gain access if insufficiently protected to webcams, routers, databases and other artifacts. A recent research demonstrated that weak data and database protection in particular is one of the key security threats. Various measures can be taken to address the issue. The aim of the study to which this presentation refers is to examine whether “traditional” vulnerability registries provide a sufficiently comprehensive view of DBMS security, or whether they should be intensively and dynamically inspected by DBMS holders by referring to Internet of Things Search Engines moving towards a sustainable and resilient digitized environment. The paper brings attention to this problem and make the reader think about data security before looking for and introducing more advanced security and protection mechanisms, which, in the absence of the above, may bring no value.
ShoBeVODSDT: Shodan and Binary Edge based vulnerable open data sources detect...Anastasija Nikiforova
This presentation is devoted to the "ShoBeVODSDT: Shodan and Binary Edge based vulnerable open data sources detection tool or what Internet of Things Search Engines know about you" research paper developed by Artjoms Daskevics and Anastasija Nikiforova and presented during the The International Conference on Intelligent Data Science Technologies and Applications (IDSTA2021), November 15-16, 2021. Tartu, Estonia (web-based).
Read paper here -> Daskevics, A., & Nikiforova, A. (2021, November). ShoBeVODSDT: Shodan and Binary Edge based vulnerable open data sources detection tool or what Internet of Things Search Engines know about you. In 2021 Second International Conference on Intelligent Data Science Technologies and Applications (IDSTA) (pp. 38-45). IEEE.
Atvērtā lekcija "Atvērto datu potenciāls" notika LU SZF maģistrantūras kursa “Datu sabiedrības vadība” ietvaros, ko nolasīja Dr.sc.comp. Anastasija Ņikiforova, LU Datorikas fakultātes docente, pētniece.
Atvērtie dati tiek uzskatīti par vērtīgu resursu, kura izmantošana ir potenciāli spējīga sniegt ievērojamus ekonomiskus, tehnoloģiskus un sociālus ieguvumus. Taču to panākšanai ir jāizpildās virknei priekšnosacījumu, kas attiecināmi gan uz datiem, gan uz infrastruktūru, gan uz lietotājiem, t.i. atvērto datu iniciatīvas veiksmes faktors ir ilgtspējīgas atvērto pārvaldes datu ekosistēmas izveide un uzturēšana. Lekcijas mērķis ir sniegt ieskatu par atvērto datu popularitāti un potenciālu tehnoloģisko un ekonomisko procesu attīstībai, uzmanību pievēršot to praktiskiem pielietojumiem gan Latvijā, gan ārpus tās, datus transformējot (inovatīvajos) risinājumos un pakalpojumos. Tāpat, ir plānots sniegts ieskatu par nozīmīgākajiem aspektiem, kas potenciāli ir spējīgi sekmēt ilgtspējīgas atvērto datu ekosistēmas izveidi, nodrošinot iespēju ikvienam interesentam atvērtus datus transformēt vērtībā.
PhD, Dc. comp.sc. Anastasija Ņikiforova ir Latvijas Universitātes Datorikas Fakultātes docente un Inovatīvo informācijas tehnoloģiju laboratorijas pētniece. Dr. Ņikiforovas pētnieciskas intereses ir saistītas ar datu pārvaldības, īpaši datu kvalitātes, un atvērto datu saistītājiem jautājumiem. LU Datorikas fakultātē papildus citiem docētājiem kursiem viņa ir izstrādājusi Specsemināru “Atvērtie dati un datu kvalitāte” un maģistra programmas kursu “Atvērtie pārvaldes dati datu-virzītā pasaulē”. Dr. Ņikiforova ir Latvijas Zinātnes padomes eksperte Inženierzinātnes un tehnoloģijas (Elektrotehnika, elektronika, informācijas un komunikāciju tehnoloģijas) un Dabaszinātnes (Datorzinātnes un informātika) nozarēs, kā arī LATA (Latvijas Atvērto Tehnoloģiju Asociācija) asociētā biedre. Viņa ir vairāk kā 25 zinātnisko rakstu (līdz-)autore, 4 no kuriem ir publicēti augstākā rangā Q1 žurnālos.
This presentation is a supplementary material for the following article -> Nikiforova, A., Bicevskis, J., & Karnitis, G. (2020, December). Towards a Concurrence Analysis in Business Processes. In 2020 Seventh International Conference on Social Networks Analysis, Management and Security (SNAMS) (pp. 1-6). IEEE.
This paper presents first steps towards a solution aimed to provide concurrent business processes analysis methodology for predicting the probability of incorrect business process execution. The aim of the paper is to (a) look at approaches to describing and dealing with the execution of concurrent processes, mainly focusing on the transaction mechanisms in database management systems, (b) present an idea and a preliminary version of an algorithm that detects the possibility of incorrect execution of concurrent business processes. Analyzing business process according to the proposed procedure allows to configure transaction processing optimally.
DATA QUALITY MODEL-BASED TESTING OF INFORMATION SYSTEMS: THE USE-CASE OF E-SC...Anastasija Nikiforova
This presentation is a supplementary material for the following article -> Nikiforova, A., Bicevskis, J., Bicevska, Z., & Oditis, I. (2020, December). Data quality model-based testing of information systems: the use-case of E-scooters. In 2020 7th International Conference on Internet of Things: Systems, Management and Security (IOTSMS) (pp. 1-8). IEEE.
The paper proposes a data quality model-based testing methodology aimed at improving testing methodology of information systems (IS) using previously proposed data quality model. The solution supposes creation of a description of the data to be processed by IS and the data quality requirements used for the development of the tests, followed by performing an automated test of the system on the generated tests verifying the correctness of data to be entered and stored in the database. The generation of tests for all possible data quality conditions creates a complete set of tests that verify the operation of the IS under all possible data quality conditions. The proposed solution is demonstrated by the real example of the system dealing with e-scooters. Although the proposed solution is demonstrated by applying it to the system that is already in use, it can also be used when developing a new system.
This paper is a supplementary material for the following article -> Bicevskis, J., Nikiforova, A., Bicevska, Z., Oditis, I., & Karnitis, G. (2019, October). A step towards a data quality theory. In 2019 Sixth International Conference on Social Networks Analysis, Management and Security (SNAMS) (pp. 303-308). IEEE.
Data quality issues have been topical for many decades. However, a unified data quality theory has not been proposed yet, since many concepts associated with the term “data quality” are not straightforward enough. The paper proposes a user-oriented data quality theory based on clearly defined concepts. The concepts are defined by using three groups of domain-specific languages (DSLs): (1) the first group uses the concept of a data object to describe the data to be analysed, (2) the second group describes the data quality requirements, and (3) the third group describes the process of data quality evaluation. The proposed idea proved to be simple enough, but at the same time very effective in identifying data defects, despite the different structures of data sets and the complexity of data. Approbation of the approach demonstrated several advantages: (a) a graphical data quality model allows defining of data quality even by non-IT and non-data quality professionals, (b) data quality model is not related to the information system that has accumulated data, i.e., this approach lets users analyse the “third-party” data, and (c) data quality can be described at least at two levels of abstraction - informally, using natural language, or formally, including executable program routines or SQL statements.
The paper proposes a user-oriented data quality theory based on clearly defined concepts. The concepts are defined byusing three groups of domain-specific languages(DSLs): (1) the first group usestheconcept of a data object to describe the data to be analysed, (2) the second group describes the data quality requirements, and (3) the third group describes the process of data quality evaluation. The proposed ideaproved to be simple enough,but at the same time very effectivein identifyingdata defects, despitethedifferent structures of data sets andthe complexity ofdata. Approbation of the approach demonstratedseveral advantages: (a) a graphical data quality model allows defining of data quality even by non-IT and non-data qualityprofessionals, (b) data quality model is not related to the information system that has accumulated data, i.e., this approach letsusers analysethe"third-party” data, and (c) data quality can be described at least attwo levelsof abstraction –informally,using natural language,or formally,including executable program routines or SQL statements.
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataKiwi Creative
Harness the power of AI-backed reports, benchmarking and data analysis to predict trends and detect anomalies in your marketing efforts.
Peter Caputa, CEO at Databox, reveals how you can discover the strategies and tools to increase your growth rate (and margins!).
From metrics to track to data habits to pick up, enhance your reporting for powerful insights to improve your B2B tech company's marketing.
- - -
This is the webinar recording from the June 2024 HubSpot User Group (HUG) for B2B Technology USA.
Watch the video recording at https://youtu.be/5vjwGfPN9lw
Sign up for future HUG events at https://events.hubspot.com/b2b-technology-usa/
Codeless Generative AI Pipelines
(GenAI with Milvus)
https://ml.dssconf.pl/user.html#!/lecture/DSSML24-041a/rate
Discover the potential of real-time streaming in the context of GenAI as we delve into the intricacies of Apache NiFi and its capabilities. Learn how this tool can significantly simplify the data engineering workflow for GenAI applications, allowing you to focus on the creative aspects rather than the technical complexities. I will guide you through practical examples and use cases, showing the impact of automation on prompt building. From data ingestion to transformation and delivery, witness how Apache NiFi streamlines the entire pipeline, ensuring a smooth and hassle-free experience.
Timothy Spann
https://www.youtube.com/@FLaNK-Stack
https://medium.com/@tspann
https://www.datainmotion.dev/
milvus, unstructured data, vector database, zilliz, cloud, vectors, python, deep learning, generative ai, genai, nifi, kafka, flink, streaming, iot, edge
Global Situational Awareness of A.I. and where its headedvikram sood
You can see the future first in San Francisco.
Over the past year, the talk of the town has shifted from $10 billion compute clusters to $100 billion clusters to trillion-dollar clusters. Every six months another zero is added to the boardroom plans. Behind the scenes, there’s a fierce scramble to secure every power contract still available for the rest of the decade, every voltage transformer that can possibly be procured. American big business is gearing up to pour trillions of dollars into a long-unseen mobilization of American industrial might. By the end of the decade, American electricity production will have grown tens of percent; from the shale fields of Pennsylvania to the solar farms of Nevada, hundreds of millions of GPUs will hum.
The AGI race has begun. We are building machines that can think and reason. By 2025/26, these machines will outpace college graduates. By the end of the decade, they will be smarter than you or I; we will have superintelligence, in the true sense of the word. Along the way, national security forces not seen in half a century will be un-leashed, and before long, The Project will be on. If we’re lucky, we’ll be in an all-out race with the CCP; if we’re unlucky, an all-out war.
Everyone is now talking about AI, but few have the faintest glimmer of what is about to hit them. Nvidia analysts still think 2024 might be close to the peak. Mainstream pundits are stuck on the wilful blindness of “it’s just predicting the next word”. They see only hype and business-as-usual; at most they entertain another internet-scale technological change.
Before long, the world will wake up. But right now, there are perhaps a few hundred people, most of them in San Francisco and the AI labs, that have situational awareness. Through whatever peculiar forces of fate, I have found myself amongst them. A few years ago, these people were derided as crazy—but they trusted the trendlines, which allowed them to correctly predict the AI advances of the past few years. Whether these people are also right about the next few years remains to be seen. But these are very smart people—the smartest people I have ever met—and they are the ones building this technology. Perhaps they will be an odd footnote in history, or perhaps they will go down in history like Szilard and Oppenheimer and Teller. If they are seeing the future even close to correctly, we are in for a wild ride.
Let me tell you what we see.
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...Social Samosa
The Modern Marketing Reckoner (MMR) is a comprehensive resource packed with POVs from 60+ industry leaders on how AI is transforming the 4 key pillars of marketing – product, place, price and promotions.
Analysis insight about a Flyball dog competition team's performanceroli9797
Insight of my analysis about a Flyball dog competition team's last year performance. Find more: https://github.com/rolandnagy-ds/flyball_race_analysis/tree/main
"Financial Odyssey: Navigating Past Performance Through Diverse Analytical Lens"sameer shah
Embark on a captivating financial journey with 'Financial Odyssey,' our hackathon project. Delve deep into the past performance of two companies as we employ an array of financial statement analysis techniques. From ratio analysis to trend analysis, uncover insights crucial for informed decision-making in the dynamic world of finance."
Learn SQL from basic queries to Advance queriesmanishkhaire30
Dive into the world of data analysis with our comprehensive guide on mastering SQL! This presentation offers a practical approach to learning SQL, focusing on real-world applications and hands-on practice. Whether you're a beginner or looking to sharpen your skills, this guide provides the tools you need to extract, analyze, and interpret data effectively.
Key Highlights:
Foundations of SQL: Understand the basics of SQL, including data retrieval, filtering, and aggregation.
Advanced Queries: Learn to craft complex queries to uncover deep insights from your data.
Data Trends and Patterns: Discover how to identify and interpret trends and patterns in your datasets.
Practical Examples: Follow step-by-step examples to apply SQL techniques in real-world scenarios.
Actionable Insights: Gain the skills to derive actionable insights that drive informed decision-making.
Join us on this journey to enhance your data analysis capabilities and unlock the full potential of SQL. Perfect for data enthusiasts, analysts, and anyone eager to harness the power of data!
#DataAnalysis #SQL #LearningSQL #DataInsights #DataScience #Analytics
The Ipsos - AI - Monitor 2024 Report.pdfSocial Samosa
According to Ipsos AI Monitor's 2024 report, 65% Indians said that products and services using AI have profoundly changed their daily life in the past 3-5 years.
Challenges of Nation Building-1.pptx with more important
Assessment of the usability of Latvia’s open data portal or how close are we to gaining benefits from open data
1. ASSESSMENT OF THE USABILITY OF LATVIA’S OPEN
DATA PORTAL OR HOW CLOSE ARE WE TO GAINING
BENEFITS FROM OPEN DATA
14th International Conference on Interfaces and Human Computer Interaction (part of 14th Multi Conference on Computer Science and Information Systems)
21 – 23 July 2020
Anastasija Nikiforova
Faculty of Computing, University of Latvia
Anastasija.Nikiforova@lu.lv
2. in order to benefit from the open data, a sufficiently high-quality and user-friendly open data portal should be provided,
allowing access to the data for all stakeholders, which would result in the transformation of the data into value and
knowledge for the society;
despite many countries develop and launch their own OGD portals, they received a great deal of criticism from both, society
and technical experts - Latvia is not an exception;
since the Latvian open data portal is relatively new, it is not included in other studies, while other portals of our neighbours
(Lithuania, Estonia, Finland) have been investigated;
in order to bring added value at the international level, this study provides brief comparison with other portals,
highlighting the aspects to be taken into account when developing and improving open data portals.
RATIONALE FOR THE STUDY
Aim: to explore Latvia’s Open Data portal, by evaluating it from different perspectives.
Research questions: whether our Open Data Portal is of adequate quality?
what are the main challenges to be addressed?
3. the usability of open data portals is being studied
even less frequently - over the last 5 years, only
6% of studies covering open data portal topic at
least a little, mention the usability;
compared to the leading countries, the topic of
Latvia’s open data portal is covered nearly 12
times rarely (1.33% of all studies, while for the
leading countries this ratio exceeds 15%).;
the popularity of this topic in scientific literature
may sometimes (but not necessarily) be linked to
the state of the open data portals, as the number
of studies demonstrates the public’s (at least the
scientific part of society) interest in the subject.
STATE OF THE ART
despite the increase in popularity of the topics of the
OGD and open data portals, the ratio of these
researches to the researches dealing with open data
does not exceed 7%
4. Existing studies:
✘ cover their own national OGD initiatives, which in most cases turns to the an assessment of national open data
portal(s), assessing them from different perspectives such as:
data, functionality, features,
stakeholder participation, stakeholder feedback;
the relevance of the data sets of a portal to the “5-stars” classification etc..
✘ focus mainly on data delivery and the data environment, considering what data providers have done to facilitate
users, but have not actually consulted users;
✘ lack a user perspective;
✘ lack a common methodology that would allow comparisons between studies and portals.
STATE OF THE ART
5. STATE OF THE ART. BENCHMARKS AND INDEXES
Global Open Data Index follows the state of the OGD of 94 countries analysing 15 key datasets per country,
only 11% of the data set entries were open according to their open definition,
✘ data are currently available only for 2016 (the evaluation process is quite complex), their finding may be considered
outdated;
Open Data Barometer provides a snapshot of OGD practices focusing on open data readiness, implementation, and emerging
impacts.
✘ the most urgent, the 4th edition assessed these aspects for a sample of 30 countries in 2017 (113 countries in 2016);
EU Open Data portal assesses 4 key aspects, namely policy, portal, impact and quality.
✔ one of the most up-to-date assessments (is used in the study)
etc.
6. BRIEFLY ABOUT THE PORTAL
Date of the launch: 2017
Owner: Latvian Ministry of Environmental Protection and Regional Development
Rank (according to European Data Portal Maturity Report): 11
Number of datasets: 413*
Number of publishers: 80*
Number of categories: 14
*July 24
7. Latvia is one of 70 countries participating in
the Open Government Partnership - an
international platform for domestic reformers that
committed to making their governments more open,
accountable, and responsive to citizens;
BRIEFLY ABOUT THE PORTAL
at the moment of its launch
33 data sets
from 13 data publishers
in July of 2018
139 data sets
from 41 publishers
in July of 2020
413 data sets
from 80 publishers.
Latvian open data portal has the highest rate of open data maturity in comparison
with neighbourhoods from Baltic States and Scandinavian countries.
Languages: Latvian and English
✔ attempt to ensure multilingualism
✘ content uniformity/ homogenity in different languages is not ensured
The most popular data formats: .CSV, .XLSX, .WMS
✔ data are machine-readable (is still a problem for many countries)
✔ the number of geospatial data increases significantly – the nature of data is up-to-date
8. The impact and portal for Latvia are the
worst aspects among impact, policy,
portal, and quality (65 and 78%);
Maturity of Latvian open data portal:
• in 2016 – 30th,
• in 2017 – 21st,
• in 2018 - 19th,
• in 2018 - 11th,
For more detail see Nikiforova, A. (2020) - Comparative analysis of national
open data portals or whether your portal is ready to bring benefits from open data.
In IADIS International Conference on International Conference on ICT, Society
and Human Beings
Rank Country 2019 2018 2017 2016
1 France 1 ↑+4 5 ↑+3 8 ↓-5 3
2 Spain 2 ↑+6 8 ↓-3 5 ↓-1 4
3 Ireland 3 3 3 ↑+7 10
4 Cyprus 4 ↓-2 2↑+22 24↓-6 18
5 Finland 5 ↓-4 1 ↑+5 6 ↑+3 9
6 Slovenia 6 ↑+1 7↑+13 20↓-7 13
7 Austria 7 ↑+3 10↓-1 9 ↓-4 5
8 Romania 8 ↓-4 4 4↑+11 15
9 Luxembourg 9 9 ↓-8 1 1
10 Netherlands 10↑+4 14↓-4 10↓-3 7
11 Latvia 11↑+8 19↑+2 21↑+9 30
12 Poland 12↓-1 11↑+12 23↑+1 24
13 Italy 13 13 13↑+6 19
14 Germany 14↑+9 25↓-23 2 ↑+4 6
15 Greece 15↓-9 6↑+20 26 26
16 Croatia 16↑+4 20↓-13 7 ↑+5 12
THE MATURITY OF OPEN DATA PORTALS
ACCORDING TO EUROPEAN DATA PORTAL
↓-n – decrease in rank by n positions compared to the previous year
↑+n – improve in rank by n positions compared to the previous year
ranking – below EU28+ average
*The analysis is carried out by author
✓ Latvia demonstrates the
most impressive result in
terms of continuous
development;
✓ only 4 countries improve
their positions from year to
year – Ireland, Latvia, Italy,
and Malta
9. Why? (Máchová et al., 2018)*** usability evaluation framework
✔ reflects all the functionality of the portal and typical tasks normally performed by users;
✔ considers a user perspective;
✔ complies with the majority principles used or mentioned by other well-known researchers ((Charalabidis et al., 2018),
(Attard et al., 2015), (Zuiderwijk et al., 2015));
✔ well-cited by other researchers.
What does it suppose?
3 categories (data specification, data set feedback, data set request) 14 aspects (see the next slide
for detail)
3-point Likert scale (1 – not fullfilled, 2 – partly, 3 - fulfilled)
***Máchová, R. et al. (2018). Usability evaluation of open data portals. Aslib Journal of Information Management
STAGE I:
a questionnaire of
Latvian citizens
(105 participants)
STAGE II:
(Máchová et al., 2018)*** usability evaluation
framework applied to the portal by 35 participants
with IT background*
*well-designed experiment requires at least 30 participants - ✔
STAGE III:
additional aspects not covered in the
applied methodology (or not detailed
enough) analysed by addressing
every data set
(395 data sets)
KEY POINTS
10. the number of non-IT experts who are aware of the
existence of Latvia’s open data portal is higher than
the number of users with IT background by 13%,
this trend is not valid in the case of its use -
only 7.7% of non-IT users use this portal regularly,
while for IT-users this ratio is 12%.
the assessment received from IT users was lower by 1
point in the case of usability and 0.5 point in the case
of quality.
105 voluntary participants representing three groups: (a) users with IT background,
(b) without IT background, divided into (b1) undergraduated and (b2) graduated.
A list of simple question, including:
(1) whether the participant knows that Latvia has its own national open data portal?
(2) how he/ she assesses it (if «yes» for the 1st question) (2.1.) from a “usability”
perspective, (2.2.) from the quality of the data.
only 53% of respondents are aware of the existence of Latvia’s open data portal,
13.5% suppose that this exists, however, are not sure,
74.3% of those, who know about the existence of the portal, have not used it,
18% used it only once and only 7.7% use it in a daily manner.
the overall [subjective] usability of the portal was assessed by 6.3 points out of 10, while the
quality of published data – 6.2 out of 10.
Despite the high results according to international assessment systems,
Latvian users are not satisfied with this portal in the highest way.
STAGE I: SURVEY
11. Category Aspect Description
Open dataset
specification
a) Description of dataset Portal provides datasets together with their description and how and for what purpose they were collected
b) Publisher of dataset Portal provides information about organization that published datasets
c) Thematic categories and tags
Portal provides thematic categories of datasets to address the main topics covered. It distinguishes categories (themes)
from tags (keywords)
d) Release date and up to date Datasets are associated with a time or period tag, that is, date published, date updated and its frequency
e) Machine-readable formats Portal provides datasets formats that are machine-readable and allow easy re-use
f) Open data licence Portal provides license information related to the use of the published datasets
g) Visualization and statistics
Portal provides visualization and analytics capabilities to gain information about a dataset, e.g. in charts or
visualizations in maps.
Open dataset
feedback
a) Documentation and tutorials Portal provides high quality of documentation and tutorials to help users
b) Forum and contact form
Portal provides an opportunity to submit feedback on a dataset from the users to providers and forum to discuss and
exchange ideas among the users
c) User rating and comments Portal provides capabilities allowing the collection of user ratings and comments
d) Social media and sharing
Portal provides the integration with social media technologies to create a distribution channel for open data and sharing
feedback
Open dataset
request
a) Request form Portal provides a form to request or suggest new type or format type of open data
b) List of requests Portal provides a list of requests received from users, including the current state of request processing
c) Involvement in the process Portal provides capabilities allowing the involvement in the same dataset
STAGE II: ASPECTS COVERED IN THE
FRAMEWORK TO BE APPLIED
12. * Categories
• I category: open data specification
• II category: open data set feedback
• III category: open data set request
Result: 31.47 out of 42
Average result: 2.25 out of 3 points
Rank #17 out of 42 countries
Rank #11, if you look on the countries included
in EDP rating
When recalculated as a percentage (where not
fullfilled is 0), the average level of usability is 62,4%
Not bad, BUT, we wish to be better
For more detail see Nikiforova, A. (2020) - Comparative analysis of national open data
portals or whether your portal is ready to bring benefits from open data. In IADIS
International Conference on International Conference on ICT, Society and Human Beings
# Country Languages I* II* III* TOTAL Rank
(EDP)
1 Cyprus EL, EN 20,2 10,63 7,34 38,17 4
2 Russia RU, EN 16,61 11,39 8,49 36,49 N/A
3 France EN, FR, ES 20,2 9,38 6,59 35,93 1
4 Spain ES, CA, GL, EU, EN 17,97 9,58 8,01 35,56 2
5 Taiwan EN, TW, CN 17,09 10,84 7,06 34,99 N/A
6 Canada EN, FR 17,57 8,97 8,22 34,76 N/A
7 Austria AU 18,84 11,59 4,31 34,74 7
8 Colombia ES, EN 17,59 8,36 8,24 34,19 N/A
9 New Zealand EN 17,67 7,26 8,78 33,71 N/A
10 Ireland EN, GA 17,7 8,79 7,09 33,58 3
11 Portugal EN, FR, ES, PT 19,01 10,69 3,65 33,35 25
12 Finland FI, SV, EN 18,7 10,85 3,35 32,9 5
13 Lithuania LT, EN 15,27 8,19 8,75 32,21 27
14 Slovenia SL +Google Translate 18,71 9,15 4,28 32,14 6
15 India EN 18,39 8,16 5,22 31,77 N/A
16 Netherlands NL 15,97 8,41 7,16 31,54 10
17 Latvia LV, EN 16,43 8,28 6,76 31,47 11 **
18 USA EN 17,54 8,97 4,63 31,14 N/A
19 Singapore EN 19,08 6,51 4,90 30,49 N/A
20 Estonia EE, EN 16,81 8,66 4,94 30,41 18
21 Slovakia SK, EN 16,08 9,48 4,58 30,14 29
n – competitive in a specific aspect, should be considered as an example
n – very weak result, improvement needs to be made, it is worth looking at leaders (in green)
RESULTS I
13. Category Aspect Points (1 to 3 points)
Open dataset specification
a) Description of dataset 2,07
b) Publisher of dataset 2,74
c) Thematic categories and tags
2,36
d) Release date and up to date 2,9
e) Machine-readable formats 2,1
f) Open data licence 3
g) Visualization and statistics 1,26
Open dataset feedback
a) Documentation and tutorials
1,98
b) Forum and contact form 2,02
c) User rating and comments 1,76
d) Social media and sharing 2,52
Open dataset request
a) Request form 2,83
b) List of requests 1,88
c) Involvement in the process 2,05
RESULTS I
14. 0 0,5 1 1,5 2 2,5 3
2,58
2,46
2,64
2,40
2,22
2,63
1,77
2,17
1,94
1,65
2,15
2,06
1,55
1,54
Average rating for 42 countries
the aspect is assessed worse than in average for 42 countries the aspect is assessed better than in average for 42 countries
good enough
should be improved in the future, but can wait
must be improved!!!
RESULTS II
15. RESULTS III
good enough
should be improved
in the future, but can
wait
must be improved!!!
Overall OK, but far away from the excellent result and leaders
Feedback must be improved
NB! Feedback is the worst aspect for open data of all countries
16. For more detail see Nikiforova, A. (2020) - Comparative analysis of
national open data portals or whether your portal is ready to bring benefits
from open data. In IADIS International Conference on International
Conference on ICT, Society and Human Beings
RESULTS IV. COMPARISON WITH 41 COUNTRY
17. One of the most crucial aspects for open data, which affects users’ intention to reuse data and increase their value, however, is rarely
ensured.
the frequency of updates promised by the data publisher VS. the actual frequency of updates
3 parameters were analysed: “created”, “last updated” and “frequency”
12,0%
2,7%
14,7%
5,4%
0,5%
4,2%
6,1%
27,7%
0,2%
18,6%
1,0%
6,1%
0,7%
0%
20%
40%
60%
80%
100%
%
of
datasets
frequency of updates
Values of "frequency of updates"
STAGE III. FREQUENCY OF UPDATES
46%
25%
29%
Whether the frequency of updates of
the data is in line with real life? (%)
yes
no
hard to say
the use of different names designating the same object, or its property was
observed (e.g. “constantly” and “constantly updated”) it is recommended to
involve a list of allowable values or to develop and introduce a vocabulary,
values such as “unknown” and “other” should not be allowed since they do not
provide supposed information.
18. STAGE III. WHETHER API IS
AVAILABLE?
data sharing in the form of machine-readable APIs is more useful for its further use
and is one of the most critical success factors for open data initiatives,
one of the most significant challenges for open data, and, in the context of Latvia, this
is also valid - only 43.04% of datasets (170 out of 395) are supplemented with API
feature.
43%
57%
Whether API is
available? (%)
yes
no
51%
49%
Whether preview option
is available? (%)
yes
no
19. STAGE III. WHETHER THE DESCRIPTION
OF THE PARAMETERS IS AVAILABLE?
a description of parameters are aimed at clarifying, what the name of each parameter means;
only 20% of datasets are supplied with a description of parameters, where the textual
description appears to be less popular, since 54.4% of the data sets are supplemented with
the description available as a separate file.
21%
79%
Whether the description of
the parameters is available?
(%)
YES
NO
45%
55%
What is their form?
text
file
93%
7%
Whether the description
of dataset is provided?
(%)
YES
NO
20. Latvia’s open data is re-used;
the most popular topics demonstrate the
potential of open data and opportunities to
improve citizens’ lives by addressing topics
that are crucial for Latvia:
tourism,
calculation queues to the hospitals or
kindergarten,
finding parking,
finding the best place of life.
!!!It is strongly recommended to publish this
information on the portal to demonstrate the
positive effects of the use of open data, thereby
increasing public participation.
STAGE III. USE-CASES OR SHOW-
CASES
it is good practice to provide users with information on the number of applications based on
open data that have been re-used – use-cases, which are able to attract more people to the
portal and its data - these data are not provided.
71.43% of 42 analysed open data portals provide this feature, including our neighbours –
Lithuania (9), and Poland (27).
France, Luxembourg or Portugal has even a “use case upload” feature,
18 portals offer a mapping between the use cases and the datasets they are based on, e.g.
Ireland.
✘
21. BUT WE ARE GOOD ENOUGH IN …
despite a list of the challenges mentioned above that has not yet been solved, Latvia’s open data portal holds sufficiently high
positions in various ratings. Among the positive aspects, there are:
✔ the number of datasets and data publishers are provided and frequently updated;
✔ two languages, namely Latvian and English, are supported;
✔ the majority of datasets are downloadable and have at least basic set of [meta]data on it;
✔ search and filter by category, data format, tags, data publisher;
✔ a social media facility that can help create a social distribution channel for open data;
✔ a request form (appears to be a problem to a list of countries);
✔ a contact facility with data publisher;
✔ guidelines and news,
etc.
However, some aspects have not been checked on whether they are working as intended, thus, it is difficult to
draw any conclusions on their quality - only the fact of their presence is highlighted.
22. The study addresses the various researchers’ calls by dividing the analysis into three parts:
(a) a survey which studies the knowledge of the Latvian society (105 participants),
(b) the application of (Máchová et al., 2018) framework to the Latvia’s open data portal (40 participants),
(c) an analysis of additional aspects inspecting each data set (395 datasets).
the current situation is being assessed, and corrective actions are being recommended;
the implementation of the guidelines defined should also improve user participation by encouraging the awareness and reuse
of open data, which appears to be one of the most important but weakest aspects;
although the results of this study are more important mainly for open data portal holders, they could also be useful for users,
as the results:
(a) point to the weakest points to be taken into account using open data portal,
(b) provide aspects to be considered when selecting the portal to be used,
(c) potential open data publishers can use the results provided as a checklist when preparing their data for publishing.
RESULTS
23. This study will be continued to carry out an in-depth analysis of the identified challenges
to propose guidelines to address them,
to provide a detailed comparison with more successful open data portals,
a detailed literature review will also follow,
In addition, it is planned to find foreign collaborators (preferably from the countries representing the
leading portals) to jointly address challenges through exchange of experience.
FUTURE WORK
24. I am grateful to the participants taking part in the experiment within my workshop “Open data and data quality”,
which allowed to collect data and led to such results,
and Latvian Open Technology Association for sharing information regarding open data hackathons.
ACKNOWLEDGMENT
25. THANK YOU!
For more information, see ResearchGate
See also anastasijanikiforova.com
For questions or any other queries, contact me via email - Anastasija.Nikiforova@lu.lv
Article: Nikiforova, A. (2020). Assessment of the usability of Latvia’s open data portal or how close are we to
gaining benefits from open data. In In IADIS 14th International Conference on Interfaces and Human
Computer Interaction (pp. 51-28).