SlideShare una empresa de Scribd logo
1 de 11
Targetj Solutions 
 REAL TIME PROJECTS 
 IEEE BASED PROJECTS 
 EMBEDDED SYSTEMS 
 PAPER PUBLICATIONS 
MATLAB PROJECTS 
 targetjsolutions@gmail.com 
 (0)9611582234, (0)9945657526
Scalable 
Scheduling of 
Updates 
in Streaming 
Data 
Warehouses
Abstract 
 We then propose a scheduling framework that handles 
the complications encountered by a stream warehouse: 
view hierarchies and priorities, data consistency, inability 
to preempt updates, heterogeneity of update jobs 
caused by different interarrival times and data volumes 
among different sources, and transient overload. 
 A novel feature of our framework is that scheduling 
decisions do not depend on properties of update jobs 
(such as deadlines), but rather on the effect of update 
jobs on data staleness. Finally, we present a suite of 
update scheduling algorithms and extensive simulation 
experiments to map out factors which affect their 
performance
Existing System 
 Recent work on streaming warehouses has focused on 
speeding up the Extract-Transform-Load (ETL) process . There 
has also been work on supporting various warehouse 
maintenance policies, such as immediate (update views 
whenever the base data change), deferred (update views only 
when queried), and periodic [10]. 
 However, there has been a little work on choosing, of all the 
tables that are now out-of-date due to the arrival of new data, 
which one should be updated next
Disadvantages 
 The problem with this approach is that new data may arrive on 
multiple streams, but there is no mechanism for limiting the 
number of tables that can be updated simultaneously. Running 
too many parallel updates can degrade performance due to 
memory and CPU-cache thrashing (multiple memoryintensive 
ETL processes are likely to exhaust virtual memory), disk-arm 
thrashing, context switching,
Proposed System 
 Many metrics have been considered in the real-time scheduling 
literature. In a typical hard realtime system, jobs must be completed 
before their deadlines a simple metric to understand and to prove 
results about. 
 In a firm real-time system, jobs can miss their deadlines, and if they do, 
they are discarded. The performance metric in a firm real-time system 
is the fraction of jobs that meet their deadlines. However, a streaming 
warehouse must load all of the data that arrive; therefore no updates 
can be discarded. 
 In a soft real-time system, late jobs are allowed to stay in the system, 
and the performance metric is lateness (or tardiness), which is the 
difference between the completion times of late jobs and their 
deadlines. 
 We are not concerned about properties of the update jobs. Instead, 
we will define a scheduling metric in terms of data staleness, roughly 
defined as the difference between the current time and the time 
stamp of the most recent record in a table
Proposed System 
 Associated with the accountability feature, we also develop two 
distinct modes for auditing: push mode and pull mode. The push mode 
refers to logs being periodically sent to the data owner or stakeholder 
while the pull mode refers to an alternative approach whereby the user 
(or another authorized party) can retrieve the logs as needed. In 
summary, our main contributions are as follows: We propose a novel 
automatic and enforceable logging mechanism in the cloud. To our 
knowledge, this is the first time a systematic approach to data 
accountability through the novel usage of JAR files is proposed. . Our 
proposed architecture is platform independent and highly 
decentralized, in that it does not require any dedicated authentication 
or storage system in place. We go beyond traditional access control in 
that we provide a certain degree of usage control for the protected 
data after these are delivered to the receiver. We conduct 
experiments on a real cloud testbed. The results demonstrate the 
efficiency, scalability, and granularity of our approach. We also provide 
a detailed security analysis and discuss the reliability and strength of our 
architecture.
Advantages 
 The goal of a streaming warehouse is to propagate new data 
across all the relevant tables and views as quickly as possible. 
Once new data are loaded, the applications and triggers 
defined on the warehouse can take immediate action. This 
allows businesses to make decisions in nearly real time, which 
may lead to increased profits, improved customer satisfaction, 
and prevention of serious problems that could develop if no 
action was taken
Software Requirements 
 Operating System : Windows XP Professional 
 Environment : Visual Studio .NET 2010 
 Language : C#.NET 
Web Technology : Active Server Pages.Net 
 Back end : MS-SQL-Server 2008
Hardware Requirements: 
Processor : Pentium III / IV 
Hard Disk : 40 GB 
Ram : 256 MB 
Monitor : 15VGA Color 
Mouse : Ball / Optical 
Keyboard : 102 Keys
Reference 
 [1] B. Adelberg, H. Garcia-Molina, and B. Kao, “Applying Update 
Streams in a Soft Real-Time Database System,” Proc. ACM SIGMOD 
Int’l Conf. Management of Data, pp. 245-256, 1995. 
 [2] B. Babcock, S. Babu, M. Datar, and R. Motwani, “Chain: 
Operator Scheduling for Memory Minimization in Data Stream 
Systems,” Proc. ACM SIGMOD Int’l Conf. Management of Data, pp. 
253-264, 2003. 
 [3] S. Babu, U. Srivastava, and J. Widom, “Exploiting K-constraints to 
Reduce Memory Overhead in Continuous Queries over Data 
Streams,” ACM Trans. Database Systems, vol. 29, no. 3, pp. 545- 580, 
2004. 
 [4] S. Baruah, “The Non-preemptive Scheduling of Periodic Tasks 
upon Multiprocessors,” Real Time Systems, vol. 32, nos. 1/2, pp. 9- 
20, 2006. 
 [5] S. Baruah, N. Cohen, C. Plaxton, and D. Varvel, “Proportionate 
Progress: A Notion of Fairness in Resource Allocation,” Algorithmica, 
vol. 15, pp. 600-625, 1996.

Más contenido relacionado

La actualidad más candente

Whitepaper : Working with Greenplum Database using Toad for Data Analysts
Whitepaper : Working with Greenplum Database using Toad for Data Analysts Whitepaper : Working with Greenplum Database using Toad for Data Analysts
Whitepaper : Working with Greenplum Database using Toad for Data Analysts EMC
 
Migrate to share point 2013 with avepoint 2.14.13
Migrate to share point 2013 with avepoint 2.14.13Migrate to share point 2013 with avepoint 2.14.13
Migrate to share point 2013 with avepoint 2.14.13Mary Leigh Mackie
 
DB2 Real-Time Analytics Meeting Wayne, PA 2015 - IDAA & DB2 Tools Update
DB2 Real-Time Analytics Meeting Wayne, PA 2015 - IDAA & DB2 Tools UpdateDB2 Real-Time Analytics Meeting Wayne, PA 2015 - IDAA & DB2 Tools Update
DB2 Real-Time Analytics Meeting Wayne, PA 2015 - IDAA & DB2 Tools UpdateBaha Majid
 
Dell Active System 800 Converged Infrastructure solution: User collaboration ...
Dell Active System 800 Converged Infrastructure solution: User collaboration ...Dell Active System 800 Converged Infrastructure solution: User collaboration ...
Dell Active System 800 Converged Infrastructure solution: User collaboration ...Principled Technologies
 
SQL in the cloud performance benchmarks
SQL in the cloud performance benchmarks SQL in the cloud performance benchmarks
SQL in the cloud performance benchmarks Thavash Govender
 
Introducing Jaspersoft 4.7
Introducing Jaspersoft 4.7Introducing Jaspersoft 4.7
Introducing Jaspersoft 4.7Mike Boyarski
 
Virtualizing Latency Sensitive Workloads and vFabric GemFire
Virtualizing Latency Sensitive Workloads and vFabric GemFireVirtualizing Latency Sensitive Workloads and vFabric GemFire
Virtualizing Latency Sensitive Workloads and vFabric GemFireCarter Shanklin
 
Demartek lenovo s3200_mixed_workload_environment_2016-01
Demartek lenovo s3200_mixed_workload_environment_2016-01Demartek lenovo s3200_mixed_workload_environment_2016-01
Demartek lenovo s3200_mixed_workload_environment_2016-01Lenovo Data Center
 
Times Ten in-memory database when time counts - Laszlo Ludas
Times Ten in-memory database when time counts - Laszlo LudasTimes Ten in-memory database when time counts - Laszlo Ludas
Times Ten in-memory database when time counts - Laszlo LudasORACLE USER GROUP ESTONIA
 
The Evolution of SQL Server as a Service - SQL Azure Managed Instance
The Evolution of SQL Server as a Service - SQL Azure Managed InstanceThe Evolution of SQL Server as a Service - SQL Azure Managed Instance
The Evolution of SQL Server as a Service - SQL Azure Managed InstanceJavier Villegas
 
MySQL enterprise backup overview
MySQL enterprise backup overviewMySQL enterprise backup overview
MySQL enterprise backup overview郁萍 王
 
Using Distributed In-Memory Computing for Fast Data Analysis
Using Distributed In-Memory Computing for Fast Data AnalysisUsing Distributed In-Memory Computing for Fast Data Analysis
Using Distributed In-Memory Computing for Fast Data AnalysisScaleOut Software
 
A Hybrid Technology Platform for Increasing the Speed of Operational Analytics
A Hybrid Technology Platform for Increasing the Speed of Operational AnalyticsA Hybrid Technology Platform for Increasing the Speed of Operational Analytics
A Hybrid Technology Platform for Increasing the Speed of Operational AnalyticsIBMGovernmentCA
 
Introduction to microsoft sql server 2008 r2
Introduction to microsoft sql server 2008 r2Introduction to microsoft sql server 2008 r2
Introduction to microsoft sql server 2008 r2Eduardo Castro
 
Azure SQL Managed Instance - SqlBits 2019
Azure SQL Managed Instance - SqlBits 2019Azure SQL Managed Instance - SqlBits 2019
Azure SQL Managed Instance - SqlBits 2019Jovan Popovic
 
IBM Pure Data System for Analytics (Netezza)
IBM Pure Data System for Analytics (Netezza)IBM Pure Data System for Analytics (Netezza)
IBM Pure Data System for Analytics (Netezza)Girish Srivastava
 

La actualidad más candente (18)

Whitepaper : Working with Greenplum Database using Toad for Data Analysts
Whitepaper : Working with Greenplum Database using Toad for Data Analysts Whitepaper : Working with Greenplum Database using Toad for Data Analysts
Whitepaper : Working with Greenplum Database using Toad for Data Analysts
 
Migrate to share point 2013 with avepoint 2.14.13
Migrate to share point 2013 with avepoint 2.14.13Migrate to share point 2013 with avepoint 2.14.13
Migrate to share point 2013 with avepoint 2.14.13
 
DB2 Real-Time Analytics Meeting Wayne, PA 2015 - IDAA & DB2 Tools Update
DB2 Real-Time Analytics Meeting Wayne, PA 2015 - IDAA & DB2 Tools UpdateDB2 Real-Time Analytics Meeting Wayne, PA 2015 - IDAA & DB2 Tools Update
DB2 Real-Time Analytics Meeting Wayne, PA 2015 - IDAA & DB2 Tools Update
 
Dell Active System 800 Converged Infrastructure solution: User collaboration ...
Dell Active System 800 Converged Infrastructure solution: User collaboration ...Dell Active System 800 Converged Infrastructure solution: User collaboration ...
Dell Active System 800 Converged Infrastructure solution: User collaboration ...
 
Oracle Data Warehouse
Oracle Data WarehouseOracle Data Warehouse
Oracle Data Warehouse
 
SQL in the cloud performance benchmarks
SQL in the cloud performance benchmarks SQL in the cloud performance benchmarks
SQL in the cloud performance benchmarks
 
Introducing Jaspersoft 4.7
Introducing Jaspersoft 4.7Introducing Jaspersoft 4.7
Introducing Jaspersoft 4.7
 
Virtualizing Latency Sensitive Workloads and vFabric GemFire
Virtualizing Latency Sensitive Workloads and vFabric GemFireVirtualizing Latency Sensitive Workloads and vFabric GemFire
Virtualizing Latency Sensitive Workloads and vFabric GemFire
 
Demartek lenovo s3200_mixed_workload_environment_2016-01
Demartek lenovo s3200_mixed_workload_environment_2016-01Demartek lenovo s3200_mixed_workload_environment_2016-01
Demartek lenovo s3200_mixed_workload_environment_2016-01
 
Times Ten in-memory database when time counts - Laszlo Ludas
Times Ten in-memory database when time counts - Laszlo LudasTimes Ten in-memory database when time counts - Laszlo Ludas
Times Ten in-memory database when time counts - Laszlo Ludas
 
The Evolution of SQL Server as a Service - SQL Azure Managed Instance
The Evolution of SQL Server as a Service - SQL Azure Managed InstanceThe Evolution of SQL Server as a Service - SQL Azure Managed Instance
The Evolution of SQL Server as a Service - SQL Azure Managed Instance
 
MySQL enterprise backup overview
MySQL enterprise backup overviewMySQL enterprise backup overview
MySQL enterprise backup overview
 
Siva-Resume
Siva-ResumeSiva-Resume
Siva-Resume
 
Using Distributed In-Memory Computing for Fast Data Analysis
Using Distributed In-Memory Computing for Fast Data AnalysisUsing Distributed In-Memory Computing for Fast Data Analysis
Using Distributed In-Memory Computing for Fast Data Analysis
 
A Hybrid Technology Platform for Increasing the Speed of Operational Analytics
A Hybrid Technology Platform for Increasing the Speed of Operational AnalyticsA Hybrid Technology Platform for Increasing the Speed of Operational Analytics
A Hybrid Technology Platform for Increasing the Speed of Operational Analytics
 
Introduction to microsoft sql server 2008 r2
Introduction to microsoft sql server 2008 r2Introduction to microsoft sql server 2008 r2
Introduction to microsoft sql server 2008 r2
 
Azure SQL Managed Instance - SqlBits 2019
Azure SQL Managed Instance - SqlBits 2019Azure SQL Managed Instance - SqlBits 2019
Azure SQL Managed Instance - SqlBits 2019
 
IBM Pure Data System for Analytics (Netezza)
IBM Pure Data System for Analytics (Netezza)IBM Pure Data System for Analytics (Netezza)
IBM Pure Data System for Analytics (Netezza)
 

Similar a Scalable scheduling of updates in streaming data warehouses

Real-Time Data Warehouse Loading Methodology Ricardo Jorge S.docx
Real-Time Data Warehouse Loading Methodology Ricardo Jorge S.docxReal-Time Data Warehouse Loading Methodology Ricardo Jorge S.docx
Real-Time Data Warehouse Loading Methodology Ricardo Jorge S.docxsodhi3
 
Oracle database performance diagnostics - before your begin
Oracle database performance diagnostics  - before your beginOracle database performance diagnostics  - before your begin
Oracle database performance diagnostics - before your beginHemant K Chitale
 
IRJET - Efficient Load Balancing in a Distributed Environment
IRJET -  	  Efficient Load Balancing in a Distributed EnvironmentIRJET -  	  Efficient Load Balancing in a Distributed Environment
IRJET - Efficient Load Balancing in a Distributed EnvironmentIRJET Journal
 
IRJET - The 3-Level Database Architectural Design for OLAP and OLTP Ops
IRJET - The 3-Level Database Architectural Design for OLAP and OLTP OpsIRJET - The 3-Level Database Architectural Design for OLAP and OLTP Ops
IRJET - The 3-Level Database Architectural Design for OLAP and OLTP OpsIRJET Journal
 
USING SEMI-SUPERVISED CLASSIFIER TO FORECAST EXTREME CPU UTILIZATION
USING SEMI-SUPERVISED CLASSIFIER TO FORECAST EXTREME CPU UTILIZATIONUSING SEMI-SUPERVISED CLASSIFIER TO FORECAST EXTREME CPU UTILIZATION
USING SEMI-SUPERVISED CLASSIFIER TO FORECAST EXTREME CPU UTILIZATIONgerogepatton
 
USING SEMI-SUPERVISED CLASSIFIER TO FORECAST EXTREME CPU UTILIZATION
USING SEMI-SUPERVISED CLASSIFIER TO FORECAST EXTREME CPU UTILIZATIONUSING SEMI-SUPERVISED CLASSIFIER TO FORECAST EXTREME CPU UTILIZATION
USING SEMI-SUPERVISED CLASSIFIER TO FORECAST EXTREME CPU UTILIZATIONijaia
 
Using Semi-supervised Classifier to Forecast Extreme CPU Utilization
Using Semi-supervised Classifier to Forecast Extreme CPU UtilizationUsing Semi-supervised Classifier to Forecast Extreme CPU Utilization
Using Semi-supervised Classifier to Forecast Extreme CPU Utilizationgerogepatton
 
Migration to Oracle 12c Made Easy Using Replication Technology
Migration to Oracle 12c Made Easy Using Replication TechnologyMigration to Oracle 12c Made Easy Using Replication Technology
Migration to Oracle 12c Made Easy Using Replication TechnologyDonna Guazzaloca-Zehl
 
Survey of streaming data warehouse update scheduling
Survey of streaming data warehouse update schedulingSurvey of streaming data warehouse update scheduling
Survey of streaming data warehouse update schedulingeSAT Journals
 
Database performance management
Database performance managementDatabase performance management
Database performance managementscottaver
 
Approved TPA along with Integrity Verification in Cloud
Approved TPA along with Integrity Verification in CloudApproved TPA along with Integrity Verification in Cloud
Approved TPA along with Integrity Verification in CloudEditor IJCATR
 
Comparison of Reporting architectures
Comparison of Reporting architecturesComparison of Reporting architectures
Comparison of Reporting architecturesRajendran Avadaiappan
 
Capacity management for ETL System
Capacity management for ETL SystemCapacity management for ETL System
Capacity management for ETL SystemASHOK BHATLA
 
Capacity Management of an ETL System
Capacity Management of an ETL SystemCapacity Management of an ETL System
Capacity Management of an ETL SystemASHOK BHATLA
 
Conspectus data warehousing appliances – fad or future
Conspectus   data warehousing appliances – fad or futureConspectus   data warehousing appliances – fad or future
Conspectus data warehousing appliances – fad or futureDavid Walker
 
Scalable scheduling of updates in streaming data warehouses
Scalable scheduling of updates in streaming data warehousesScalable scheduling of updates in streaming data warehouses
Scalable scheduling of updates in streaming data warehousesIRJET Journal
 
best-practices-for-realtime-data-wa-132882.pdf
best-practices-for-realtime-data-wa-132882.pdfbest-practices-for-realtime-data-wa-132882.pdf
best-practices-for-realtime-data-wa-132882.pdfaliramezani30
 
Architecting and Tuning IIB/eXtreme Scale for Maximum Performance and Reliabi...
Architecting and Tuning IIB/eXtreme Scale for Maximum Performance and Reliabi...Architecting and Tuning IIB/eXtreme Scale for Maximum Performance and Reliabi...
Architecting and Tuning IIB/eXtreme Scale for Maximum Performance and Reliabi...Prolifics
 
NEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATA
NEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATANEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATA
NEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATAcsandit
 

Similar a Scalable scheduling of updates in streaming data warehouses (20)

Real-Time Data Warehouse Loading Methodology Ricardo Jorge S.docx
Real-Time Data Warehouse Loading Methodology Ricardo Jorge S.docxReal-Time Data Warehouse Loading Methodology Ricardo Jorge S.docx
Real-Time Data Warehouse Loading Methodology Ricardo Jorge S.docx
 
Oracle database performance diagnostics - before your begin
Oracle database performance diagnostics  - before your beginOracle database performance diagnostics  - before your begin
Oracle database performance diagnostics - before your begin
 
IRJET - Efficient Load Balancing in a Distributed Environment
IRJET -  	  Efficient Load Balancing in a Distributed EnvironmentIRJET -  	  Efficient Load Balancing in a Distributed Environment
IRJET - Efficient Load Balancing in a Distributed Environment
 
IRJET - The 3-Level Database Architectural Design for OLAP and OLTP Ops
IRJET - The 3-Level Database Architectural Design for OLAP and OLTP OpsIRJET - The 3-Level Database Architectural Design for OLAP and OLTP Ops
IRJET - The 3-Level Database Architectural Design for OLAP and OLTP Ops
 
USING SEMI-SUPERVISED CLASSIFIER TO FORECAST EXTREME CPU UTILIZATION
USING SEMI-SUPERVISED CLASSIFIER TO FORECAST EXTREME CPU UTILIZATIONUSING SEMI-SUPERVISED CLASSIFIER TO FORECAST EXTREME CPU UTILIZATION
USING SEMI-SUPERVISED CLASSIFIER TO FORECAST EXTREME CPU UTILIZATION
 
USING SEMI-SUPERVISED CLASSIFIER TO FORECAST EXTREME CPU UTILIZATION
USING SEMI-SUPERVISED CLASSIFIER TO FORECAST EXTREME CPU UTILIZATIONUSING SEMI-SUPERVISED CLASSIFIER TO FORECAST EXTREME CPU UTILIZATION
USING SEMI-SUPERVISED CLASSIFIER TO FORECAST EXTREME CPU UTILIZATION
 
Using Semi-supervised Classifier to Forecast Extreme CPU Utilization
Using Semi-supervised Classifier to Forecast Extreme CPU UtilizationUsing Semi-supervised Classifier to Forecast Extreme CPU Utilization
Using Semi-supervised Classifier to Forecast Extreme CPU Utilization
 
Migration to Oracle 12c Made Easy Using Replication Technology
Migration to Oracle 12c Made Easy Using Replication TechnologyMigration to Oracle 12c Made Easy Using Replication Technology
Migration to Oracle 12c Made Easy Using Replication Technology
 
Survey of streaming data warehouse update scheduling
Survey of streaming data warehouse update schedulingSurvey of streaming data warehouse update scheduling
Survey of streaming data warehouse update scheduling
 
Database performance management
Database performance managementDatabase performance management
Database performance management
 
Approved TPA along with Integrity Verification in Cloud
Approved TPA along with Integrity Verification in CloudApproved TPA along with Integrity Verification in Cloud
Approved TPA along with Integrity Verification in Cloud
 
Comparison of Reporting architectures
Comparison of Reporting architecturesComparison of Reporting architectures
Comparison of Reporting architectures
 
Capacity management for ETL System
Capacity management for ETL SystemCapacity management for ETL System
Capacity management for ETL System
 
Capacity Management of an ETL System
Capacity Management of an ETL SystemCapacity Management of an ETL System
Capacity Management of an ETL System
 
Conspectus data warehousing appliances – fad or future
Conspectus   data warehousing appliances – fad or futureConspectus   data warehousing appliances – fad or future
Conspectus data warehousing appliances – fad or future
 
Scalable scheduling of updates in streaming data warehouses
Scalable scheduling of updates in streaming data warehousesScalable scheduling of updates in streaming data warehouses
Scalable scheduling of updates in streaming data warehouses
 
best-practices-for-realtime-data-wa-132882.pdf
best-practices-for-realtime-data-wa-132882.pdfbest-practices-for-realtime-data-wa-132882.pdf
best-practices-for-realtime-data-wa-132882.pdf
 
Updating and Scheduling of Streaming Web Services in Data Warehouses
Updating and Scheduling of Streaming Web Services in Data WarehousesUpdating and Scheduling of Streaming Web Services in Data Warehouses
Updating and Scheduling of Streaming Web Services in Data Warehouses
 
Architecting and Tuning IIB/eXtreme Scale for Maximum Performance and Reliabi...
Architecting and Tuning IIB/eXtreme Scale for Maximum Performance and Reliabi...Architecting and Tuning IIB/eXtreme Scale for Maximum Performance and Reliabi...
Architecting and Tuning IIB/eXtreme Scale for Maximum Performance and Reliabi...
 
NEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATA
NEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATANEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATA
NEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATA
 

Más de Finalyear Projects

Emotion recognition from facial expression using fuzzy logic
Emotion recognition from facial expression using fuzzy logicEmotion recognition from facial expression using fuzzy logic
Emotion recognition from facial expression using fuzzy logicFinalyear Projects
 
Energy efficient reverse skyline query processing over wireless sensor networks
Energy efficient reverse skyline query processing over wireless sensor networksEnergy efficient reverse skyline query processing over wireless sensor networks
Energy efficient reverse skyline query processing over wireless sensor networksFinalyear Projects
 
Single sign on assistant an authentication brokers
Single sign on assistant an authentication brokersSingle sign on assistant an authentication brokers
Single sign on assistant an authentication brokersFinalyear Projects
 
REAL TIME PROJECTS IEEE BASED PROJECTS EMBEDDED SYSTEMS PAPER PUBLICATIONS M...
REAL TIME PROJECTS  IEEE BASED PROJECTS EMBEDDED SYSTEMS PAPER PUBLICATIONS M...REAL TIME PROJECTS  IEEE BASED PROJECTS EMBEDDED SYSTEMS PAPER PUBLICATIONS M...
REAL TIME PROJECTS IEEE BASED PROJECTS EMBEDDED SYSTEMS PAPER PUBLICATIONS M...Finalyear Projects
 
Detection and localization of multiple spoofing attacks in
Detection and localization of multiple spoofing attacks inDetection and localization of multiple spoofing attacks in
Detection and localization of multiple spoofing attacks inFinalyear Projects
 
A novel vlsi dht algorithm for a highly
A novel vlsi dht algorithm for a highlyA novel vlsi dht algorithm for a highly
A novel vlsi dht algorithm for a highlyFinalyear Projects
 
A novel vlsi dht algorithm for a highly
A novel vlsi dht algorithm for a highlyA novel vlsi dht algorithm for a highly
A novel vlsi dht algorithm for a highlyFinalyear Projects
 
Fpga implementation of truncated multiplier for array multiplication
Fpga implementation of truncated multiplier for array multiplicationFpga implementation of truncated multiplier for array multiplication
Fpga implementation of truncated multiplier for array multiplicationFinalyear Projects
 
Towards energy efficient big data gathering
Towards energy efficient big data gatheringTowards energy efficient big data gathering
Towards energy efficient big data gatheringFinalyear Projects
 
Java ieee projects titles 2014 2015
Java ieee projects  titles 2014 2015Java ieee projects  titles 2014 2015
Java ieee projects titles 2014 2015Finalyear Projects
 
System proposal and crs model design applying
System proposal and crs model design applyingSystem proposal and crs model design applying
System proposal and crs model design applyingFinalyear Projects
 
Discovering emerging topics in social streams via link anomaly detection
Discovering emerging topics in social streams via link anomaly detectionDiscovering emerging topics in social streams via link anomaly detection
Discovering emerging topics in social streams via link anomaly detectionFinalyear Projects
 
Navigation by the soles of your feet
Navigation by the soles of your feetNavigation by the soles of your feet
Navigation by the soles of your feetFinalyear Projects
 
Android Mobile - Home Automation
Android Mobile - Home Automation Android Mobile - Home Automation
Android Mobile - Home Automation Finalyear Projects
 

Más de Finalyear Projects (17)

Emotion recognition from facial expression using fuzzy logic
Emotion recognition from facial expression using fuzzy logicEmotion recognition from facial expression using fuzzy logic
Emotion recognition from facial expression using fuzzy logic
 
Energy efficient reverse skyline query processing over wireless sensor networks
Energy efficient reverse skyline query processing over wireless sensor networksEnergy efficient reverse skyline query processing over wireless sensor networks
Energy efficient reverse skyline query processing over wireless sensor networks
 
Single sign on assistant an authentication brokers
Single sign on assistant an authentication brokersSingle sign on assistant an authentication brokers
Single sign on assistant an authentication brokers
 
REAL TIME PROJECTS IEEE BASED PROJECTS EMBEDDED SYSTEMS PAPER PUBLICATIONS M...
REAL TIME PROJECTS  IEEE BASED PROJECTS EMBEDDED SYSTEMS PAPER PUBLICATIONS M...REAL TIME PROJECTS  IEEE BASED PROJECTS EMBEDDED SYSTEMS PAPER PUBLICATIONS M...
REAL TIME PROJECTS IEEE BASED PROJECTS EMBEDDED SYSTEMS PAPER PUBLICATIONS M...
 
IEEE Projects 2014-2015
IEEE Projects 2014-2015IEEE Projects 2014-2015
IEEE Projects 2014-2015
 
Detection and localization of multiple spoofing attacks in
Detection and localization of multiple spoofing attacks inDetection and localization of multiple spoofing attacks in
Detection and localization of multiple spoofing attacks in
 
A novel vlsi dht algorithm for a highly
A novel vlsi dht algorithm for a highlyA novel vlsi dht algorithm for a highly
A novel vlsi dht algorithm for a highly
 
A novel vlsi dht algorithm for a highly
A novel vlsi dht algorithm for a highlyA novel vlsi dht algorithm for a highly
A novel vlsi dht algorithm for a highly
 
Fpga implementation of truncated multiplier for array multiplication
Fpga implementation of truncated multiplier for array multiplicationFpga implementation of truncated multiplier for array multiplication
Fpga implementation of truncated multiplier for array multiplication
 
Keyword query routing
Keyword query routingKeyword query routing
Keyword query routing
 
Towards energy efficient big data gathering
Towards energy efficient big data gatheringTowards energy efficient big data gathering
Towards energy efficient big data gathering
 
Java ieee projects titles 2014 2015
Java ieee projects  titles 2014 2015Java ieee projects  titles 2014 2015
Java ieee projects titles 2014 2015
 
System proposal and crs model design applying
System proposal and crs model design applyingSystem proposal and crs model design applying
System proposal and crs model design applying
 
Discovering emerging topics in social streams via link anomaly detection
Discovering emerging topics in social streams via link anomaly detectionDiscovering emerging topics in social streams via link anomaly detection
Discovering emerging topics in social streams via link anomaly detection
 
Navigation by the soles of your feet
Navigation by the soles of your feetNavigation by the soles of your feet
Navigation by the soles of your feet
 
Android Mobile - Home Automation
Android Mobile - Home Automation Android Mobile - Home Automation
Android Mobile - Home Automation
 
Home automation
Home automationHome automation
Home automation
 

Scalable scheduling of updates in streaming data warehouses

  • 1. Targetj Solutions  REAL TIME PROJECTS  IEEE BASED PROJECTS  EMBEDDED SYSTEMS  PAPER PUBLICATIONS MATLAB PROJECTS  targetjsolutions@gmail.com  (0)9611582234, (0)9945657526
  • 2. Scalable Scheduling of Updates in Streaming Data Warehouses
  • 3. Abstract  We then propose a scheduling framework that handles the complications encountered by a stream warehouse: view hierarchies and priorities, data consistency, inability to preempt updates, heterogeneity of update jobs caused by different interarrival times and data volumes among different sources, and transient overload.  A novel feature of our framework is that scheduling decisions do not depend on properties of update jobs (such as deadlines), but rather on the effect of update jobs on data staleness. Finally, we present a suite of update scheduling algorithms and extensive simulation experiments to map out factors which affect their performance
  • 4. Existing System  Recent work on streaming warehouses has focused on speeding up the Extract-Transform-Load (ETL) process . There has also been work on supporting various warehouse maintenance policies, such as immediate (update views whenever the base data change), deferred (update views only when queried), and periodic [10].  However, there has been a little work on choosing, of all the tables that are now out-of-date due to the arrival of new data, which one should be updated next
  • 5. Disadvantages  The problem with this approach is that new data may arrive on multiple streams, but there is no mechanism for limiting the number of tables that can be updated simultaneously. Running too many parallel updates can degrade performance due to memory and CPU-cache thrashing (multiple memoryintensive ETL processes are likely to exhaust virtual memory), disk-arm thrashing, context switching,
  • 6. Proposed System  Many metrics have been considered in the real-time scheduling literature. In a typical hard realtime system, jobs must be completed before their deadlines a simple metric to understand and to prove results about.  In a firm real-time system, jobs can miss their deadlines, and if they do, they are discarded. The performance metric in a firm real-time system is the fraction of jobs that meet their deadlines. However, a streaming warehouse must load all of the data that arrive; therefore no updates can be discarded.  In a soft real-time system, late jobs are allowed to stay in the system, and the performance metric is lateness (or tardiness), which is the difference between the completion times of late jobs and their deadlines.  We are not concerned about properties of the update jobs. Instead, we will define a scheduling metric in terms of data staleness, roughly defined as the difference between the current time and the time stamp of the most recent record in a table
  • 7. Proposed System  Associated with the accountability feature, we also develop two distinct modes for auditing: push mode and pull mode. The push mode refers to logs being periodically sent to the data owner or stakeholder while the pull mode refers to an alternative approach whereby the user (or another authorized party) can retrieve the logs as needed. In summary, our main contributions are as follows: We propose a novel automatic and enforceable logging mechanism in the cloud. To our knowledge, this is the first time a systematic approach to data accountability through the novel usage of JAR files is proposed. . Our proposed architecture is platform independent and highly decentralized, in that it does not require any dedicated authentication or storage system in place. We go beyond traditional access control in that we provide a certain degree of usage control for the protected data after these are delivered to the receiver. We conduct experiments on a real cloud testbed. The results demonstrate the efficiency, scalability, and granularity of our approach. We also provide a detailed security analysis and discuss the reliability and strength of our architecture.
  • 8. Advantages  The goal of a streaming warehouse is to propagate new data across all the relevant tables and views as quickly as possible. Once new data are loaded, the applications and triggers defined on the warehouse can take immediate action. This allows businesses to make decisions in nearly real time, which may lead to increased profits, improved customer satisfaction, and prevention of serious problems that could develop if no action was taken
  • 9. Software Requirements  Operating System : Windows XP Professional  Environment : Visual Studio .NET 2010  Language : C#.NET Web Technology : Active Server Pages.Net  Back end : MS-SQL-Server 2008
  • 10. Hardware Requirements: Processor : Pentium III / IV Hard Disk : 40 GB Ram : 256 MB Monitor : 15VGA Color Mouse : Ball / Optical Keyboard : 102 Keys
  • 11. Reference  [1] B. Adelberg, H. Garcia-Molina, and B. Kao, “Applying Update Streams in a Soft Real-Time Database System,” Proc. ACM SIGMOD Int’l Conf. Management of Data, pp. 245-256, 1995.  [2] B. Babcock, S. Babu, M. Datar, and R. Motwani, “Chain: Operator Scheduling for Memory Minimization in Data Stream Systems,” Proc. ACM SIGMOD Int’l Conf. Management of Data, pp. 253-264, 2003.  [3] S. Babu, U. Srivastava, and J. Widom, “Exploiting K-constraints to Reduce Memory Overhead in Continuous Queries over Data Streams,” ACM Trans. Database Systems, vol. 29, no. 3, pp. 545- 580, 2004.  [4] S. Baruah, “The Non-preemptive Scheduling of Periodic Tasks upon Multiprocessors,” Real Time Systems, vol. 32, nos. 1/2, pp. 9- 20, 2006.  [5] S. Baruah, N. Cohen, C. Plaxton, and D. Varvel, “Proportionate Progress: A Notion of Fairness in Resource Allocation,” Algorithmica, vol. 15, pp. 600-625, 1996.