SlideShare una empresa de Scribd logo
1 de 5
Descargar para leer sin conexión
International Journal Of Computational Engineering Research (ijceronline.com) Vol. 2 Issue. 5



                 A Data Throughput Prediction Using Scheduling And
                              Assignment Technique
                      M.Rajarajeswari 1, P.R.Kandasamy2, T.Ravichandran3
                       1. Research Scholar, Dept. of Mathematics Karpagam University,Coimbatore.
                   2.Professor and Head, Dept. of M.CA Hindusthan Institute of Technology,coimbatore.
                             3.The Principal, Hindusthan Institute of Technology, coimbatore.


Abstract:
Task computing is a computation to fill the gap between tasks (what user wants to be done), and services (functionalities that
are available to the user). Task computing seeks to redefine how users interact with and use computing environments. Wide
distributed many-task computing (MTC) environment aims to bridge the gap between two computing paradigms, high
throughput computing (HTC) and high-performance computing (HPC). In a wide distributed many-task computing
environment, data sharing between participating clusters may become a major performance constriction. In this project, we
present the design and implementation of an application-layer data throughput prediction and optimization service for many-
task computing in widely distributed environments using Operation research. This service uses multiple parallel TCP streams
which are used to find maximum data distribution stream through assignment model which is to improve the end-to-end
throughput of data transfers in the network. A novel mathematical model (optimization model) is developed to determine the
number of parallel streams, required to achieve the best network performance. This model can predict the optimal number of
parallel streams with as few as three prediction points (i.e. three Switching points). We implement this new service in the
Stork Data Scheduler model, where the prediction points can be obtained using Iperf and GridFTP samplings technique. Our
results show that the prediction cost plus the optimized transfer time is much less than the non optimized transfer time in
most cases. As a result, Stork data model evaluates and transfer jobs with optimization service based sampling rate and no. of
task is given as input, so our system can be completed much earlier, compared to non optimized data transfer jobs.

Key words: Optimization, Assignment Technique, Stork scheduling Data throughput.

Modules:
1) Construction of Grid Computing Architecture.
 2) Applying Optimization Service.
 3) Integration with Stork Data cheduler.
4) Applying Quantity Control of Sampling Data.
5) Performance Comparison.

Existing System:
TCP is the most widely adopted transport protocol but it has major bottleneck. So we have gone for other different
implementation techniques, in both at the transport and application layers, to overcome the inefficient network utilization of
the TCP protocol. At the transport layer, different variations of TCP have been implemented to more efficiently utilize high-
speed networks. At the application layer, other improvements are proposed on top of the regular TCP, such as opening
multiple parallel streams or tuning the buffer size. Parallel TCP streams are able to achieve high network throughput by
behaving like a single giant stream, which is the combination of n streams, and getting an unfair share of the available
bandwidth.

Disadvantage Of System:
     In a widely distributed many-task computing ernvionment, data communication between participating clusters may
       become a major performance bottleneck.
     TCP to fully utilize the available network bandwidth. This becomes a major bottleneck, especially in wide-area high
       speed networks, where both bandwidth and delay properties are too large, which, in turn, results in a large delay
       before the bandwidth is fully saturated.
     Inefficient network utilization.


Issn 2250-3005(online)                                         September| 2012                              Page 1306
International Journal Of Computational Engineering Research (ijceronline.com) Vol. 2 Issue. 5



Proposed System:
   We present the design and implementation of a service that will provide the user with the optimal number of parallel TCP
streams as well as a provision of the estimated time and throughput for a specific data transfer. A novel mathematical model
(optimization model) is developed to determine the number of parallel streams, required to achieve the best network
performance. This model can predict the optimal number of parallel streams with as few as three prediction points (i.e. three
Switching points). We implement this new service in the Stork Data Scheduler model, where the prediction points can be
obtained using Iperf and GridFTP samplings technique.

Advantage of System:
    The prediction models, the quantity control of sampling and the algorithms applied using the mathematical models.
    We have improved an existing prediction model by using three prediction points and adapting a full second order
       equation or an equation where the order is determined dynamically. We have designed an exponentially increasing
       sampling strategy to get the data pairs for prediction
    The algorithm to instantiate the throughput function with respect to the number of parallel streams can avoid the
       ineffectiveness of the prediction models due to some unexpected sampling data pairs.
    We propose to find a solution to satisfy both the time limitation and the accuracy requirements. Our approach
       doubles the number of parallel streams for every iteration of sampling, and observes the corresponding throughput.
    We implement this new service in the Stork Data Scheduler, where the prediction points can be obtained using Iperf
       and GridFTP samplings

 Implementation module:
 In this project we have implemented the optimization service, based on both Iperf and GridFTP. The structure of our design
 and presents two scenarios based on both, GridFTP and Iperf versions of the service. For the GridFTP version, these hosts
 would have GridFTP servers. For the Iperf version, these hosts would have Iperf servers running as well as a small remote
 module (TranServer) that will make a request to Iperf. Optimization server is the orchestrator host, designated to perform
 the optimization of TCP parameters and store the resultant data. It also has to be recognized by the sites, since the third-
 party sampling of throughput data will be performed by it. User/Client represents the host that sends out the request of
 optimization to the server. All of these hosts are connected via LAN. When a user wants to transfer data between two site ,
 the user will first send a request that consists of source and destination addresses, file size, and an optional buffer size
 parameter to the optimization server, which process the request and respond to the user with the optimal parallel stream
 number to do the transfer. The buffer size parameter is an optional parameter, which is given to the GridFTP protocol to set
 the buffer size to a different value than the system set buffer size. At the same time, the optimization server will estimate the
 optimal throughput that can be achieved, and the time needed to finish the specified transfer between two site. This
 information is then returned back to the User/Client making the request. Stork is a batch scheduler, specialized in data
 placement and movement. In this implementation, Stork is extended to support both estimation and optimization tasks. A
 task is categorized as an estimation task, if only estimated information regarding to the specific data movement is reported
 without the actual transfer. On the other hand, a task is categorized as optimization, if the specific data movement is
 performed, according to the optimized estimation results.

Mathematical Model :
          A novel mathematical model (optimization model) is developed to determine the number of parallel streams,
required to achieve the best network performance. This model can predict the optimal number of parallel streams with as few
as three prediction points (i.e. three Switching points).We propose to find a solution to satisfy both the time limitation and the
accuracy requirements. Our approach doubles the number of parallel streams for every iteration of sampling, and observes the
corresponding throughput. While the throughput increases, if the slope of the curve is below a threshold between successive
iterations, the sampling stops. Another stopping condition is: if the throughput decreases compared to the previous iteration
before reaching that threshold.
Assignment Problem:
Consider an matrix with n rows and n columns, rows will be consider as grid let and columns will as jobs.
Like,




Issn 2250-3005(online)                                           September| 2012                               Page 1307
International Journal Of Computational Engineering Research (ijceronline.com) Vol. 2 Issue. 5




                                         Job 1       Job 2    …….           Job n

                           Grid 1          Task       Task2 …              Task
                                           1                ….             n
                           Grid 2           Tas       Task …               Task
                                            k1        2     ….             n
                           …….
                                            Tas        Task          …     Task
                           Grid n           k1         2             ….    n
There will be more than one job for each grid so assign problem occur. to solve this problem we are going for new
mathematical model to solve this problem. Three condition occur
    1) Find out the minim value of each row and subtract least value with same row task value. Make least value as zero.
        If process matrix diagonally comes as “0” then process stop and job will assign successfully and job assign
        successfully.
                                                 Job 1       Job2                 Job n


                                    Grid
                                                     10       3           … 0
                                    1
                                                                          …
                                    Grid                                  .
                                                                          … 7
                                                      6          0
                                    2                                     …
                                    …….                                     .
                                                      0          6         … 7
                                    Grid                                   …
                                    n                                         .
2) Find out the minim value of each row and Subtract least value with same row task value. Make least value as zero. if
   column wise matrix diagonally comes as “0” then , then Find out minim value of each column and subtract least value
   with same row column value. Make least value as zero. Then if process matrix diagonally comes as “0” then process
   stop and job will assign successfully and job assign successfully.

                                             Job 1        Job 2       …….         Job n

                                Grid 1           10          0            …   3
                                                                          …
                                Grid 2            6          0            …
                                                                          .   7
                                                                          ….
                                 …….
                                                 6           0              … 7
                                Grid n                                     ….
    3) Find out the minim value of each row and subtract least value with same row task value. Make least value as zero.
       If column wise matrix diagonally comes as “0” then, then Find out minim value of each column and subtract least
       value with same row column value. Make least value as zero. Then if process matrix diagonally will comes as “0”
       then process stop and that job will be discard .


Issn 2250-3005(online)                                           September| 2012                        Page 1308
International Journal Of Computational Engineering Research (ijceronline.com) Vol. 2 Issue. 5




                                                   Job 1       Job 2         …….              Job n

                                     Grid 1            10            0         …          3
                                                                               …
                                     Grid 2              6           0         …
                                                                               .          7
                                                                               ….
                                     …….
                                                       6             0           …        7
                                     Grid n                                      ….


                                                 Job 1       Job 2         …….            Job n

                                  Grid 1
                                                    10         0             … 4
                                                                             …
                                  Grid 2             6          0            …
                                                                             .   7
                                                                             ….
                                  …….
                                                     0          6             …       7
                                  Grid n                                      ….
Experimental results:
 Test Scenario         Pre-Condition             Test Case                      Expected Output             Actual Output         Result
   Applying         Client send request        Check the client              Client Requested will be        File received         Pass
 Optimization       to EOS server then      request will be Valid         response to client with some       successfully
    Service         that request will be   request or not(only txt           information (number of
                     transfer to Gridftp      format file can be           stream taken to transfer the
                           server            download/Estimate           file and throughput achieve at
                                            from Gridftp server )            the time of transfer file)
 Integration with    If More than one       Stork data scheduler           Priority will be assigned to       According to         Pass
    Stork Data          Client make        make schedule for the          each user request(first come     priority the request
    Scheduler         request to EOS         incoming request             first process) / if estimation   process in Gridftp
                           server             exceed than one            request will be give most first        server and
                                           request/ checking the                     priority               response given to
                                              is Estimation or                                                    client
                                              optimization Of
                                                   service
Performance Comparison




Issn 2250-3005(online)                                                   September| 2012                                   Page 1309
International Journal Of Computational Engineering Research (ijceronline.com) Vol. 2 Issue. 5




Conclusion:
          This study describes the design and implementation of a network throughput prediction and optimization service for
many-task computing in widely distributed environments. This involves the selection of prediction models, the mathematical
models. We have improved an existing prediction model by using three prediction points and adapting a full second order
equation, or an equation where the order is determined dynamically. We have designed an exponentially increasing sampling
strategy to get the data pairs prediction. We implement this new service in the Stork Data Scheduler, where the prediction
points can be obtained using Iperf and GridFTP samplings. The experimental results justify our improved models as well as
the algorithms applied to the implementation. When used within the Stork Data Scheduler, the optimization service decreases
the total transfer time for a large number of data transfer jobs submitted to the scheduler significantly compared to the non
optimized Stork transfers.

Reference:
[1]    I. Raicu, I. Foster, and Y. Zhao, “Many-Task Computing for Grids and Supercomputers,” Proc. IEEE Workshop Many-
       Task Computing on Grids and Supercomputers (MTAGS), 2008.
[2]    Louisiana Optical Network Initiative (LONI), http://www.loni.org/, 2010.
[3]    Energy Sciences Network (ESNet), http://www.es.net/, 2010. [4] TeraGrid, http://www.teragrid.org/, 2010.
[5]    S. Floyd, “Rfc 3649: Highspeed TCP for Large Congestion Windows,” 2003.
[6]    R. Kelly, “Scalable TCP: Improving Performance in Highspeed Wide Area Networks,” Computer Comm. Rev., vol. 32,
       Experiments,” IEEE Network, vol. 19, no. 1, pp. 4-11, Feb. 2005. no. 2, pp. 83- 91, 2003.
[7]    C. Jin, D.X. Wei, S.H. Low, G. Buhrmaster, J. Bunn, D.H. Choe, R.L.A. Cottrell, J.C. Doyle, W. Feng, O. Martin, H.
       Newman, F. Paganini, S. Ravot, and S. Singh, “Fast TCP: From Theory to Networks,” Computer Comm. Rev., vol. 32,
       Experiments,” IEEE Network, vol. 19, no. 1, pp. 4-11, Feb. 2005.
[8]    H. Sivakumar, S. Bailey, and R.L. Grossman, “Psockets: The Case for Application-Level Network Striping for Data
       Intensive Applications Using High Speed Wide Area Networks,” Proc.
       IEEE Super Computing Conf. (SC ’00), p. 63, Nov. 2000.
[9]    J. Lee, D. Gunter, B. Tierney, B. Allcock, J. Bester, J. Bresnahan, and S. Tuecke, “Applied Techniques for High
       Bandwidth Data Transfers across Wide Area Networks,” Proc. Int’l Conf. Computing in High Energy and Nuclear
       Physics (CHEP ’01), Sept. 2001.
[10]   H. Balakrishman, V.N. Padmanabhan, S. Seshan, and R.H.K.M. Stemm, “TCP Behavior of a Busy Internet Server:
       Analysis and Improvements,” Proc. IEEE INFOCOM ’98, pp. 252-262, Mar. 1998.
[11]   T.J. Hacker, B.D. Noble, and B.D. Atley, “Adaptive Data Block Scheduling for Parallel Streams,” Proc. IEEE Int’l
       Symp. High Performance Distributed Computing (HPDC ’05), pp. 265-275, July 2005.
[12]   L. Eggert, J. Heideman, and J. Touch, “Effects of Ensemble TCP,” ACM Computer Comm. Rev., vol. 30, no. 1, pp. 15-
       29, 2000.
[13]   R.P. Karrer, J. Park, and J. Kim, “TCP-ROME: Performance and Fairness in Parallel Downloads for Web and Real
       Time Multimedia Streaming Applications,” technical report, Deutsche Telekom Labs, 2006.
[14]   D. Lu, Y. Qiao, and P.A. Dinda, “Characterizing and Predicting TCP Throughput on the Wide Area Network,” Proc.
       IEEE Int’l Conf. Distributed Computing Systems (ICDCS ’05), pp. 414-424, 2005.
[15]   E. Yildirim, D. Yin, and T. Kosar, “Prediction of Optimal Parallelism Level in Wide Area Data Transfers,” to be
       published in IEEE Trans. Parallel and Distributed Systems, 2010.



Issn 2250-3005(online)                                        September| 2012                              Page 1310

Más contenido relacionado

La actualidad más candente

A Comparison of Serial and Parallel Substring Matching Algorithms
A Comparison of Serial and Parallel Substring Matching AlgorithmsA Comparison of Serial and Parallel Substring Matching Algorithms
A Comparison of Serial and Parallel Substring Matching Algorithmszexin wan
 
Vol 16 No 2 - July-December 2016
Vol 16 No 2 - July-December 2016Vol 16 No 2 - July-December 2016
Vol 16 No 2 - July-December 2016ijcsbi
 
Parallel Batch-Dynamic Graphs: Algorithms and Lower Bounds
Parallel Batch-Dynamic Graphs: Algorithms and Lower BoundsParallel Batch-Dynamic Graphs: Algorithms and Lower Bounds
Parallel Batch-Dynamic Graphs: Algorithms and Lower BoundsSubhajit Sahu
 
Basic communication operations - One to all Broadcast
Basic communication operations - One to all BroadcastBasic communication operations - One to all Broadcast
Basic communication operations - One to all BroadcastRashiJoshi11
 
Congestion Control through Load Balancing Technique for Mobile Networks: A Cl...
Congestion Control through Load Balancing Technique for Mobile Networks: A Cl...Congestion Control through Load Balancing Technique for Mobile Networks: A Cl...
Congestion Control through Load Balancing Technique for Mobile Networks: A Cl...IDES Editor
 
A Tale of Data Pattern Discovery in Parallel
A Tale of Data Pattern Discovery in ParallelA Tale of Data Pattern Discovery in Parallel
A Tale of Data Pattern Discovery in ParallelJenny Liu
 
Query optimization for_sensor_networks
Query optimization for_sensor_networksQuery optimization for_sensor_networks
Query optimization for_sensor_networksHarshavardhan Achrekar
 
A QUANTITATIVE ANALYSIS OF HANDOVER TIME AT MAC LAYER FOR WIRELESS MOBILE NET...
A QUANTITATIVE ANALYSIS OF HANDOVER TIME AT MAC LAYER FOR WIRELESS MOBILE NET...A QUANTITATIVE ANALYSIS OF HANDOVER TIME AT MAC LAYER FOR WIRELESS MOBILE NET...
A QUANTITATIVE ANALYSIS OF HANDOVER TIME AT MAC LAYER FOR WIRELESS MOBILE NET...ijwmn
 
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...cscpconf
 

La actualidad más candente (20)

Chap9 slides
Chap9 slidesChap9 slides
Chap9 slides
 
A Comparison of Serial and Parallel Substring Matching Algorithms
A Comparison of Serial and Parallel Substring Matching AlgorithmsA Comparison of Serial and Parallel Substring Matching Algorithms
A Comparison of Serial and Parallel Substring Matching Algorithms
 
Vol 16 No 2 - July-December 2016
Vol 16 No 2 - July-December 2016Vol 16 No 2 - July-December 2016
Vol 16 No 2 - July-December 2016
 
Journal paper 1
Journal paper 1Journal paper 1
Journal paper 1
 
Chap5 slides
Chap5 slidesChap5 slides
Chap5 slides
 
Parallel Batch-Dynamic Graphs: Algorithms and Lower Bounds
Parallel Batch-Dynamic Graphs: Algorithms and Lower BoundsParallel Batch-Dynamic Graphs: Algorithms and Lower Bounds
Parallel Batch-Dynamic Graphs: Algorithms and Lower Bounds
 
Fairness in Transfer Control Protocol for Congestion Control in Multiplicativ...
Fairness in Transfer Control Protocol for Congestion Control in Multiplicativ...Fairness in Transfer Control Protocol for Congestion Control in Multiplicativ...
Fairness in Transfer Control Protocol for Congestion Control in Multiplicativ...
 
Broadcast in Hypercube
Broadcast in HypercubeBroadcast in Hypercube
Broadcast in Hypercube
 
Basic communication operations - One to all Broadcast
Basic communication operations - One to all BroadcastBasic communication operations - One to all Broadcast
Basic communication operations - One to all Broadcast
 
Congestion Control through Load Balancing Technique for Mobile Networks: A Cl...
Congestion Control through Load Balancing Technique for Mobile Networks: A Cl...Congestion Control through Load Balancing Technique for Mobile Networks: A Cl...
Congestion Control through Load Balancing Technique for Mobile Networks: A Cl...
 
Chapter 4 pc
Chapter 4 pcChapter 4 pc
Chapter 4 pc
 
A Tale of Data Pattern Discovery in Parallel
A Tale of Data Pattern Discovery in ParallelA Tale of Data Pattern Discovery in Parallel
A Tale of Data Pattern Discovery in Parallel
 
Eryk_Kulikowski_a4
Eryk_Kulikowski_a4Eryk_Kulikowski_a4
Eryk_Kulikowski_a4
 
Query optimization for_sensor_networks
Query optimization for_sensor_networksQuery optimization for_sensor_networks
Query optimization for_sensor_networks
 
Chap6 slides
Chap6 slidesChap6 slides
Chap6 slides
 
Chap8 slides
Chap8 slidesChap8 slides
Chap8 slides
 
A QUANTITATIVE ANALYSIS OF HANDOVER TIME AT MAC LAYER FOR WIRELESS MOBILE NET...
A QUANTITATIVE ANALYSIS OF HANDOVER TIME AT MAC LAYER FOR WIRELESS MOBILE NET...A QUANTITATIVE ANALYSIS OF HANDOVER TIME AT MAC LAYER FOR WIRELESS MOBILE NET...
A QUANTITATIVE ANALYSIS OF HANDOVER TIME AT MAC LAYER FOR WIRELESS MOBILE NET...
 
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...
 
Chapter 3 pc
Chapter 3 pcChapter 3 pc
Chapter 3 pc
 
Chap2 slides
Chap2 slidesChap2 slides
Chap2 slides
 

Similar a IJCER (www.ijceronline.com) International Journal of computational Engineering research

Comprehensive Performance Evaluation on Multiplication of Matrices using MPI
Comprehensive Performance Evaluation on Multiplication of Matrices using MPIComprehensive Performance Evaluation on Multiplication of Matrices using MPI
Comprehensive Performance Evaluation on Multiplication of Matrices using MPIijtsrd
 
Pretzel: optimized Machine Learning framework for low-latency and high throug...
Pretzel: optimized Machine Learning framework for low-latency and high throug...Pretzel: optimized Machine Learning framework for low-latency and high throug...
Pretzel: optimized Machine Learning framework for low-latency and high throug...NECST Lab @ Politecnico di Milano
 
cis97003
cis97003cis97003
cis97003perfj
 
Algorithm selection for sorting in embedded and mobile systems
Algorithm selection for sorting in embedded and mobile systemsAlgorithm selection for sorting in embedded and mobile systems
Algorithm selection for sorting in embedded and mobile systemsJigisha Aryya
 
Job Scheduling on the Grid Environment using Max-Min Firefly Algorithm
Job Scheduling on the Grid Environment using Max-Min  Firefly AlgorithmJob Scheduling on the Grid Environment using Max-Min  Firefly Algorithm
Job Scheduling on the Grid Environment using Max-Min Firefly AlgorithmEditor IJCATR
 
Computer Network Performance Evaluation Based on Different Data Packet Size U...
Computer Network Performance Evaluation Based on Different Data Packet Size U...Computer Network Performance Evaluation Based on Different Data Packet Size U...
Computer Network Performance Evaluation Based on Different Data Packet Size U...Jaipal Dhobale
 
Adaptive check-pointing and replication strategy to tolerate faults in comput...
Adaptive check-pointing and replication strategy to tolerate faults in comput...Adaptive check-pointing and replication strategy to tolerate faults in comput...
Adaptive check-pointing and replication strategy to tolerate faults in comput...IOSR Journals
 
DESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORM
DESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORMDESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORM
DESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORMsipij
 
Bounded ant colony algorithm for task Allocation on a network of homogeneous ...
Bounded ant colony algorithm for task Allocation on a network of homogeneous ...Bounded ant colony algorithm for task Allocation on a network of homogeneous ...
Bounded ant colony algorithm for task Allocation on a network of homogeneous ...ijcsit
 
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...Derryck Lamptey, MPhil, CISSP
 
user_defined_functions_forinterpolation
user_defined_functions_forinterpolationuser_defined_functions_forinterpolation
user_defined_functions_forinterpolationsushanth tiruvaipati
 
Implementing Map Reduce Based Edmonds-Karp Algorithm to Determine Maximum Flo...
Implementing Map Reduce Based Edmonds-Karp Algorithm to Determine Maximum Flo...Implementing Map Reduce Based Edmonds-Karp Algorithm to Determine Maximum Flo...
Implementing Map Reduce Based Edmonds-Karp Algorithm to Determine Maximum Flo...paperpublications3
 
Network Flow Pattern Extraction by Clustering Eugine Kang
Network Flow Pattern Extraction by Clustering Eugine KangNetwork Flow Pattern Extraction by Clustering Eugine Kang
Network Flow Pattern Extraction by Clustering Eugine KangEugine Kang
 

Similar a IJCER (www.ijceronline.com) International Journal of computational Engineering research (20)

FrackingPaper
FrackingPaperFrackingPaper
FrackingPaper
 
Comprehensive Performance Evaluation on Multiplication of Matrices using MPI
Comprehensive Performance Evaluation on Multiplication of Matrices using MPIComprehensive Performance Evaluation on Multiplication of Matrices using MPI
Comprehensive Performance Evaluation on Multiplication of Matrices using MPI
 
Pretzel: optimized Machine Learning framework for low-latency and high throug...
Pretzel: optimized Machine Learning framework for low-latency and high throug...Pretzel: optimized Machine Learning framework for low-latency and high throug...
Pretzel: optimized Machine Learning framework for low-latency and high throug...
 
cis97003
cis97003cis97003
cis97003
 
Algorithm selection for sorting in embedded and mobile systems
Algorithm selection for sorting in embedded and mobile systemsAlgorithm selection for sorting in embedded and mobile systems
Algorithm selection for sorting in embedded and mobile systems
 
Job Scheduling on the Grid Environment using Max-Min Firefly Algorithm
Job Scheduling on the Grid Environment using Max-Min  Firefly AlgorithmJob Scheduling on the Grid Environment using Max-Min  Firefly Algorithm
Job Scheduling on the Grid Environment using Max-Min Firefly Algorithm
 
A Novel Approach in Scheduling Of the Real- Time Tasks In Heterogeneous Multi...
A Novel Approach in Scheduling Of the Real- Time Tasks In Heterogeneous Multi...A Novel Approach in Scheduling Of the Real- Time Tasks In Heterogeneous Multi...
A Novel Approach in Scheduling Of the Real- Time Tasks In Heterogeneous Multi...
 
Computer Network Performance Evaluation Based on Different Data Packet Size U...
Computer Network Performance Evaluation Based on Different Data Packet Size U...Computer Network Performance Evaluation Based on Different Data Packet Size U...
Computer Network Performance Evaluation Based on Different Data Packet Size U...
 
Gk3611601162
Gk3611601162Gk3611601162
Gk3611601162
 
Adaptive check-pointing and replication strategy to tolerate faults in comput...
Adaptive check-pointing and replication strategy to tolerate faults in comput...Adaptive check-pointing and replication strategy to tolerate faults in comput...
Adaptive check-pointing and replication strategy to tolerate faults in comput...
 
E01113138
E01113138E01113138
E01113138
 
DESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORM
DESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORMDESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORM
DESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORM
 
Bounded ant colony algorithm for task Allocation on a network of homogeneous ...
Bounded ant colony algorithm for task Allocation on a network of homogeneous ...Bounded ant colony algorithm for task Allocation on a network of homogeneous ...
Bounded ant colony algorithm for task Allocation on a network of homogeneous ...
 
1844 1849
1844 18491844 1849
1844 1849
 
1844 1849
1844 18491844 1849
1844 1849
 
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...
 
user_defined_functions_forinterpolation
user_defined_functions_forinterpolationuser_defined_functions_forinterpolation
user_defined_functions_forinterpolation
 
Implementing Map Reduce Based Edmonds-Karp Algorithm to Determine Maximum Flo...
Implementing Map Reduce Based Edmonds-Karp Algorithm to Determine Maximum Flo...Implementing Map Reduce Based Edmonds-Karp Algorithm to Determine Maximum Flo...
Implementing Map Reduce Based Edmonds-Karp Algorithm to Determine Maximum Flo...
 
Fulltext
FulltextFulltext
Fulltext
 
Network Flow Pattern Extraction by Clustering Eugine Kang
Network Flow Pattern Extraction by Clustering Eugine KangNetwork Flow Pattern Extraction by Clustering Eugine Kang
Network Flow Pattern Extraction by Clustering Eugine Kang
 

Último

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 

Último (20)

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 

IJCER (www.ijceronline.com) International Journal of computational Engineering research

  • 1. International Journal Of Computational Engineering Research (ijceronline.com) Vol. 2 Issue. 5 A Data Throughput Prediction Using Scheduling And Assignment Technique M.Rajarajeswari 1, P.R.Kandasamy2, T.Ravichandran3 1. Research Scholar, Dept. of Mathematics Karpagam University,Coimbatore. 2.Professor and Head, Dept. of M.CA Hindusthan Institute of Technology,coimbatore. 3.The Principal, Hindusthan Institute of Technology, coimbatore. Abstract: Task computing is a computation to fill the gap between tasks (what user wants to be done), and services (functionalities that are available to the user). Task computing seeks to redefine how users interact with and use computing environments. Wide distributed many-task computing (MTC) environment aims to bridge the gap between two computing paradigms, high throughput computing (HTC) and high-performance computing (HPC). In a wide distributed many-task computing environment, data sharing between participating clusters may become a major performance constriction. In this project, we present the design and implementation of an application-layer data throughput prediction and optimization service for many- task computing in widely distributed environments using Operation research. This service uses multiple parallel TCP streams which are used to find maximum data distribution stream through assignment model which is to improve the end-to-end throughput of data transfers in the network. A novel mathematical model (optimization model) is developed to determine the number of parallel streams, required to achieve the best network performance. This model can predict the optimal number of parallel streams with as few as three prediction points (i.e. three Switching points). We implement this new service in the Stork Data Scheduler model, where the prediction points can be obtained using Iperf and GridFTP samplings technique. Our results show that the prediction cost plus the optimized transfer time is much less than the non optimized transfer time in most cases. As a result, Stork data model evaluates and transfer jobs with optimization service based sampling rate and no. of task is given as input, so our system can be completed much earlier, compared to non optimized data transfer jobs. Key words: Optimization, Assignment Technique, Stork scheduling Data throughput. Modules: 1) Construction of Grid Computing Architecture. 2) Applying Optimization Service. 3) Integration with Stork Data cheduler. 4) Applying Quantity Control of Sampling Data. 5) Performance Comparison. Existing System: TCP is the most widely adopted transport protocol but it has major bottleneck. So we have gone for other different implementation techniques, in both at the transport and application layers, to overcome the inefficient network utilization of the TCP protocol. At the transport layer, different variations of TCP have been implemented to more efficiently utilize high- speed networks. At the application layer, other improvements are proposed on top of the regular TCP, such as opening multiple parallel streams or tuning the buffer size. Parallel TCP streams are able to achieve high network throughput by behaving like a single giant stream, which is the combination of n streams, and getting an unfair share of the available bandwidth. Disadvantage Of System:  In a widely distributed many-task computing ernvionment, data communication between participating clusters may become a major performance bottleneck.  TCP to fully utilize the available network bandwidth. This becomes a major bottleneck, especially in wide-area high speed networks, where both bandwidth and delay properties are too large, which, in turn, results in a large delay before the bandwidth is fully saturated.  Inefficient network utilization. Issn 2250-3005(online) September| 2012 Page 1306
  • 2. International Journal Of Computational Engineering Research (ijceronline.com) Vol. 2 Issue. 5 Proposed System: We present the design and implementation of a service that will provide the user with the optimal number of parallel TCP streams as well as a provision of the estimated time and throughput for a specific data transfer. A novel mathematical model (optimization model) is developed to determine the number of parallel streams, required to achieve the best network performance. This model can predict the optimal number of parallel streams with as few as three prediction points (i.e. three Switching points). We implement this new service in the Stork Data Scheduler model, where the prediction points can be obtained using Iperf and GridFTP samplings technique. Advantage of System:  The prediction models, the quantity control of sampling and the algorithms applied using the mathematical models.  We have improved an existing prediction model by using three prediction points and adapting a full second order equation or an equation where the order is determined dynamically. We have designed an exponentially increasing sampling strategy to get the data pairs for prediction  The algorithm to instantiate the throughput function with respect to the number of parallel streams can avoid the ineffectiveness of the prediction models due to some unexpected sampling data pairs.  We propose to find a solution to satisfy both the time limitation and the accuracy requirements. Our approach doubles the number of parallel streams for every iteration of sampling, and observes the corresponding throughput.  We implement this new service in the Stork Data Scheduler, where the prediction points can be obtained using Iperf and GridFTP samplings Implementation module: In this project we have implemented the optimization service, based on both Iperf and GridFTP. The structure of our design and presents two scenarios based on both, GridFTP and Iperf versions of the service. For the GridFTP version, these hosts would have GridFTP servers. For the Iperf version, these hosts would have Iperf servers running as well as a small remote module (TranServer) that will make a request to Iperf. Optimization server is the orchestrator host, designated to perform the optimization of TCP parameters and store the resultant data. It also has to be recognized by the sites, since the third- party sampling of throughput data will be performed by it. User/Client represents the host that sends out the request of optimization to the server. All of these hosts are connected via LAN. When a user wants to transfer data between two site , the user will first send a request that consists of source and destination addresses, file size, and an optional buffer size parameter to the optimization server, which process the request and respond to the user with the optimal parallel stream number to do the transfer. The buffer size parameter is an optional parameter, which is given to the GridFTP protocol to set the buffer size to a different value than the system set buffer size. At the same time, the optimization server will estimate the optimal throughput that can be achieved, and the time needed to finish the specified transfer between two site. This information is then returned back to the User/Client making the request. Stork is a batch scheduler, specialized in data placement and movement. In this implementation, Stork is extended to support both estimation and optimization tasks. A task is categorized as an estimation task, if only estimated information regarding to the specific data movement is reported without the actual transfer. On the other hand, a task is categorized as optimization, if the specific data movement is performed, according to the optimized estimation results. Mathematical Model : A novel mathematical model (optimization model) is developed to determine the number of parallel streams, required to achieve the best network performance. This model can predict the optimal number of parallel streams with as few as three prediction points (i.e. three Switching points).We propose to find a solution to satisfy both the time limitation and the accuracy requirements. Our approach doubles the number of parallel streams for every iteration of sampling, and observes the corresponding throughput. While the throughput increases, if the slope of the curve is below a threshold between successive iterations, the sampling stops. Another stopping condition is: if the throughput decreases compared to the previous iteration before reaching that threshold. Assignment Problem: Consider an matrix with n rows and n columns, rows will be consider as grid let and columns will as jobs. Like, Issn 2250-3005(online) September| 2012 Page 1307
  • 3. International Journal Of Computational Engineering Research (ijceronline.com) Vol. 2 Issue. 5 Job 1 Job 2 ……. Job n Grid 1 Task Task2 … Task 1 …. n Grid 2 Tas Task … Task k1 2 …. n ……. Tas Task … Task Grid n k1 2 …. n There will be more than one job for each grid so assign problem occur. to solve this problem we are going for new mathematical model to solve this problem. Three condition occur 1) Find out the minim value of each row and subtract least value with same row task value. Make least value as zero. If process matrix diagonally comes as “0” then process stop and job will assign successfully and job assign successfully. Job 1 Job2 Job n Grid 10 3 … 0 1 … Grid . … 7 6 0 2 … ……. . 0 6 … 7 Grid … n . 2) Find out the minim value of each row and Subtract least value with same row task value. Make least value as zero. if column wise matrix diagonally comes as “0” then , then Find out minim value of each column and subtract least value with same row column value. Make least value as zero. Then if process matrix diagonally comes as “0” then process stop and job will assign successfully and job assign successfully. Job 1 Job 2 ……. Job n Grid 1 10 0 … 3 … Grid 2 6 0 … . 7 …. ……. 6 0 … 7 Grid n …. 3) Find out the minim value of each row and subtract least value with same row task value. Make least value as zero. If column wise matrix diagonally comes as “0” then, then Find out minim value of each column and subtract least value with same row column value. Make least value as zero. Then if process matrix diagonally will comes as “0” then process stop and that job will be discard . Issn 2250-3005(online) September| 2012 Page 1308
  • 4. International Journal Of Computational Engineering Research (ijceronline.com) Vol. 2 Issue. 5 Job 1 Job 2 ……. Job n Grid 1 10 0 … 3 … Grid 2 6 0 … . 7 …. ……. 6 0 … 7 Grid n …. Job 1 Job 2 ……. Job n Grid 1 10 0 … 4 … Grid 2 6 0 … . 7 …. ……. 0 6 … 7 Grid n …. Experimental results: Test Scenario Pre-Condition Test Case Expected Output Actual Output Result Applying Client send request Check the client Client Requested will be File received Pass Optimization to EOS server then request will be Valid response to client with some successfully Service that request will be request or not(only txt information (number of transfer to Gridftp format file can be stream taken to transfer the server download/Estimate file and throughput achieve at from Gridftp server ) the time of transfer file) Integration with If More than one Stork data scheduler Priority will be assigned to According to Pass Stork Data Client make make schedule for the each user request(first come priority the request Scheduler request to EOS incoming request first process) / if estimation process in Gridftp server exceed than one request will be give most first server and request/ checking the priority response given to is Estimation or client optimization Of service Performance Comparison Issn 2250-3005(online) September| 2012 Page 1309
  • 5. International Journal Of Computational Engineering Research (ijceronline.com) Vol. 2 Issue. 5 Conclusion: This study describes the design and implementation of a network throughput prediction and optimization service for many-task computing in widely distributed environments. This involves the selection of prediction models, the mathematical models. We have improved an existing prediction model by using three prediction points and adapting a full second order equation, or an equation where the order is determined dynamically. We have designed an exponentially increasing sampling strategy to get the data pairs prediction. We implement this new service in the Stork Data Scheduler, where the prediction points can be obtained using Iperf and GridFTP samplings. The experimental results justify our improved models as well as the algorithms applied to the implementation. When used within the Stork Data Scheduler, the optimization service decreases the total transfer time for a large number of data transfer jobs submitted to the scheduler significantly compared to the non optimized Stork transfers. Reference: [1] I. Raicu, I. Foster, and Y. Zhao, “Many-Task Computing for Grids and Supercomputers,” Proc. IEEE Workshop Many- Task Computing on Grids and Supercomputers (MTAGS), 2008. [2] Louisiana Optical Network Initiative (LONI), http://www.loni.org/, 2010. [3] Energy Sciences Network (ESNet), http://www.es.net/, 2010. [4] TeraGrid, http://www.teragrid.org/, 2010. [5] S. Floyd, “Rfc 3649: Highspeed TCP for Large Congestion Windows,” 2003. [6] R. Kelly, “Scalable TCP: Improving Performance in Highspeed Wide Area Networks,” Computer Comm. Rev., vol. 32, Experiments,” IEEE Network, vol. 19, no. 1, pp. 4-11, Feb. 2005. no. 2, pp. 83- 91, 2003. [7] C. Jin, D.X. Wei, S.H. Low, G. Buhrmaster, J. Bunn, D.H. Choe, R.L.A. Cottrell, J.C. Doyle, W. Feng, O. Martin, H. Newman, F. Paganini, S. Ravot, and S. Singh, “Fast TCP: From Theory to Networks,” Computer Comm. Rev., vol. 32, Experiments,” IEEE Network, vol. 19, no. 1, pp. 4-11, Feb. 2005. [8] H. Sivakumar, S. Bailey, and R.L. Grossman, “Psockets: The Case for Application-Level Network Striping for Data Intensive Applications Using High Speed Wide Area Networks,” Proc. IEEE Super Computing Conf. (SC ’00), p. 63, Nov. 2000. [9] J. Lee, D. Gunter, B. Tierney, B. Allcock, J. Bester, J. Bresnahan, and S. Tuecke, “Applied Techniques for High Bandwidth Data Transfers across Wide Area Networks,” Proc. Int’l Conf. Computing in High Energy and Nuclear Physics (CHEP ’01), Sept. 2001. [10] H. Balakrishman, V.N. Padmanabhan, S. Seshan, and R.H.K.M. Stemm, “TCP Behavior of a Busy Internet Server: Analysis and Improvements,” Proc. IEEE INFOCOM ’98, pp. 252-262, Mar. 1998. [11] T.J. Hacker, B.D. Noble, and B.D. Atley, “Adaptive Data Block Scheduling for Parallel Streams,” Proc. IEEE Int’l Symp. High Performance Distributed Computing (HPDC ’05), pp. 265-275, July 2005. [12] L. Eggert, J. Heideman, and J. Touch, “Effects of Ensemble TCP,” ACM Computer Comm. Rev., vol. 30, no. 1, pp. 15- 29, 2000. [13] R.P. Karrer, J. Park, and J. Kim, “TCP-ROME: Performance and Fairness in Parallel Downloads for Web and Real Time Multimedia Streaming Applications,” technical report, Deutsche Telekom Labs, 2006. [14] D. Lu, Y. Qiao, and P.A. Dinda, “Characterizing and Predicting TCP Throughput on the Wide Area Network,” Proc. IEEE Int’l Conf. Distributed Computing Systems (ICDCS ’05), pp. 414-424, 2005. [15] E. Yildirim, D. Yin, and T. Kosar, “Prediction of Optimal Parallelism Level in Wide Area Data Transfers,” to be published in IEEE Trans. Parallel and Distributed Systems, 2010. Issn 2250-3005(online) September| 2012 Page 1310