SlideShare una empresa de Scribd logo
1 de 6
Descargar para leer sin conexión
P.Annapurna Int. Journal of Engineering Research and Applications www.ijera.com 
ISSN : 2248-9622, Vol. 4, Issue 9( Version 5), September 2014, pp.87-92 
www.ijera.com 87 | P a g e 
Design and Implementation of Quintuple Processor Architecture Using FPGA P.Annapurna1, B.Naga Nancharamma2 1M. TECH STUDENT (VLSI), VKR, VNB&AGK COLLEGE OF ENGINEERING, GUDIVADA, JNTUK (A. P.), INDIA. 2M. TECH, ASST.PROF, VKR, VNB&AGK COLLEGE OF ENGINEERING, GUDIVADA, JNTUK (A. P.), INDIA. ABSTRACT The advanced quintuple processor core is a design philosophy that has become a mainstream in Scientific and engineering applications. Increasing performance and gate capacity of recent FPGA devices permit complex logic systems to be implemented on a single programmable device. The embedded multiprocessors face a new problem with thread synchronization. It is caused by the distributed memory, when thread synchronization is violated the processors can access the same value at the same time. Basically the processor performance can be increased by adopting clock scaling technique and micro architectural Enhancements. Therefore, Designed a new Architecture called Advanced Concurrent Computing. This is implemented on the FPGA chip using VHDL. The advanced Concurrent Computing architecture performs a simultaneous use of both parallel and distributed computing. The full architecture of quintuple processor core designed for realistic to perform arithmetic, logical, shifting and bit manipulation operations. The proposed advanced quintuple processor core contains Homogeneous RISC processors, added with pipelined processing units, multi bus organization and I/O ports along with the other functional elements required to implement embedded SOC solutions. The designed quintuple performance issues like area, speed and power dissipation and propagation delay are analyzed at 90nm process technology using Xilinx tool. 
Keywords – FPGA, VHDL, RISC. 
I. INTRODUCTION 
In recent years, the processor speed is enormously increases due to the advanced computer programming language and components size. The component size can be drastically reduces due to the advanced VLSI technology. In this paper presents, five individual RISC processors are cascaded together and externally connect the local memory blocks for eliminating the thread synchronization problem. The aim of this project is to design and synthesis of Quintuple processor core for advanced concurrent computing in VHDL. Simulation refers to applying stimulus and it is a set of input parameters to the soft code and checking the validity of the output. Synthesis is the conversion of high level description of design to gate level net lists. In the quintuple processor environment multi double the number of processors per chip with each new technology generation. In the past, processor performance was closely tied to clock frequency, which resulted in more power consumption and more successful computing performance gains. This unsustainable trend was reversed by quintuple processor core Technology that delivers processors with higher overall performance and better performance per watt. The microprocessor industry is now at a crossroads due to hitting the limit of the 
power that a single integrated circuit can dissipate. To continue the pattern of increasing performance, semiconductor companies have been forced to replace the single large power-inefficient processor with several smaller power efficient processors operating in parallel and at the same time, because of the growing needs rapidly increasing the design costs and the value in providing flexible platforms, many segments of embedded computing are moving away from ASIC designs to families of programmable platforms. On-chip multi processors appear to be the most promising approach to deliver competitive high performance on these programmable platforms like FPGA. As a result, both general-Purpose computing and advanced computing are being served by on-chip multiprocessor systems. The term quintuple processor core was coined to describe a microprocessor with multiple processors or cores on a single chip. 
Quintuple core consists of multi connected processors that are capable of communicating. This can be done on a single chip where the processors are connected typically by a multi organization bus. Most modern supercomputers have used this multiprocessor technique. A parallel system is presented with more than one task, known as threads. 
RESEARCH ARTICLE OPEN ACCESS
P.Annapurna Int. Journal of Engineering Research and Applications www.ijera.com 
ISSN : 2248-9622, Vol. 4, Issue 9( Version 5), September 2014, pp.87-92 
www.ijera.com 88 | P a g e 
Multiprocessors are necessary to be able to communicate with each other usually via a shared memory, where values that other processors can use and stored. This introduces the new problem of thread synchronization. When thread synchronization is violated, multi processors access the same value at the same time. For this each processor to have some local memory, where the processor does not have to think about thread synchronization to speed up the processor. So that the each processor needs to have a local memory. This can be implemented by quintuple-core processor with multi cores can carry out the multi-threading more efficiently. The multi- threads are basically programmed instructions in a specific sequence to be executed with the single processor core. On the quintuple-core processor, every core presents the capability of executing a sequence of programmed instructions and thus collectively they execute the threads simultaneously. The core difference lies in the architecture, which determines the ability of the processing cores to handle the sequence of instructions simultaneously. So even if you buy a Smartphone with a multi-core processor for Rs 15,000, it will not provide the smoothness and power of high-end smart phones with multi-core mobile processors. Now, it is not only about processing muscle. For now, features such as built-in modems act as radio chips for cellular networks. Those results in slimmer and lighter smart phones. The new technology innovations in the embedded systems led to the rapid miniaturization of the electronic devices which created the necessity for Computer Aided Design tools (CAD) in the hardware design process. The newest addition to this design methodology is the hardware description language (HDL). The HDLs led to the development of new digital system CAD tools, which are now being utilized by VLSI designers. Consider the following code representation: A=A+1, When two processors P1 and P2 execute this code, a number of different outcome may arise due to the real fact that the code will be split into three parts. L1: get A; L2: add 1 to A; L3: store A. It could be that P1 will first execute L1, L2 and L3 and afterward P2 will execute L1, L2 and L3. It could also be that P1 will first execute L1 followed by P2 executing L1 and L2, giving another result. Therefore, some methods for restricting access to shared and distributed resources are necessary. These methods are known as Thread safety or synchronization. Moreover, it is necessary for each processor to have some internal memory, where the processor does not have to think about thread safety to speed up the processor. As an example, each processor needs to have a private stack. The benefits of having a quintuple processor are as follows: 1. Less time to execute entire operation. 2. A more responsive system is created. 3. Different processors can be utilized for different tasks. 4. Faster calculations are made possible. 
II. QUINTUPLE CORE ARCHITECTURE DESIGN 
In this processor mainly follows the concept of parallel computing and distributed computing is applied to construct advanced concurrent computing Architecture. For this Quintuple Processor Core mainly consists of SIMD array, mapping, and local memories of designed architecture as shown in the below fig. Fig.: Designed QUINTUPLE CORE Architecture The Clock is the heart beat of any processor. The processor executes one instruction within one clock period. The Quintuple core is responsible for many concurrent computing operations and it consists of three main modules. These are Quintuple RISC Processors, I/O Ports. Generic RISC Processor are called scalar RISC Processor because they are designed to issue one instruction per cycle, similar to the base scalar processor. The concept of parallel and distributed computing Processor Core mainly consists of SIMD array, mapping, and duplicate memories. Duplicate memories: The most important thing in quad processor mapping is the boundaries and data flow of concurrent processors. Duplicate memories are important and make a design flexible and have a high throughput for both parallel and distributed strategies and are too efficient for concurrent Architecture. 
SIMD array: This SIMD array supports the multidimensional array of data. It allows the simultaneous use of multiple processors for solving a task. An SIMD array is a synchronous array of Processing Elements under the supervision of one control unit and all P.E’s receive the same instruction
P.Annapurna Int. Journal of Engineering Research and Applications www.ijera.com 
ISSN : 2248-9622, Vol. 4, Issue 9( Version 5), September 2014, pp.87-92 
www.ijera.com 89 | P a g e 
broadcast from the control unit, but operate on different multiple data sets from distinct data streams. It is usually loads data into its duplicate memories before starting the computation. All these are working together as a single processor that involves both the parallel and distributed concurrent strategy. RISC Processors (RISC): In the present work, the design of an 8-bit data width Quintuple Reduced Instruction Set Computer (RISC) processor is presented. It was developed with implementation efficiency and simplicity in mind. It has a complete instruction set, program memories and data memories, general purpose registers and a simple Arithmetical Logical Unit (ALU) for basic operations. In this design, most of the instructions are uniform length and similar format. Arithmetic operations are restricted to CPU registers. The Instruction cycle consists of three stages namely fetches, decode and execute. Many numbers of RISC processors give the highest performance per unit area for parallel codes. A larger number of RISC processor cores allow a fine-grained ability to perform dynamic voltage scaling and power down. The RISC processor core with a simple architecture is easier to design and functionally verify. This processor is an economic element that is easy to shut down in the face of catastrophic defects and easier to reconfigure in the face of large parametric variation. 
III. FPGA IMPLEMENTATION 
Modern FPGAs are large enough to implement Quintuple-Processor Systems-on-Chip. Commercial FPGA companies also provide system design tools. To aid our design decisions, we devise a design methodology adapted to the specific challenges we are facing. Our methodology for deriving a digital logic Implementation of the required functionalities encompasses three steps, each of which comprises a set of choices. The criteria we employ to evaluate the final set of choices are complexity and scalability. We define the complexity of a unit by two metrics: the maximum operating frequency and resource usage. Lower complexity is better; that is, the unit has a higher frequency and consumes less resource. We define scalability as the ability of the unit to expand in a chosen dimension with a minimal increase in complexity. 
To determine complexity and scalability, we examine the FPGA mapping of the design. In the first step, we start by choosing a particular architectural technique that provides a certain subset of the required functionalities. The second step considers the structural design choices of each architectural technique in terms of the temporal and spatial parallelism necessary to meet throughput and latency requirements. In the third step, we take into consideration the logic-implementation-specific choices driven by the constraints of the target FPGA chip. It is vital to note that the choices in all three steps are interdependent. For instance, an initially appealing architectural technique may require logic design choices that lead toward a prohibitively expensive implementation. In this case, the choice of the architectural technique has to be reexamined. In case there are feasible implementations of the selected architectural technique, the required performance can be achieved by balancing the trade- off between throughputs and operating frequency. Higher throughput necessitates higher design complexity. 
IV. RESULTS
P.Annapurna Int. Journal of Engineering Research and Applications www.ijera.com 
ISSN : 2248-9622, Vol. 4, Issue 9( Version 5), September 2014, pp.87-92 
www.ijera.com 90 | P a g e 
Fig.i: Single Processor Executing Addition operation in both parallel & distributing computing Fig.ii: Quintuple Processor Executing 1’s complement Operation Fig.iii. Quintuple Processor Executing Subtraction operation.
P.Annapurna Int. Journal of Engineering Research and Applications www.ijera.com 
ISSN : 2248-9622, Vol. 4, Issue 9( Version 5), September 2014, pp.87-92 
www.ijera.com 91 | P a g e 
Fig.iv: Quintuple Processor Executing Logical Shift Right Operation 
V. CONCLUSION 
The designed quintuple processor core is very efficient and powerful core architecture, which is analyzed by using Xilinx software and simulation by using model sim simulator. Although the quintuple processor core was based on well-proven superscalar architectural techniques, including parallel execution and register renaming, these techniques were originally devised for fine-grained parallelism among instructions. The designed Quintuple core implementation using FPGA which is very advantageous to the architecture ,because it uses the parallel processing and pipielinig techniques are easily implemented in FPGA.The advanced concurrent operation of quintuple processor gives the high speed,less clock cycles to entire operation and takes less power. 
REFERENCES 
[1] Tai-Hua, Lu, Chung-Ho Chen, KuenJong Lee. “Effective Hybrid Test Program Development for Software-Based Self-Testing of Quad Cores” ,IEEE Manuscript received April 03, 2012, revised August 14, 2012, first published December 18, 2012. [2] Gohringer, D., Hubner, M.Perschke, T., Becker. J. “New Dimensions for Quad core Architectures Demand Heterogeneity”, Infrastructure and Performance through reconfigurability The EMPSoC 
Approach”.In Proc of FPL 2010, PP.495-498, Sept 2010. [3] Lysaght, P. Blodget, B. Mason, J.Young, B.Bridgford. “Invited Paper: Quad core design Methodologies and CAD Tools for Dynamic Reconfiguration of Xilinx FPGAs”. In Proceedings of FPL 2009, August 2009. [4] D.Tullsen, S. Eggers, and H. Levy, “Simultaneous Multithreading: Maximizing On- Chip Parallelism,” Proc. 22nd Ann. Int’l Symp. Computer Architecture, ACM Press, New York,1995, pp. 392-403. [5] J. lo,S. Eggers, J. Emer, H. Levy, R. Sstamm, and D. Tullsen.”Converting thread level parallelism into instruction-level parallelism via simultaneous multithreading”. ACM Transactions on Computer Systems, 15(2), pp. 323-354, August 1997. [6] Lance, Hammond, Basem, Ku.umen “ANayfeh, Kunle Olukotun.ASingle-Chip multiprocessor. IEEE Computer”, vol. 30, no. 9, pp. 79--85, September1997. [7] J. Borkenhagen, R. Eickemeyer, and R. Kalla : “A Multithreaded PowerPC Processor for Commercial Servers, IBM Journal of Research and Development”, November 2000, Vol. 44, No. 6, pp.1995.
P.Annapurna Int. Journal of Engineering Research and Applications www.ijera.com 
ISSN : 2248-9622, Vol. 4, Issue 9( Version 5), September 2014, pp.87-92 
www.ijera.com 92 | P a g e 
AUTHORS P.ANNAPURNA Completed B.Tech in 2006in BITIT,HINDUPUR and presently doing her Master degree in VLSI from JNTU Kakinada University in V.K.R,V.N.B&A.G.K College of Engineering College, Gudivada. Areas of interests are VLSI,Communication systems. B.Naga nancharamma Completed B.Tech in 2005, M.tech in 2012. Areas of interests are low power designing, real-time applications, Embedded Systems, Digital Designing. She is an Asst. Professor of ECE in V.K.R, V.N.B&A.G.K College of engineering, Gudivada.

Más contenido relacionado

La actualidad más candente

Presentation Thesis - Convolutional net on the Xeon Phi using SIMD - Gaurav R...
Presentation Thesis - Convolutional net on the Xeon Phi using SIMD - Gaurav R...Presentation Thesis - Convolutional net on the Xeon Phi using SIMD - Gaurav R...
Presentation Thesis - Convolutional net on the Xeon Phi using SIMD - Gaurav R...
Gaurav Raina
 
Thesis Report - Gaurav Raina MSc ES - v2
Thesis Report - Gaurav Raina MSc ES - v2Thesis Report - Gaurav Raina MSc ES - v2
Thesis Report - Gaurav Raina MSc ES - v2
Gaurav Raina
 
Shakti Processor Roadmap
Shakti Processor RoadmapShakti Processor Roadmap
Shakti Processor Roadmap
smadhu2048
 
intel Presentation
intel Presentationintel Presentation
intel Presentation
finance6
 
Exceeding the Limits of Air Cooling to Unlock Greater Potential in HPC
Exceeding the Limits of Air Cooling to Unlock Greater Potential in HPCExceeding the Limits of Air Cooling to Unlock Greater Potential in HPC
Exceeding the Limits of Air Cooling to Unlock Greater Potential in HPC
inside-BigData.com
 

La actualidad más candente (20)

Presentation Thesis - Convolutional net on the Xeon Phi using SIMD - Gaurav R...
Presentation Thesis - Convolutional net on the Xeon Phi using SIMD - Gaurav R...Presentation Thesis - Convolutional net on the Xeon Phi using SIMD - Gaurav R...
Presentation Thesis - Convolutional net on the Xeon Phi using SIMD - Gaurav R...
 
Thesis Report - Gaurav Raina MSc ES - v2
Thesis Report - Gaurav Raina MSc ES - v2Thesis Report - Gaurav Raina MSc ES - v2
Thesis Report - Gaurav Raina MSc ES - v2
 
intel core i7
intel core i7 intel core i7
intel core i7
 
Increasing Throughput per Node for Content Delivery Networks
Increasing Throughput per Node for Content Delivery NetworksIncreasing Throughput per Node for Content Delivery Networks
Increasing Throughput per Node for Content Delivery Networks
 
NNSA Explorations: ARM for Supercomputing
NNSA Explorations: ARM for SupercomputingNNSA Explorations: ARM for Supercomputing
NNSA Explorations: ARM for Supercomputing
 
Specialized parallel computing
Specialized parallel computingSpecialized parallel computing
Specialized parallel computing
 
2018 Intel AI Developer Conference Keynote
2018 Intel AI Developer Conference Keynote2018 Intel AI Developer Conference Keynote
2018 Intel AI Developer Conference Keynote
 
Early Successes Debugging with TotalView on the Intel Xeon Phi Coprocessor
Early Successes Debugging with TotalView on the Intel Xeon Phi CoprocessorEarly Successes Debugging with TotalView on the Intel Xeon Phi Coprocessor
Early Successes Debugging with TotalView on the Intel Xeon Phi Coprocessor
 
Intel(R)Core(Tm)I7 Desktop Processor Product Brief
Intel(R)Core(Tm)I7 Desktop Processor Product BriefIntel(R)Core(Tm)I7 Desktop Processor Product Brief
Intel(R)Core(Tm)I7 Desktop Processor Product Brief
 
Large Model support and Distribute deep learning
Large Model support and Distribute deep learningLarge Model support and Distribute deep learning
Large Model support and Distribute deep learning
 
Hyper-threading (Xeon Microprocessor by intel)
Hyper-threading (Xeon Microprocessor by intel) Hyper-threading (Xeon Microprocessor by intel)
Hyper-threading (Xeon Microprocessor by intel)
 
Intel Microarchitecture (Nehalem) and its Applications on Videogames
Intel Microarchitecture (Nehalem) and its Applications on VideogamesIntel Microarchitecture (Nehalem) and its Applications on Videogames
Intel Microarchitecture (Nehalem) and its Applications on Videogames
 
Shakti Processor Roadmap
Shakti Processor RoadmapShakti Processor Roadmap
Shakti Processor Roadmap
 
POWER10 innovations for HPC
POWER10 innovations for HPCPOWER10 innovations for HPC
POWER10 innovations for HPC
 
Using VPP and SRIO-V with Clear Containers
Using VPP and SRIO-V with Clear ContainersUsing VPP and SRIO-V with Clear Containers
Using VPP and SRIO-V with Clear Containers
 
intel Presentation
intel Presentationintel Presentation
intel Presentation
 
Intel core i7 processors
Intel core i7 processorsIntel core i7 processors
Intel core i7 processors
 
Exceeding the Limits of Air Cooling to Unlock Greater Potential in HPC
Exceeding the Limits of Air Cooling to Unlock Greater Potential in HPCExceeding the Limits of Air Cooling to Unlock Greater Potential in HPC
Exceeding the Limits of Air Cooling to Unlock Greater Potential in HPC
 
Intel Core i7 Processors
Intel Core i7 ProcessorsIntel Core i7 Processors
Intel Core i7 Processors
 
Accelerate Big Data Processing with High-Performance Computing Technologies
Accelerate Big Data Processing with High-Performance Computing TechnologiesAccelerate Big Data Processing with High-Performance Computing Technologies
Accelerate Big Data Processing with High-Performance Computing Technologies
 

Destacado

C:\Fakepath\Desarrollo Organizacional De La Utes ComisióN Chiquita Ii
C:\Fakepath\Desarrollo Organizacional De La   Utes ComisióN Chiquita IiC:\Fakepath\Desarrollo Organizacional De La   Utes ComisióN Chiquita Ii
C:\Fakepath\Desarrollo Organizacional De La Utes ComisióN Chiquita Ii
utesinfofcm
 
African Paradise Botswana
African Paradise BotswanaAfrican Paradise Botswana
African Paradise Botswana
Manuela Szekely
 
National geographic
National geographicNational geographic
National geographic
GIA VER
 
The Heart Of Anatolia (Erzurum)
The Heart Of Anatolia (Erzurum)The Heart Of Anatolia (Erzurum)
The Heart Of Anatolia (Erzurum)
İbrahim DAL
 
9 manual alarme-olimpus-boxer-light
9 manual alarme-olimpus-boxer-light9 manual alarme-olimpus-boxer-light
9 manual alarme-olimpus-boxer-light
Miguel Eletricista
 
TCC Marcus Winicius e Rayner Max - Analise de Desempenho de um Servidor Proxy...
TCC Marcus Winicius e Rayner Max - Analise de Desempenho de um Servidor Proxy...TCC Marcus Winicius e Rayner Max - Analise de Desempenho de um Servidor Proxy...
TCC Marcus Winicius e Rayner Max - Analise de Desempenho de um Servidor Proxy...
Marcus Winicius
 
Instrumentos slide share
Instrumentos slide shareInstrumentos slide share
Instrumentos slide share
aixarosa
 

Destacado (20)

Waves
WavesWaves
Waves
 
Cats 1
Cats 1Cats 1
Cats 1
 
Problema gauss a
Problema gauss aProblema gauss a
Problema gauss a
 
C:\Fakepath\Desarrollo Organizacional De La Utes ComisióN Chiquita Ii
C:\Fakepath\Desarrollo Organizacional De La   Utes ComisióN Chiquita IiC:\Fakepath\Desarrollo Organizacional De La   Utes ComisióN Chiquita Ii
C:\Fakepath\Desarrollo Organizacional De La Utes ComisióN Chiquita Ii
 
Instrumentos slide share
Instrumentos slide shareInstrumentos slide share
Instrumentos slide share
 
African Paradise Botswana
African Paradise BotswanaAfrican Paradise Botswana
African Paradise Botswana
 
National geographic
National geographicNational geographic
National geographic
 
The Heart Of Anatolia (Erzurum)
The Heart Of Anatolia (Erzurum)The Heart Of Anatolia (Erzurum)
The Heart Of Anatolia (Erzurum)
 
1
11
1
 
9 manual alarme-olimpus-boxer-light
9 manual alarme-olimpus-boxer-light9 manual alarme-olimpus-boxer-light
9 manual alarme-olimpus-boxer-light
 
ECOTURISMO
ECOTURISMOECOTURISMO
ECOTURISMO
 
Rotation Invariant Matching of Partial Shoeprints
Rotation Invariant Matching of Partial ShoeprintsRotation Invariant Matching of Partial Shoeprints
Rotation Invariant Matching of Partial Shoeprints
 
Tejiendo las bases para una teoría del emplazamiento
Tejiendo las bases para una teoría del emplazamientoTejiendo las bases para una teoría del emplazamiento
Tejiendo las bases para una teoría del emplazamiento
 
Design of a Smart Meter for the Indian Energy Scenario
Design of a Smart Meter for the Indian Energy ScenarioDesign of a Smart Meter for the Indian Energy Scenario
Design of a Smart Meter for the Indian Energy Scenario
 
Presentacion postgrado
Presentacion postgradoPresentacion postgrado
Presentacion postgrado
 
Segundo trabajo
Segundo trabajoSegundo trabajo
Segundo trabajo
 
TCC Marcus Winicius e Rayner Max - Analise de Desempenho de um Servidor Proxy...
TCC Marcus Winicius e Rayner Max - Analise de Desempenho de um Servidor Proxy...TCC Marcus Winicius e Rayner Max - Analise de Desempenho de um Servidor Proxy...
TCC Marcus Winicius e Rayner Max - Analise de Desempenho de um Servidor Proxy...
 
2015年面板產業景氣展望
2015年面板產業景氣展望2015年面板產業景氣展望
2015年面板產業景氣展望
 
Aula para 5º ano
Aula para 5º anoAula para 5º ano
Aula para 5º ano
 
Instrumentos slide share
Instrumentos slide shareInstrumentos slide share
Instrumentos slide share
 

Similar a Design and Implementation of Quintuple Processor Architecture Using FPGA

I understand that physics and hardware emmaded on the use of finete .pdf
I understand that physics and hardware emmaded on the use of finete .pdfI understand that physics and hardware emmaded on the use of finete .pdf
I understand that physics and hardware emmaded on the use of finete .pdf
anil0878
 
Top 10 Supercomputers With Descriptive Information & Analysis
Top 10 Supercomputers With Descriptive Information & AnalysisTop 10 Supercomputers With Descriptive Information & Analysis
Top 10 Supercomputers With Descriptive Information & Analysis
NomanSiddiqui41
 
37248136-Nano-Technology.pdf
37248136-Nano-Technology.pdf37248136-Nano-Technology.pdf
37248136-Nano-Technology.pdf
TB107thippeswamyM
 
Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045
Editor IJARCET
 
Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045
Editor IJARCET
 
Synergistic processing in cell's multicore architecture
Synergistic processing in cell's multicore architectureSynergistic processing in cell's multicore architecture
Synergistic processing in cell's multicore architecture
Michael Gschwind
 

Similar a Design and Implementation of Quintuple Processor Architecture Using FPGA (20)

Ef35745749
Ef35745749Ef35745749
Ef35745749
 
Implementation of RISC-Based Architecture for Low power applications
Implementation of RISC-Based Architecture for Low power applicationsImplementation of RISC-Based Architecture for Low power applications
Implementation of RISC-Based Architecture for Low power applications
 
Review paper on 32-BIT RISC processor with floating point arithmetic
Review paper on 32-BIT RISC processor with floating point arithmeticReview paper on 32-BIT RISC processor with floating point arithmetic
Review paper on 32-BIT RISC processor with floating point arithmetic
 
I understand that physics and hardware emmaded on the use of finete .pdf
I understand that physics and hardware emmaded on the use of finete .pdfI understand that physics and hardware emmaded on the use of finete .pdf
I understand that physics and hardware emmaded on the use of finete .pdf
 
A New Direction for Computer Architecture Research
A New Direction for Computer Architecture ResearchA New Direction for Computer Architecture Research
A New Direction for Computer Architecture Research
 
Nehalem
NehalemNehalem
Nehalem
 
chameleon chip
chameleon chipchameleon chip
chameleon chip
 
Top 10 Supercomputers With Descriptive Information & Analysis
Top 10 Supercomputers With Descriptive Information & AnalysisTop 10 Supercomputers With Descriptive Information & Analysis
Top 10 Supercomputers With Descriptive Information & Analysis
 
W04505116121
W04505116121W04505116121
W04505116121
 
37248136-Nano-Technology.pdf
37248136-Nano-Technology.pdf37248136-Nano-Technology.pdf
37248136-Nano-Technology.pdf
 
Accelerating Real Time Applications on Heterogeneous Platforms
Accelerating Real Time Applications on Heterogeneous PlatformsAccelerating Real Time Applications on Heterogeneous Platforms
Accelerating Real Time Applications on Heterogeneous Platforms
 
Multi-Core on Chip Architecture *doc - IK
Multi-Core on Chip Architecture *doc - IKMulti-Core on Chip Architecture *doc - IK
Multi-Core on Chip Architecture *doc - IK
 
Building efficient 5G NR base stations with Intel® Xeon® Scalable Processors
Building efficient 5G NR base stations with Intel® Xeon® Scalable Processors Building efficient 5G NR base stations with Intel® Xeon® Scalable Processors
Building efficient 5G NR base stations with Intel® Xeon® Scalable Processors
 
14 284-291
14 284-29114 284-291
14 284-291
 
Area Optimized Implementation For Mips Processor
Area Optimized Implementation For Mips ProcessorArea Optimized Implementation For Mips Processor
Area Optimized Implementation For Mips Processor
 
Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045
 
Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045
 
Seminar Accelerating Business Using Microservices Architecture in Digital Age...
Seminar Accelerating Business Using Microservices Architecture in Digital Age...Seminar Accelerating Business Using Microservices Architecture in Digital Age...
Seminar Accelerating Business Using Microservices Architecture in Digital Age...
 
Synergistic processing in cell's multicore architecture
Synergistic processing in cell's multicore architectureSynergistic processing in cell's multicore architecture
Synergistic processing in cell's multicore architecture
 
Run-Time Adaptive Processor Allocation of Self-Configurable Intel IXP2400 Net...
Run-Time Adaptive Processor Allocation of Self-Configurable Intel IXP2400 Net...Run-Time Adaptive Processor Allocation of Self-Configurable Intel IXP2400 Net...
Run-Time Adaptive Processor Allocation of Self-Configurable Intel IXP2400 Net...
 

Último

VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
dharasingh5698
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Christo Ananth
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
Tonystark477637
 

Último (20)

Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTINGMANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 

Design and Implementation of Quintuple Processor Architecture Using FPGA

  • 1. P.Annapurna Int. Journal of Engineering Research and Applications www.ijera.com ISSN : 2248-9622, Vol. 4, Issue 9( Version 5), September 2014, pp.87-92 www.ijera.com 87 | P a g e Design and Implementation of Quintuple Processor Architecture Using FPGA P.Annapurna1, B.Naga Nancharamma2 1M. TECH STUDENT (VLSI), VKR, VNB&AGK COLLEGE OF ENGINEERING, GUDIVADA, JNTUK (A. P.), INDIA. 2M. TECH, ASST.PROF, VKR, VNB&AGK COLLEGE OF ENGINEERING, GUDIVADA, JNTUK (A. P.), INDIA. ABSTRACT The advanced quintuple processor core is a design philosophy that has become a mainstream in Scientific and engineering applications. Increasing performance and gate capacity of recent FPGA devices permit complex logic systems to be implemented on a single programmable device. The embedded multiprocessors face a new problem with thread synchronization. It is caused by the distributed memory, when thread synchronization is violated the processors can access the same value at the same time. Basically the processor performance can be increased by adopting clock scaling technique and micro architectural Enhancements. Therefore, Designed a new Architecture called Advanced Concurrent Computing. This is implemented on the FPGA chip using VHDL. The advanced Concurrent Computing architecture performs a simultaneous use of both parallel and distributed computing. The full architecture of quintuple processor core designed for realistic to perform arithmetic, logical, shifting and bit manipulation operations. The proposed advanced quintuple processor core contains Homogeneous RISC processors, added with pipelined processing units, multi bus organization and I/O ports along with the other functional elements required to implement embedded SOC solutions. The designed quintuple performance issues like area, speed and power dissipation and propagation delay are analyzed at 90nm process technology using Xilinx tool. Keywords – FPGA, VHDL, RISC. I. INTRODUCTION In recent years, the processor speed is enormously increases due to the advanced computer programming language and components size. The component size can be drastically reduces due to the advanced VLSI technology. In this paper presents, five individual RISC processors are cascaded together and externally connect the local memory blocks for eliminating the thread synchronization problem. The aim of this project is to design and synthesis of Quintuple processor core for advanced concurrent computing in VHDL. Simulation refers to applying stimulus and it is a set of input parameters to the soft code and checking the validity of the output. Synthesis is the conversion of high level description of design to gate level net lists. In the quintuple processor environment multi double the number of processors per chip with each new technology generation. In the past, processor performance was closely tied to clock frequency, which resulted in more power consumption and more successful computing performance gains. This unsustainable trend was reversed by quintuple processor core Technology that delivers processors with higher overall performance and better performance per watt. The microprocessor industry is now at a crossroads due to hitting the limit of the power that a single integrated circuit can dissipate. To continue the pattern of increasing performance, semiconductor companies have been forced to replace the single large power-inefficient processor with several smaller power efficient processors operating in parallel and at the same time, because of the growing needs rapidly increasing the design costs and the value in providing flexible platforms, many segments of embedded computing are moving away from ASIC designs to families of programmable platforms. On-chip multi processors appear to be the most promising approach to deliver competitive high performance on these programmable platforms like FPGA. As a result, both general-Purpose computing and advanced computing are being served by on-chip multiprocessor systems. The term quintuple processor core was coined to describe a microprocessor with multiple processors or cores on a single chip. Quintuple core consists of multi connected processors that are capable of communicating. This can be done on a single chip where the processors are connected typically by a multi organization bus. Most modern supercomputers have used this multiprocessor technique. A parallel system is presented with more than one task, known as threads. RESEARCH ARTICLE OPEN ACCESS
  • 2. P.Annapurna Int. Journal of Engineering Research and Applications www.ijera.com ISSN : 2248-9622, Vol. 4, Issue 9( Version 5), September 2014, pp.87-92 www.ijera.com 88 | P a g e Multiprocessors are necessary to be able to communicate with each other usually via a shared memory, where values that other processors can use and stored. This introduces the new problem of thread synchronization. When thread synchronization is violated, multi processors access the same value at the same time. For this each processor to have some local memory, where the processor does not have to think about thread synchronization to speed up the processor. So that the each processor needs to have a local memory. This can be implemented by quintuple-core processor with multi cores can carry out the multi-threading more efficiently. The multi- threads are basically programmed instructions in a specific sequence to be executed with the single processor core. On the quintuple-core processor, every core presents the capability of executing a sequence of programmed instructions and thus collectively they execute the threads simultaneously. The core difference lies in the architecture, which determines the ability of the processing cores to handle the sequence of instructions simultaneously. So even if you buy a Smartphone with a multi-core processor for Rs 15,000, it will not provide the smoothness and power of high-end smart phones with multi-core mobile processors. Now, it is not only about processing muscle. For now, features such as built-in modems act as radio chips for cellular networks. Those results in slimmer and lighter smart phones. The new technology innovations in the embedded systems led to the rapid miniaturization of the electronic devices which created the necessity for Computer Aided Design tools (CAD) in the hardware design process. The newest addition to this design methodology is the hardware description language (HDL). The HDLs led to the development of new digital system CAD tools, which are now being utilized by VLSI designers. Consider the following code representation: A=A+1, When two processors P1 and P2 execute this code, a number of different outcome may arise due to the real fact that the code will be split into three parts. L1: get A; L2: add 1 to A; L3: store A. It could be that P1 will first execute L1, L2 and L3 and afterward P2 will execute L1, L2 and L3. It could also be that P1 will first execute L1 followed by P2 executing L1 and L2, giving another result. Therefore, some methods for restricting access to shared and distributed resources are necessary. These methods are known as Thread safety or synchronization. Moreover, it is necessary for each processor to have some internal memory, where the processor does not have to think about thread safety to speed up the processor. As an example, each processor needs to have a private stack. The benefits of having a quintuple processor are as follows: 1. Less time to execute entire operation. 2. A more responsive system is created. 3. Different processors can be utilized for different tasks. 4. Faster calculations are made possible. II. QUINTUPLE CORE ARCHITECTURE DESIGN In this processor mainly follows the concept of parallel computing and distributed computing is applied to construct advanced concurrent computing Architecture. For this Quintuple Processor Core mainly consists of SIMD array, mapping, and local memories of designed architecture as shown in the below fig. Fig.: Designed QUINTUPLE CORE Architecture The Clock is the heart beat of any processor. The processor executes one instruction within one clock period. The Quintuple core is responsible for many concurrent computing operations and it consists of three main modules. These are Quintuple RISC Processors, I/O Ports. Generic RISC Processor are called scalar RISC Processor because they are designed to issue one instruction per cycle, similar to the base scalar processor. The concept of parallel and distributed computing Processor Core mainly consists of SIMD array, mapping, and duplicate memories. Duplicate memories: The most important thing in quad processor mapping is the boundaries and data flow of concurrent processors. Duplicate memories are important and make a design flexible and have a high throughput for both parallel and distributed strategies and are too efficient for concurrent Architecture. SIMD array: This SIMD array supports the multidimensional array of data. It allows the simultaneous use of multiple processors for solving a task. An SIMD array is a synchronous array of Processing Elements under the supervision of one control unit and all P.E’s receive the same instruction
  • 3. P.Annapurna Int. Journal of Engineering Research and Applications www.ijera.com ISSN : 2248-9622, Vol. 4, Issue 9( Version 5), September 2014, pp.87-92 www.ijera.com 89 | P a g e broadcast from the control unit, but operate on different multiple data sets from distinct data streams. It is usually loads data into its duplicate memories before starting the computation. All these are working together as a single processor that involves both the parallel and distributed concurrent strategy. RISC Processors (RISC): In the present work, the design of an 8-bit data width Quintuple Reduced Instruction Set Computer (RISC) processor is presented. It was developed with implementation efficiency and simplicity in mind. It has a complete instruction set, program memories and data memories, general purpose registers and a simple Arithmetical Logical Unit (ALU) for basic operations. In this design, most of the instructions are uniform length and similar format. Arithmetic operations are restricted to CPU registers. The Instruction cycle consists of three stages namely fetches, decode and execute. Many numbers of RISC processors give the highest performance per unit area for parallel codes. A larger number of RISC processor cores allow a fine-grained ability to perform dynamic voltage scaling and power down. The RISC processor core with a simple architecture is easier to design and functionally verify. This processor is an economic element that is easy to shut down in the face of catastrophic defects and easier to reconfigure in the face of large parametric variation. III. FPGA IMPLEMENTATION Modern FPGAs are large enough to implement Quintuple-Processor Systems-on-Chip. Commercial FPGA companies also provide system design tools. To aid our design decisions, we devise a design methodology adapted to the specific challenges we are facing. Our methodology for deriving a digital logic Implementation of the required functionalities encompasses three steps, each of which comprises a set of choices. The criteria we employ to evaluate the final set of choices are complexity and scalability. We define the complexity of a unit by two metrics: the maximum operating frequency and resource usage. Lower complexity is better; that is, the unit has a higher frequency and consumes less resource. We define scalability as the ability of the unit to expand in a chosen dimension with a minimal increase in complexity. To determine complexity and scalability, we examine the FPGA mapping of the design. In the first step, we start by choosing a particular architectural technique that provides a certain subset of the required functionalities. The second step considers the structural design choices of each architectural technique in terms of the temporal and spatial parallelism necessary to meet throughput and latency requirements. In the third step, we take into consideration the logic-implementation-specific choices driven by the constraints of the target FPGA chip. It is vital to note that the choices in all three steps are interdependent. For instance, an initially appealing architectural technique may require logic design choices that lead toward a prohibitively expensive implementation. In this case, the choice of the architectural technique has to be reexamined. In case there are feasible implementations of the selected architectural technique, the required performance can be achieved by balancing the trade- off between throughputs and operating frequency. Higher throughput necessitates higher design complexity. IV. RESULTS
  • 4. P.Annapurna Int. Journal of Engineering Research and Applications www.ijera.com ISSN : 2248-9622, Vol. 4, Issue 9( Version 5), September 2014, pp.87-92 www.ijera.com 90 | P a g e Fig.i: Single Processor Executing Addition operation in both parallel & distributing computing Fig.ii: Quintuple Processor Executing 1’s complement Operation Fig.iii. Quintuple Processor Executing Subtraction operation.
  • 5. P.Annapurna Int. Journal of Engineering Research and Applications www.ijera.com ISSN : 2248-9622, Vol. 4, Issue 9( Version 5), September 2014, pp.87-92 www.ijera.com 91 | P a g e Fig.iv: Quintuple Processor Executing Logical Shift Right Operation V. CONCLUSION The designed quintuple processor core is very efficient and powerful core architecture, which is analyzed by using Xilinx software and simulation by using model sim simulator. Although the quintuple processor core was based on well-proven superscalar architectural techniques, including parallel execution and register renaming, these techniques were originally devised for fine-grained parallelism among instructions. The designed Quintuple core implementation using FPGA which is very advantageous to the architecture ,because it uses the parallel processing and pipielinig techniques are easily implemented in FPGA.The advanced concurrent operation of quintuple processor gives the high speed,less clock cycles to entire operation and takes less power. REFERENCES [1] Tai-Hua, Lu, Chung-Ho Chen, KuenJong Lee. “Effective Hybrid Test Program Development for Software-Based Self-Testing of Quad Cores” ,IEEE Manuscript received April 03, 2012, revised August 14, 2012, first published December 18, 2012. [2] Gohringer, D., Hubner, M.Perschke, T., Becker. J. “New Dimensions for Quad core Architectures Demand Heterogeneity”, Infrastructure and Performance through reconfigurability The EMPSoC Approach”.In Proc of FPL 2010, PP.495-498, Sept 2010. [3] Lysaght, P. Blodget, B. Mason, J.Young, B.Bridgford. “Invited Paper: Quad core design Methodologies and CAD Tools for Dynamic Reconfiguration of Xilinx FPGAs”. In Proceedings of FPL 2009, August 2009. [4] D.Tullsen, S. Eggers, and H. Levy, “Simultaneous Multithreading: Maximizing On- Chip Parallelism,” Proc. 22nd Ann. Int’l Symp. Computer Architecture, ACM Press, New York,1995, pp. 392-403. [5] J. lo,S. Eggers, J. Emer, H. Levy, R. Sstamm, and D. Tullsen.”Converting thread level parallelism into instruction-level parallelism via simultaneous multithreading”. ACM Transactions on Computer Systems, 15(2), pp. 323-354, August 1997. [6] Lance, Hammond, Basem, Ku.umen “ANayfeh, Kunle Olukotun.ASingle-Chip multiprocessor. IEEE Computer”, vol. 30, no. 9, pp. 79--85, September1997. [7] J. Borkenhagen, R. Eickemeyer, and R. Kalla : “A Multithreaded PowerPC Processor for Commercial Servers, IBM Journal of Research and Development”, November 2000, Vol. 44, No. 6, pp.1995.
  • 6. P.Annapurna Int. Journal of Engineering Research and Applications www.ijera.com ISSN : 2248-9622, Vol. 4, Issue 9( Version 5), September 2014, pp.87-92 www.ijera.com 92 | P a g e AUTHORS P.ANNAPURNA Completed B.Tech in 2006in BITIT,HINDUPUR and presently doing her Master degree in VLSI from JNTU Kakinada University in V.K.R,V.N.B&A.G.K College of Engineering College, Gudivada. Areas of interests are VLSI,Communication systems. B.Naga nancharamma Completed B.Tech in 2005, M.tech in 2012. Areas of interests are low power designing, real-time applications, Embedded Systems, Digital Designing. She is an Asst. Professor of ECE in V.K.R, V.N.B&A.G.K College of engineering, Gudivada.