5. Why choose DME?
DME supports centralized shared memory (CM) and distributed shared memory (DSM).
For medium and large system, we prefer to use DSM structures, since centralized
memory has already become a bottleneck. Memories are preferrably distributed,
featuring good scalability and less delay of memory access.
5
7. Why choose DME
Processor1 Processor2
Cache Cache
Interface Interface Local
NI modules in DME
DME P1 M4 Memory
are designed to DME P1 M2 Local
support Standard Memory NI
bus interface, e.g. NI We support:
AHB,APB,AXI, OCP 1. common bus protocol,
easy to integrate into
existing system;
Network-on-Chip (NoC) 2. configurable for different
DME can perform as
data formats;
a bridge connecting
different IPs with
Network
NI NI
DME P1 M2 DME P1 M0
(Local
Interface DDR-Interface
Memory)
DDR controller
Custom IP
DDR Memory
7
8. DME Product
DME comes in three flavours
P1 M0 – DME Light – Small footprint, low power,
for memory access and similar standard tasks
P1 M2 – DME Flex – Programmable, fully featured
for maximum flexibility and customization
P1 M4 – DME Flex Plus – Programmable, fully
featured, with maximum performance
8
9. DME Product
DME P1 M0 – DME Light
DME
Interconnect-
IP IP-Interface Interconnect
Interface
Feature list:
Transaction Scheduler
1.DSM(distributed shared memory)
2.Private|Shared division
3.Synchronization
4.Privilege level setting (3/2013)
Memory Interface Crossbar
5.IP interface support (AHB,APB) (4/2013)
6.Interconnection interface support (AHB,APB)
(4/2013)
Local Memory Local Memory Local Memory Local Memory
9
10. DME Product
DME P1 M2
Data M anagement Engine
Interconnect
CPU CPU Interface Interconnect
Interface
Feature list:
Transaction Scheduler
h
1.DSM(distributed shared memory)
t
a
p
s
2.Private|Shared division
M ini- M ini - s
a
processor processor p
y
3.V2P
B
4.Synchronization
5.Privilege level setting (3/2013)
M emory Interface Crossbar
6.IP interface support (AHB,APB) (4/2013)
7.Interconnection interface support (AHB,APB)
Local Local (4/2013)
M emory M emory
8.DMA-1, DMA-2 (3/2013)
9.Message passing (4/2013)
10.micro-programming 10
11. DME Product
DME P1 M4
Data M anagement Engine
Interconnect
CPU CPU Interface Interconnect
Interface
Feature list:
Transaction Scheduler
h
t
1.DSM(distributed shared memory)
a
p
M ini- M ini - M ini - M ini - s
s
2.Private|Shared division
a
processor processor processor processor p 3.V2P
y
B
4.Synchronization
5.Privilege level setting (3/2013)
M emory Interface Crossbar
6.IP interface support (AHB,APB) (4/2013)
Local Local Local Local
7.Interconnection interface support (AHB,APB)
M emory M emory M emory M emory (4/2013)
8.DMA-1, DMA-2 (3/2013)
9.Message passing (4/2013)
10.micro-programming 11
12. DME Planned Features
Features
AXI Q2 2013
DMA-3 Q2 2013
Striding access Q2 2013
Data shuffling Q2 2013
SystemC Model, SIMICS Model Q3 2013
Transaction ordering support (memory consistency) Q3 2013
Dynamic memory allocation Q4 2013
OCP Q4 2013
Directory based cache coherence On Demand 12
14. Application Example: H.264 decoder
Task
P1 P2 P3 Pn
Distributor
ITRANS/ INTRA
ENTROPY DEQUANT PREDICTION
P1 P2 P3 P4 P5 P6
Load Private Shared
Store
Mem Mem
3 Private Shared
NODES MEM MEM MEM MEM MEM MEM
Mem Mem
Private Shared
Mem Mem
P7 P8 P9 P10 P11 P12
12
NODES
Private Shared
MEM MEM MEM MEM MEM MEM Mem Mem
with DME based on distributed shared memory without DME based on centralized memory
14
15. Demonstrator Performance
Performance(fps)
77
75 with DM E
51 QCIF(176x144)
50
25 without DM E
25 30 31
24 20 with DM E
13 CIF(352x288)
6 without DM E
7 7
3 6 9 node
15
16. Applications
The DME is useful for many-core SoCs in:
Video, signal and network processing
Cloud computing
Industrial automation
Set-top boxes
Scientific computing
Solid state disks
Other high-end embedded applications
16
18. Evaluating the DME
For evaluation of the DME, Elsip offers:
Introduction Booklet
DME Application Development
Package, with API libraries
C++ Model
SIMICS Model
Compiled IP Model
User manual
Demonstrator
On-site and off-site support
19. Roadmap
Looking into the future, other IP we’re working on
include:
Packet- and Circuit-switched NoCs (Circuit-switched
can be faster than packet-switched for
telecom/datacom applications)
DRRA - Dynamically Reconfgurable Resource Array
(reconfgurable on bus level, better silicon usage than
FPGA)
20. Thank you!
Please go to
www.elsip.se
for more information
21. The founders
Axel Jantsch, CTO. Professor, KTH Electronic Systems since
2002. 20+ years of research, primarily within NoC and SoC.
200+ scientific papers published. Visiting professor of Fudan
University in PRC and Cantabria University in Spain
Ahmed Hemani. Professor, KTH, focus on high-level system
integration, design automation, NoC, asynchronous circuit,
configurable system. Industrial experience from NSC,
NXP/Philips, ABB, Ericsson, Newlogic, Synthesia and
Spirea (co-founder).
Zhonghai Lu: Professor, KTH, expert in SoC and NoC.
Reviewer of 14 international periodicals. Principal
investigator of Intel, dealing with future nuclear processor
chip frame.
21
22. Contact
Sales Director Bengt Edlund
Mail: bengt@elsip.se
Phone: +46 708 722 800
CEO Adam Edström
Mail: adam@elsip.se
Phone +46 702 579 734
Address: c/o SICS, PO Box 1263, SE16429, Kista, Sweden
22
23. Some ELSIP Milestones
• Founded by professors Axel Jantsch, Ahmed Hemani
and Zhonghai Lu at the Royal Institute of Technology in
Stockholm 2011
• Received initial funding from Vinnova
• Commercial launch when Adam Edström (CEO) and
Bengt Edlund (Sales Dir) joined the company Sept 2012
• Established subsidiary Memcom in Shanghai March
2012, PRC, with Zhonghai Lu as CTO and Zhuo Zou as
CEO. Received initial funding from Wuxi government.
• Cooperation with Fudan-Wuxi Institute, Shanghai, PRC
• Selected by SICS, the Swedish Institute of Computer
Science, as member of SICS Startup Accelerator
23