26. •
• Schrödinger Dirac
•
Schrödinger Dirac
2018.8.7
"The fundamental laws necessary for the mathematical treatment of large parts of physics
and the whole chemistry are thus fully known, and the difficulty lies only in the fact that
application of these laws leads to equations that are too complex to be solved.”
- Paul Dirac, Proc. Roy. Soc. London , A123, 714 1929
HΨ = EΨ
H = −
∇A
2
2MA
N
∑ −
∇i
2
2A
n
∑ +
ZAZB
RABA>B
∑ −
ZA
rAiA,i
∑ +
1
riji> j
∑
Ψ ≡ Ψ r1,r2,,rn;R1,R2,,RN( ) Erwin SchrödingerEH
BA AB
BA
ji ijiA Ai
A
n
i
i
N
A
A
R
ZZ
rr
Z
M
H
,,
22
1
22
),,,;,,,( 2121 Nn RRRrrr
e2/rij
ZAe2/riA
i
j
A B
ZBe2/rjB
ZA ZB e2/RAB
N n
?!
?!
50. Resolution-of-idendity MP2 (RI-MP2)
2018.8.7
• MP2 4
Resolution-of-identity (RI)
• MP2 ( )
5-10 ( O(N5) )
•
: O(N3)
ia jb( )= Cσ b
Cνa
Cλ j
µν λσ( )Cµi
µ
∑
λ
∑
ν
∑
σ
∑ ia jb( )= Bn
ia
Bn
jb
n
∑ , Bn
ia
= l n( )
−1/2
Cνa
Cµi
µν l( )µ
∑
ν
∑
l
∑
O(N5)
RI
51. E(2)
=
ia jb( ) 2 ia jb( )− ib ja( )"
#
$
%
εi +εj −εa −εbijab
∑MP2
(2) MO Bn
ia
= l n( )
−1/2
Cνa Cµi µν l( )
µ
∑
ν
∑
l
∑
(1) AO3 µν l( )= χµ (r1)χµ (r1)r12
−1
ξl (r2 )dr1 dr2∫∫
Hartree-Fock Cµi εi
(3) MO ia jb( )= Bn
ia
Bn
jb
n
∑
: O(N5)
O(N3)
-
-
: O(N4)
: O(N4)
: O(N3)
2018.8.7
52. RI-MP2
2018.8.7
IoccBgn = Myrank*Nchank
IoccEnd = (Myrank+1)*Nchank
Do Iocc = IoccBgn, IoccEnd
↑MPI parallelization
<RI-MP2 calculations>
End Do
IvirBgn = Myrank*Nchank
IvirEnd = (Myrank+1)*Nchank
Do Ivir = IvirBgn, IvirEnd
↑MPI parallelization
<RI-MP2 calculations>
End Do
(ia | jb) = Bn
ia
Bn
jb
n
∑
:
MPI
:
MPI
(ia | jb) = Bn
ia
Bn
jb
n
∑
RI-MP2
MPI/OpenMP
MK and T. Nakajima, J. Chem. Comput. Theor., 9, 5373 (2013).
53. RI-MP2
2018.8.7
0
1
2
3
4
N-1
:
MPI
MPI ( =1 )
Evaluation of 3c-2e ERIs Bn
ia
= l n( )
−1/2
Cνa
Cµi
µν l( )µ
∑
ν
∑
λ
∑
Loop bProc
Sending
jb
nB to Myrank- bProc process
Receiving
jb
nB from Myrank+ bProc process
Loop a Myrank (MPI parallel)
Evaluation of 4c-2e MO ERIs
(ia | jb) = Bn
ia
Bn
jb
n
∑
by BLAS’s DGEMM (OpenMP parallel)
Evaluation of MP2 correlation energy E(2)
(OpenMP parallel)
End Loop a
End Loop bProc
MPI ( )
MK et al. J. Chem. Theory Comput., 2013, 9, 5373.
56. RI-MP2 CUDA CPU-GPU
2018.8.7
Loop bProc (MPI parallel)
Sending
jb
nB to Myrank- bProc + 1 process
Sending
jb
nB from CPU to GPU
Receiving
jb
nB from Myrank+ bProc + 1 process
Sending
jb
nB from CPU to GPU
Loop a Myrank (MPI parallel)
Receiving 4c-2e integral (ia | jb)P from GPU to CPU
Allreduce 4c-2e integral
(ia | jb) = (ia | jb)P
P
∑
Evaluation of MP2 correlation energy E(2)
(OpenMP parallel)
End Loop a
End Loop bProc
• GPU CUDA
•
- CuBLAS
GPU
Evaluation of 4c-2e integral
(ia | jb)P
= Bn
ia
Bn
jb
n∈Myrank
∑
(CuBLASDGEMM)
: CPU
: GPU
• CPU-GPU PCI
• pinned memory PCI
64. RI-MP2
• 1997 Weigend *
• MP2 4 MO
Resolution-of-identity (RI)
• MP2
•
• Turbomole Q-Chem ORCA
* F. Weigend, M. Häser, Theor. Chem. Acc., 97, 331 (1997).
1 1 , 1( ) ( ) ( )l l
l
cµ n µnc c x» år r r
2018.8.7
µν λσ( )= µν l( ) l m( )
−1
m λσ( )lm
∑
µν λσ( )
(x)
= µν l( )
(x)
l m( )
−1
m λσ( )lm
∑ + µν l( ) l m( )
−1
m λσ( )
(x)
lm
∑
− µν l( ) l m( )
−1
m n( )
(x)
n o( )
−1
o λσ( )lmno
∑
65. RI-MP2
RI-MP2
2
γ lm
MP2-NS
= Γia
l
Bia
n
m n( )
−1/2
nia
∑Γia
l
= 2Tij
ab
−Tji
ab
( )Bjb
n
n l( )
−1/2
jbn
∑
Pab
(2)
= 2Tij
ac
−Tji
ac
( )Tij
bc
ijc
∑
1
Γµνλσ
MP2-S
= 1
2
Pµν
RHF
+ Pµν
(2)
( )Pλσ
RHF
− 1
2
1
2
Pµλ
RHF
+ Pµλ
(2)
( )Pνσ
RHF
Bia
n
= l n( )
−1/2
Cνa Cµi µν l( )
µ
∑
ν
∑
l
∑ ia jb( )= Bia
n
Bjb
n
n
∑
3
Pij
(2)
= − 2Tik
ab
−Tki
ab
( )kab
∑ Tjk
ab
Tij
ab
=
(ia | jb)
εi
+ ε j
− εa
− εb
dEMP2
dx
= Pµν
MP2
Hµν
(x)
+ Wµν
MP2
Sµν
(x)
µν
∑ +
1
2
Γµνλσ
MP2-S
µν λσ( )
(x)
µν
∑ + 2 γ lm
MP2-NS
l m( )
(x)
lm
∑µν
∑ + 4 Γµν
l,MP2-NS
µν l( )
(x)
µν
∑
4
Lai
= 2 Γib
l
ab l( )bl
∑ − 2 Γ ja
l
ij l( )+ Aaibc
Pbc
(2)
bc
∑ + Aaijk
Pjk
(2)
jk
∑
jl
∑MP2
MP2
CPHF
occ.-occ. vir.-vir.
Pai
(2)
= Zai
occ.-vir.
Pµν
(2)
= Ppq
(2)
Cµp
Cνq
pq
∑Pµν
MP2
= Pµν
HF
+ Pµν
(2)
Γµν
l,MP2-NS
= Γia
l
Cµi
Cνa
ia
∑
Aaipq
Ppq
(2)
pq
∑ + εa
− εi( )Zai
= Lai
Wij
(2)
= 2 Γia
l
ja l( )
al
∑ −
1
2
Pij
(2)
εi + εj( )+ AijpqPpq
(2)
pq
∑
occ.-ccc. vir.-vir.
Wab
(2)
= −2 Γia
n
ib n( )
an
∑ −
1
2
Pab
(2)
εa + εb( ) Wai
(2)
= −2 Γ ja
n
ij n( )
ja
∑ − Pai
(2)
εi
occ.-vir.
Apqrs
= 4 pq rs( )− ps rq( )− pr sq( )
RI
2018.8.7
66. Evaluate 2p DM Yia
n
+ = 2Tij
ab
!Tji
ab
( )Bjb
n
jb
"
End Loop bProc
Evaluate 1p DM Pab
(2)
+ = 2Tij
ac
! Tji
ac
( )Tij
bc
ij
"
Evaluate 2p DM !ia
l
= Yia
n
n l( )
"1/2
n
#
Store !ia
l
to distributed memory
Evaluate 2p DM Xln
+ = !ia
l
Bia
n
ia
"
End Loop a
Allreduce E(2)
, Pij
(2)
, Pab
(2)
, and Xln
Evaluate 2p DM ! ln
= Xlm
m n( )
"1/2
n
#
Evaluate non-separable part of MP2 gradient
dEMP2
dx
+ = 2 ! lm
l m( )
(x)
lm
!
[Step 1: Evaluate 3c integrals]
Evaluate Bia
n
= l n( )
!1/2
C!a Cµi µ! l( )
µ
"!
"l
"
Store Bia
n
to distributed memory
[Step 2: Evaluate 1p & 2p MP2 density Matrix (DM) and
part of gradient]
Loop a Myrank (MPI parallel)
Loop bProc = 1, NProc
Sending
jb
nB to Myrank- bProc process
Receiving
jb
nB from Myrank+ bProc process
Evaluate 4c-2e integral (ia | jb) = Bia
n
Bjb
n
n
!
Evaluate Tij
ab
=
(ia | jb)
!i
+ ! j
! !a
! !b
Evaluate MP2 correlation energy E(2)
Evaluate 1p MP2 DM Pij
(2)
+ = ! 2Tik
ab
!Tki
ab
( )k
! Tjk
ab
MPI/OpenMP
RI-MP2 (1)
• MPI :
• 2 :
MPI
• OpenMP : MPI
• - BLAS DGEMM
: MPI
: OpenMP
MPI/OpenMP
:
3
2018.8.7
O(N5) BLAS DGEMM
67. [Step 3: Evaluate part of MP2 Lagrangian and gradient]
Loop bProc = 1, NProc
Sending !ia
l
to Myrank- bProc process
Receiving !ia
l
from Myrank+ bProc process
End Loop bProc
Loop L Myrank (MPI Parallel)
Evaluate MP2 Lagrangian Laq
3
+ = !ia
l
C"q
Cµi
µ" l( )µ
#
"
#
i
#
Evaluate MP2 Lagrangian Liq
4
+ = !ia
l
C!q
Cµa
µ! l( )µ
"!
"i
"
Evaluate 1p energy weighted DM
Wij
(2)
[I]+ = 2 !ia
l
ja l( )i
"
Evaluate non-separable part of MP2 gradient
dEMP2
dx
+ = 4 !µ!
l
µ! l( )
(x)
µ!
"
End Loop L
Allreaduce Laq
3
, Liq
4
, and , Wij
(2)
[I]
[Step 4: Evaluate part of MP2 Lagrangian]
Lai
1,2
= Cµi
C! j
2 µ! "#( )$ µ" !#( )%
&
'
( )P"#
(2)
"#
*
!
*
µ
*
[Step 5: Iteratively Solve CPHF equation]
Loop iter
Evaluate
Gai
= Cµa
C!i
4 µ! "#( )! µ! "#( )! µ! "#( )"
#
$
%
&&P!"
(2)
!"
'#
'µ
'
Solve CPHF equation Gai
+ !a
" !i( )Pai
(2)
= Lai
End Loop iter
[Step 6: Evaluate part of MP2 energy weighted DM]
Wij
(2)
[III] = Cµi
C! j
2 µ! "#( )! µ! "#( )"
#
$
%
P!"
(2)
!"
&#
&µ
&
[Step 7: Evaluate separable part of MP2 gradient]
dEMP2
dx
+ = Pµ!
MP2
Hµ!
(x)
+ Wµ!
MP2
Sµ!
(x)
µ!
! +
1
2
"µ!"#
MP2-S
µ! "#( )
(x)
µ!
!µ!
!
2 Fock MPI/OpenMP
AO
MPI/OpenMP
RI-MP2 : (2)
3 2
1 1
: MPI
: OpenMP
MPI/OpenMP
:
2018.8.7
71. (TDDFT)
•
• 1
2018.8.7
−
1
2
∇2
+υne
R,r( )+
ρ r,t( )
r − #r
d #r∫ +υxc
r,t( )+υ r,t( )
%
&
'
'
(
)
*
*
ψ r,t( )= i
∂
∂t
ψ r,t( )( )
( )
( ) ( ) ( )
1
,
2
ne xc i i id
r
u u e
é ù¢
¢- Ñ + + + Y = Yê ú
¢-ë û
ò
r
R r r r r r
r r
( )1r r( )1 2, , , NY r r r! ( )1,tr r( )1 2, , , ,N tY r r r!
(DFT) (TDDFT)
Kohn-Sham Kohn-Sham
72. (LR-TDDFT)
• TDDFT 1
LR-TDDFT
• ( O(N5-) )
O(N3)
•
1000
2018.8.7
A B
B*
A*
!
"
##
$
%
&&
X
Y
!
"
#
$
%
& =ω 1 0
0 −1
!
"
#
$
%
&
X
Y
!
"
#
$
%
&
)||()|()|(
)||()|()|(
)(
,
,
ttstsssttsttssss
ttstsssttsstttss
tsstss
d
eeddd
jbwiacibjacjbiaB
bjwiacijbacbjia
A
DFTHFbjai
DFTHF
iaabijbjai
+-=
+-
+-=
Casida
w:
73. LR-TDDFT
2018.8.7
t
AO
g+=(A+B)t, g-=(A-B)t
(AR-BR)1/2 (AR+BR) (AR-BR)1/2 ZR=w2ZR
WL=(A+B)R-wL, WR=(A-B)L-wR
t
W
W
90%
: O(N4)
SCF MPI/OpenMP
MPI/OpenMP
OpenMP
2, 3BLAS
t
AR+BR=t+g+, AR-BR=t+ g
-
AO Davidson
ZR:
L=|X-Y>
&
R =|X+Y>
OpenMP
Bµν
q!
"
#
$
= 2 µν λσ( )−cx
µλ νσ( )+cx
LR
µλ νσ( )
LR
+ fµν ,λσ
xc!
"&
#
$'tλσ
q!
"
#
$
MK and T. Nakajima, J. Comput.
Chem., 2017, 38, 489.
85. • Rene Descartes 1596 – 1650
–
–
–
• 1
–
• (Discours de la méthode) 2
–
(Le second était de diviser chacune des difficultés que
j‘examinais en autant de parcelles qu’il se pourrait et qu‘il
serait requis pour mieux les résoudre
2018.8.7
90. FMO-RI-MP2
1 1 † 1
( | ) ( )ln nm
n
l m L L- - -
= å
1
( | ) ( | )( | ) ( | )
lm
l l m mµn ls µn ls-
» å
1 1 1 ,( ) ( ) ( ) m
m
m cµ n µnc c » år r r
Resolution-of-identity (RI)
FMO-MP2
⇒
RI
Resolution-of-identity (RI) FMO-RI-MP2
3 ⇒
o:
v:
n:
N :
3
⇒
( ) i a j b
n n
n
i a j b B B
a a a a
a a a a
= å
1
( | )nl
i a
n i a
l
B L C l C
a a
a a
µ n
µ n
µn-
= å å å1
( | )nl
i a
n a
l
B L l C
a a
a
n
n
µn-
= å å
2018.8.7
MK Theor. Chem. Acc., 2011, 130, 449–453.
91. FMO RI-MP2
:
968 15,719
2 /1
(484 )
FMO2 RI-MP2/6-31G*
12288 86016 0.82
GAMESS-FMO-MP2 MPI/OpenMP
( )
2018.8.7
MK, T. Nakajima, and S. Nagase
Proceedings of JSST 2012 338-343
93. W. Yang
• DC-DFT : W. Yang DFT O(N)
• DC-semi-empirical MO : W. Yang
( )
• DC-Hartree Fock : W. Yang DC-DFT ( ),
• DC-MP2 : ,
• DC-coupled cluster ,
• DC-TD-DFT : ,
• DC-DFTB (density functional tight binding) : ( ),
, ,
2018.8.7
94. DC-MP2
MP2
µ S(a)
MP2
MP2
µ S(a)
MP2
( )
occ( ) vir( )
corr
, , ( )
2i iajb ibja
i j a b S
E C a j b t t
a a
a a a a a a a
µ
µ a
µ
Î
é ù= -ë ûå å å ! !
subsystem
corr corrE Ea
a
» å
a MO i
( )
iajb
i j a b
i a j b
t
a a a a
a
a a a a
e e e e
=
+ - -
!
M. Kobayashi, Y. Imamura, and H. Nakai, J. Chem. Phys., 127. 074103, (2007).
2018.8.7