Automated reduction of attack surface using call graph enumeration

Automated reduction of attack surface
using call graph enumeration
Ruo Ando
National Institute of Informatics, Japan
2018 2nd International Conference on
Management Engineering, Software
Engineering and Service Sciences (ICMSS)
Jan 13-15, 2018 in Wuhan, China
SESSION 1: Software Engineering and System Modeling
13:30 – 13:45

Attack surface
Vulnerable function
CVE-2013-4371
Use-after-free Xen Hypervisor
under high memory pressure
What is an attack surface ?
The Attack Surface of an application is:
1.the sum of all paths for data/commands
into and out of the application, and
2.the code that protects these paths
(including resource connection and
authentication, authorization, activity
logging, data validation and encoding), and
3.all valuable data used in the application,
including secrets and keys, intellectual
property, critical business data, personal
data and PII, and
4.the code that protects these data
(including encryption and checksums,
access auditing, and data integrity and
operational security controls).
For simplification, in this paper,
we define attack surface as function
call graph from malicious input
to vulnerable function.
https://www.owasp.org/index.php/Attack_Surface_Analysis_Cheat_Sheet
Malicious input

Abstract: Attack surface reduction
In this paper, we propose a method of
attack surface reduction using
enumeration of call graph.
① Proposal system is divided into two
steps: enumerating edge E[Function Fi,
Function Fi+1] and constructing call graph
by recursive search of [E1, E2, En].
② Proposed method enables us to find the
sum of paths of which leaf node is
vulnerable function VF. Also, root node RF
of call graph is part of program which is
open to attacker.
③ Therefore, call graph [VF, RF] can be
eliminated according the situation where
the program is running.
We apply proposal method to the real
programs (Xen) and extracts the attack
surface of CVE-2013-4371.
Malicious input
vulnerability

TARGET: Xen Open Source Hypervisor Software
# global -t cmdtable_lookup
cmdtable_lookup tools/libxl/xl_cmdtable.c 390
INPUT
20struct cmd_spec cmd_table[] = {
34 { "list",
35 &main_list,
36 "List information about all/some domains",
37 "[options] [Domain]¥n",
38 "-l, --long Output all VM details¥n"
39 "-v, --verbose Prints out UUIDs",
40 },
134 { "migrate-receive",
135 &main_migrate_receive,
136 "Restore a domain from a saved state",
137 "- for internal use only",
138 },
This attack surface can be eliminated (reduced).
https://www.xenproject.org/
VULNERABLE FUNCTION
libxl_cpupoolinfo * libxl_list_cpupool(libxl_ctx *ctx, int *nb_pool)

TARGET: CVE-2013-4371
Use-after-free Xen Hypervisor
402 tmp = realloc(ptr, (i + 1) * sizeof(libxl_cpupoolinfo));
388libxl_cpupoolinfo * libxl_list_cpupool(libxl_ctx *ctx, int *nb_pool)
389{
390 libxl_cpupoolinfo *ptr, *tmp;
397 poolid = 0;
398 for (i = 0;; i++) {
399 info = xc_cpupool_getinfo(ctx->xch, poolid);
400 if (info == NULL)
401 break;
402 tmp = realloc(ptr, (i + 1) * sizeof(libxl_cpupoolinfo));
403 if (!tmp) {
404 LIBXL__LOG_ERRNO(ctx, LIBXL__LOG_ERROR, "allocating cpupool info");
405 free(ptr);
406 xc_cpupool_infofree(ctx->xch, info);
407 return NULL;
408 }
409 ptr = tmp;
410 ptr[i].poolid = info->cpupool_id;
411 ptr[i].sched_id = info->sched_id;
412 ptr[i].n_dom = info->n_dom;
413 if (libxl_cpumap_alloc(ctx, &ptr[i].cpumap)) {
415 break;
416 }
417 memcpy(ptr[i].cpumap.map, info->cpumap, ptr[i].cpumap.size);
418 poolid = info->cpupool_id + 1;
420 }
realloc use-after-free vulnerability
Use-after-free vulnerability in the
libxl¥_list_cpupool function in the libxl
toolstack library in Xen 4.2.x and 4.3.x,
when running "under memory pressure,"
returns the original pointer when the
realloc function fails, which allows local
users to cause a denial of service (heap
corruption and crash) and possibly
execute arbitrary code via unspecified
vectors.
At line 402, Xen uses realloc for reallocating
the memory. Note that the address of
libxl_cpupoolinfo is already assigned outside of
this routine. Under high pressure, realloc can
not extend the memory from the original
pointer which is already obtained. in this case,
realloc newly yielding the address which
remaining the data to be written.
FOR LOOP without boundary
condition is dangerous
*tmp is return value of realloc

User-after-free (aka heap spray) : CVE-2013-4371 Xen hypervisor
http://blog.tempest.com.br/breno-cunha/perspectives-on-exploit-development-and-cyber-
attacks.html
create()
free()
realloc() use()
Jump to payload
on heap
ROP + Shellcode
int *tmp = (int*)realloc(k,(N+1)*sizeof(int));
if( tmp!=NULL ){
k = tmp;
puts("________realloc(k)_________");
for(i=0; i<N+1; i++){
printf("&k[%d]:%p ,
k[%d]=%d¥n",i,&k[i],i,k[i]);
}
}
realloc() allocates new area if realloc() fails to expand
the pointer of first argument.
Consequently, the string is saved but the address is changed.
If too many realloc() is invoked, DoS is enabled by adversary and an arbitrary
malicious string can be executed.
void* realloc (void* ptr, size_t size);

① enumerating function call graph of routine(R[I], N[J], F[K])
② enumerating invocation list(I[O], M[P], F[Q])
③generating list (R,N) for each F
F[k] {(R[i1], N[j1]), (R[i2], N[j2]) . . }
④ For each invocation list（I[O], M[P], F[K]),
Detecting a position of S[x] of M[P] in list F[k]{R[i]N[j]..} with binary search
⑤ Concatenating （R[I], S[x-1], F[k]) and (I[O],F[Q]) of S[x-1]
⑥ Generating a list of E({(R[i], S[x-1], F[k]), （I[O], M[P], F[K])} ...) by repeating ④ and ⑤.
E is enumerable. That is, ⑥ should be finished in finite steps.
⑦ Enumerating CG (Call Graph) of E({(R[i], S[x-1], F[k]),（I[O], M[P], F[K])} ...) from vulnerable function to malicious input.
Exit Condition １：node is EMPTY
Exit Condition２：node is MAIN
https://github.com/RuoAndo/Saturator/blob/master/postgres/ex.py
Algorithm of detecting attack surface

node invocation edge H（時） M（分） S（秒）
xen401 1111 193297 7149 2 45 48
xen451 1792 406859 11513 6 5 27
xen420 1542 344695 9566 5 3 49
xen434 1630 367031 10077 5 28 54
xen403 1123 193480 7191 2 44 41
xen461 1783 435286 11795 6 41 25
xen441 1676 389811 10516 5 56 40
xen342 907 163628 5070 2 27 8
xen410 1302 195986 7977 2 54 28
xen343 908 163832 5082 2 26 36
xen453 1795 407036 11546 6 16 7
xen464 1783 436076 11809 6 34 9
xen341 906 163088 5036 2 28 40
xen412 1309 196290 8008 2 54 49
xen415 1384 197232 8560 2 57 32
xen471 2281 466237 16291 7 13 20
xen413 1310 196503 8024 2 56 9
xen340 906 1628849 7250 30 11 28
xen442 1679 389955 10554 5 47 17
xen480 2299 442614 15769 7 51 48
xen423 1550 345345 9670 5 12 36
Enumerating (counting all) node, invocation and edge in source code
200: Function B
Function A @ FILE X
cmdtable_lookup tools/libxl/xl_cmdtable.c 390
Function B @ FILE Y
node
invocation
edge

version search depth
EMPTY
(LEAF)
EDGE version search depth
EMPTY
(LEAF)
EDGE
xen440 1 1 15xen433 1 1 7
xen440 2 6 62xen433 2 5 36
xen440 3 68 876xen433 3 74 998
xen412 1 0 8xen453 1 1 15
xen412 2 1 17xen453 2 6 65
xen412 3 10 119xen453 3 11 208
xen414 1 0 8xen432 1 1 7
xen414 2 1 17xen432 2 5 36
xen414 3 10 119xen432 3 88 1223
xen410 2 1 17xen450 1 1 15
xen410 3 10 122xen450 2 6 65
xen441 1 1 15xen450 3 11 203
xen441 2 6 62xen451 1 1 15
xen441 3 83 1068xen451 2 6 65
xen415 1 0 8xen451 3 10 201
xen415 2 1 17xen434 1 1 7
xen415 3 10 119xen434 2 5 36
xen422 1 0 8xen434 3 81 1043
xen422 2 3 36xen455 1 1 15
xen422 3 87 1525xen455 2 6 65
xen424 1 0 8xen455 3 12 217
xen424 2 3 36xen430 1 1 7
xen424 3 80 1476xen430 2 5 36
xen413 1 0 8xen430 3 62 761
xen413 2 1 17xen442 1 1 15
xen413 3 10 119xen442 2 6 62
xen425 1 0 8xen442 3 68 876
xen440 :
libxl_list_cpupool : 3 :
68 : 876
psycopg2.Operational
Error: FATAL: sorry,
too many clients
already
FATAL: sorry, too
many clients already
xen440 :
297 : 4811
xen414 :
10 : 119
xen414 :
10 : 123
xen414 :
10 : 127
LIMITATION: Upper bound of PostgreSQL: search depth > 4 (path explotion)

Conclusion: automated attack surface reduction is possible !
In this paper, we propose a method of
attack surface reduction using
enumeration of call graph.
① Proposal system is divided into two
steps: enumerating edge E[Function Fi,
Function Fi+1] and constructing call graph
by recursive search of [E1, E2, En].
② Proposed method enables us to find the
sum of paths of which leaf node is
vulnerable function VF. Also, root node RF
of call graph is part of program which is
open to attacker.
③ Therefore, call graph [VF, RF] can be
eliminated according the situation where
the program is running.
Malicious input
vulnerability
Proposal method can detect attack surface in major
open source software such as xen CVE-2013-4371 in
feasible computing time (2-7 hours).

Automated reduction of attack surface using call graph enumeration

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Automated reduction of attack surface using call graph enumeration

Similar to Automated reduction of attack surface using call graph enumeration (20)

More from Ruo Ando

More from Ruo Ando (20)

Recently uploaded

Recently uploaded (20)

Automated reduction of attack surface using call graph enumeration