glibc的malloc hook機制試玩

glibc裡其實有實作了不少有趣的機制，我今天試著玩其中的一個小東西 - memory allocation hooks。

我們都知道，在C語言中若需要在執行期要到一塊大小為size的記憶體需要透過malloc(size)去獲得，但是malloc之後必須在不用了以後正確釋放，否則就會有memory leak產生。用肉眼去觀察malloc/free的流程是非常辛苦的一件事，尤其當我們接手別人的程式時，要確認這件事更是困難重重。當然，有不少現成的分析工具可以使用，不過我愈來愈想知道那些工具的可能實作手法，或許必要時就可以自己做一個客制化的小工具。在這種想法下，我在這篇文章發現了memory allocation hooks(嗯...以前從來沒有仔細看過glibc manual，沒想到glibc的特異功能還真不少 :P)，稍微想了一下，利用這個機制應該就可以完成一個簡易的memory leak detector，glibc對其相關介面的描述是：

__malloc_hook

The value of this variable is a pointer to the function that malloc uses whenever it is called. You should define this function to look like malloc; that is, like:
          void *function (size_t size, const void *caller)
The value of caller is the return address found on the stack when the malloc function was called. This value allows you to trace the memory consumption of the program.

__free_hook

The value of this variable is a pointer to function that free uses whenever it is called. You should define this function to look like free; that is, like:
          void function (void *ptr, const void *caller)
The value of caller is the return address found on the stack when the free function was called. This value allows you to trace the memory consumption of the program.

__malloc_initialize_hook

The value of this variable is a pointer to a function that is called once when the malloc implementation is initialized. This is a weak variable, so it can be overridden in the application with a definition like the following:
          void (*__malloc_initialize_hook) (void) = my_init_hook;

說明的很清楚，網頁還給出一個流程說明，我修改了一下讓其可以完整執行起來：

#include <stdio.h>
/* Prototypes for __malloc_hook, __free_hook */
#include <malloc.h>
/* Prototypes for our hooks. */
static void my_init_hook (void);
static void *my_malloc_hook (size_t, const void *);
static void my_free_hook (void*, const void *);
/* two function pointer to save old hooks */
static void *(*old_malloc_hook)(size_t, const void *);
static void (*old_free_hook)(void *, const void *);
/* Override initializing hook from the C library. */
void (*__malloc_initialize_hook) (void) = my_init_hook;
static void
my_init_hook (void)
{
old_malloc_hook = __malloc_hook;
old_free_hook = __free_hook;
__malloc_hook = my_malloc_hook;
__free_hook = my_free_hook;
}
static void *
my_malloc_hook (size_t size, const void *caller)
{
void *result;
printf("caller %pn", caller);
/* Restore all old hooks */
__malloc_hook = old_malloc_hook;
__free_hook = old_free_hook;
/* Call recursively */
result = malloc (size);
/* Save underlying hooks */
old_malloc_hook = __malloc_hook;
old_free_hook = __free_hook;
/* printf might call malloc, so protect it too. */
printf ("malloc (%u) returns %pn", (unsigned int) size, result);
/* Restore our own hooks */
__malloc_hook = my_malloc_hook;
__free_hook = my_free_hook;
return result;
}
static void
my_free_hook (void *ptr, const void *caller)
{
/* Restore all old hooks */
__malloc_hook = old_malloc_hook;
__free_hook = old_free_hook;
/* Call recursively */
free (ptr);
/* Save underlying hooks */
old_malloc_hook = __malloc_hook;
old_free_hook = __free_hook;
/* printf might call free, so protect it too. */
printf ("freed pointer %pn", ptr);
/* Restore our own hooks */
__malloc_hook = my_malloc_hook;
__free_hook = my_free_hook;
}
static void foo(void)
{
char *ptr = malloc(100);
*ptr = 0x99;
free(ptr);
}
int main(void)
{
foo();
foo();
}

不試還好，一試就發現執行結果跟我預期不符。我在my_malloc_hook將caller的address印出來看，結果發現兩次foo()去呼叫malloc()，卻出現不同結果：

[mars@dream malloc_free_hook]$ gcc -o test_simple test_simple.c
[mars@dream malloc_free_hook]$ ./test_simple
caller 0xb77bea13
malloc (100) returns 0x96c8008
freed pointer 0x96c8008
caller 0x804860a
malloc (100) returns 0x96c8008
freed pointer 0x96c8008

可以確認的是，第二次是對的，並且在第二次之後也都是對的(用objdunp -d即可確認foo中的malloc呼叫時，其PC是0x804860a)，那第一次的錯誤是怎麼回事？

想一想...耶！知道了！因為gcc預設是用dynamic linking去link libc的關係啊！

之前我翻譯過Eli大俠的共享庫動態載入機制中有提到，程式對動態共享庫的引用，第一次引用的路徑是不同於第二次的！第一次長這樣：

會由PLT[0]中的resolver去"call" malloc(所以return address是落在libc.so，可透過/proc/process_id/maps觀察得知)，然後在第二次呼叫時，才會直接跳到已被更新的GOT[n](此時是由call func@PLT過來的，所以return address落在我們的函式中)。第二次呼叫的流程長這樣：

有趣吧？:) 所以我們可以在main中malloc()/free()一次，當作第一次的malloc不會有memory leak問題，只看接下來的統計即可。換另個角度想，那麼只要用static link的方式就應該要能避掉這個問題嘍？我們來試一下：

[mars@dream malloc_free_hook]$ gcc -static -o test_simple test_simple.c
[mars@dream malloc_free_hook]$ ./test_simple
caller 0x804de54
malloc (20) returns 0x8bdbcd8
caller 0x8073803
malloc (2480) returns 0x8bdbcf0
caller 0x8048376
malloc (100) returns 0x8bdc6a8
freed pointer 0x8bdc6a8
caller 0x8048376
malloc (100) returns 0x8bdc6a8
freed pointer 0x8bdc6a8

看來我們在程式中呼叫的都一致了，但藍色的是哪裡多出來的？同樣地，我用objdump -d看一下，發現是在_dl_init_paths與__libc_malloc中：

08073790 <_dl_init_paths>:

8073790: 55 push %ebp

8073791: b9 dc 5f 0c 08 mov $0x80c5fdc,%ecx

8073796: 89 e5 mov %esp,%ebp

...

80737fe: e8 8d a4 fd ff call 804dc90 <__libc_malloc>
8073803: 89 03 mov %eax,(%ebx)

8073805: a1 e4 5f 0c 08 mov 0x80c5fe4,%eax

807380a: 8b 30 mov (%eax),%esi

807380c: 85 f6 test %esi,%esi

...

0804dc90 <__libc_malloc>:

804dc90: 55 push %ebp

804dc91: 89 e5 mov %esp,%ebp

804dc93: 83 ec 1c sub $0x1c,%esp

...

804de52: ff d0 call *%eax

804de54: 89 c6 mov %eax,%esi

804de56: e9 c5 fe ff ff jmp 804dd20 <__libc_malloc+0x90>

804de5b: 90 nop

804de5c: 8d 74 26 00 lea 0x0(%esi,%eiz,1),%esi

真是Orz...原來這個hook機制用在static link的情況時，似乎會有其他的初始動作，使得我們的hook會在還沒在main()之前就被用到。看來明天要來翻glibc的代碼來看看...

果然還是要動手試作一下才知道一堆眉眉角角...-_-

軟體學徒forever

搜尋此網誌

glibc的malloc hook機制試玩

標籤

留言

張貼留言

這個網誌中的熱門文章

淺讀Linux root file system初始化流程

誰在呼叫我？不同的backtrace實作說明好文章

中文試譯：Writing a game in Python with Pygame. Part I