gcl-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH 1/2] Fix libX11 early init causing mmap layout changed


From: Jinyang He
Subject: Re: [PATCH 1/2] Fix libX11 early init causing mmap layout changed
Date: Thu, 11 Jul 2024 15:38:08 +0800
User-agent: Mozilla/5.0 (X11; Linux loongarch64; rv:68.0) Gecko/20100101 Thunderbird/68.7.0

On 2024-07-11 09:03, Camm Maguire wrote:
Greetings!  In followup to my earlier query, can you please detail what
happens when resource limits are set before execve as you mention in
your comment?
For clarity, I'll briefly describe the mmap mechanism on Linux. It provides
two memory layouts usually, "arch_get_unmapped_{area, area_topdown}".
When user mode does mmap, it calls, (source based 6.10.0-rc6)
do_mmap->__get_unmapped_area->mm_get_unmapped_area_vmflags->arch_get_unmapped_{area, area_topdown}
It choose topdown or other by MMF_TOPDOWN bit.
When user mode does execve, it calls,
...->load_elf_binary->setup_new_exec->arch_pick_mmap_layout->...
The arch_pick_mmap_layout clear or set MMF_TOPDOWN bit by "mmap_is_legacy".
The CONFIG_STACK_GROWSUP is always false except parisc. Thus, if the
"rlim_stack->rlim_cur == RLIM_INFINITY" it always returns true and clear
the MMF_TOPDOWN bit. Once the MMF_TOPDOWN bit is set in do_execve, we
have little chance to change it in user mode.

The above details means, once the "rlim_stack->rlim_cur" set to -1ull before
execve. The new exec mmap layout is not topdown.
The quickly check cmd,
 "$ cat /proc/self/maps; ulimit -s unlimited; cat /proc/self/maps"

In another story, I'll give the interaction between gcl and libX11.
The call trace is follows, (libX11 with 1.8+ --enable-thread-safety-constructor)
#0  gcl_init_alloc (cs_start=0x7ffffffef9a8) at alloc.c:1203
#1  0x0000000000153df4 in malloc_internal (size=40) at alloc.c:1665
#2  0x0000000000153ed8 in malloc (size=40) at alloc.c:1682
#3  0x00007ffff7e63474 in XInitThreads () at /usr/src/debug/libx11/libX11-1.8.7/src/locking.c:614 #4  0x00007ffff7fbdc84 in call_init (l=<optimized out>, argc=1, argv=0x7ffffffefa98, env=0x7ffffffefaa8) at dl-init.c:74 #5  call_init (l=<optimized out>, argc=<optimized out>, argv=0x7ffffffefa98, env=0x7ffffffefaa8) at dl-init.c:26 #6  0x00007ffff7fbddac in _dl_init (main_map=0x7ffff7fed360, argc=1, argv=0x7ffffffefa98, env=0x7ffffffefaa8) at dl-init.c:121 #7  0x00007ffff7fd1bb8 in _start () from /lib64/ld-linux-loongarch-lp64d.so.1
The gcl depends on libX11.so so when load libX11.so, it calls XInitThreads.

In the past, the order of call gcl_init_alloc is
  main->execve->main->gcl_init_alloc(set rlimit)
Now the order is
  XInitThreads(gcl_init_alloc(set rlimit))->main->execve->main...
In short, the glibc::malloc is preempted by gcl::malloc and causing
earlier set stack rlimit before doing execve.

When make raw_pre_gcl to saved_pre_gcl, the raw_pre_gcl calls
gcl_init_alloc due to "gcl_alloc_initialized" is false and then
the mmap memory layout changed after execve. When exec saved_pre_gcl,
the "gcl_alloc_initialized" is true so mmap memory layout is not changed.
The "libboot.so" has been mapped to diffrent virtuall address,
finally the function call is broken.


  Isn't this still the case with this patch?
This patch restore the stack limits to keep the same stack limits after
execve itself. It is some like we should keep the same environments except
setting "GCL_UNRANDOMIZE" and setting "ADDR_NO_RANDOMIZE". However, the
stack limits has been changed before execve, it changes the environments.
  Rather the
patch appears to intend to reassert this setting immediately before
execve.

Take care,

Greetings, and thanks so much again for these patches!
Can you please give me a bit of explanation on this one?  What alters
the resource limits between gcl_init_alloc 'before main' and main
itself?
The raw_pre_gcl calling gcl_init_alloc but saved_pre_gcl not calling it
when do initcall(XInitThreads), it happens before do execve.
  Would this patch work just as well if the saved resource limits
were restored in the second invocation of gcl_init_alloc from main
instead of immediately prior to execve in unrandomize.h?
Badly, the mmap memory layout is determined at execve.

Thanks,
Jinyang
Take care,
Jinyang He <hejinyang@loongson.cn> writes:

After libX.so release 1.8 with `--enable-thread-safety-constructor`,
the libX.so calls `malloc` indirectly by calling XInitThreads() in
init_array. The raw_ preempts `malloc` in EXE and calls gcl_init_alloc()
before main. It set rlimit_stack before execve itself, and in linux
kernel this action changes the mmap way from topdown to downtop
(in many archs). Thus, saved the rlimit_stack if before_main and
restore it if it need re-execve itself.
---
  gcl/h/unrandomize.h | 16 ++++++++++++++++
  gcl/o/alloc.c       | 10 ++++++++++
  gcl/o/main.c        |  2 ++
  3 files changed, 28 insertions(+)

diff --git a/gcl/h/unrandomize.h b/gcl/h/unrandomize.h
index 04b7c507e..9ca66d0a9 100644
--- a/gcl/h/unrandomize.h
+++ b/gcl/h/unrandomize.h
@@ -5,6 +5,9 @@
  #include <string.h>
  #include <alloca.h>
  #include <errno.h>
+#if defined(__linux__) && defined(RLIMIT_STACK)
+#include <sys/resource.h>
+#endif
{
    errno=0;
@@ -53,6 +56,19 @@
        errno=0;
  #ifdef HAVE_GCL_CLEANUP
        gcl_cleanup(0);
+#endif
+#if defined(__linux__) && defined(RLIMIT_STACK)
+       {
+         extern struct rlimit rl_stack_saved;
+         /* Reset the rlim_cur incase*/
+         if (rl_stack_saved.rlim_cur != 0 && rl_stack_saved.rlim_max != 0)
+           if(setrlimit(RLIMIT_STACK, &rl_stack_saved)) {
+             printf("restore rlimit_stack failure %d\n",errno);
+             exit(-1);
+           }
+         rl_stack_saved = (struct rlimit){0, 0};
+         errno=0;
+       }
  #endif
        execve(*a,a,n);
        printf("execve failure %d\n",errno);
diff --git a/gcl/o/alloc.c b/gcl/o/alloc.c
index 8620df495..4d29ec860 100644
--- a/gcl/o/alloc.c
+++ b/gcl/o/alloc.c
@@ -1189,6 +1189,10 @@ init_tm(enum type t, char *name, int elsize, int nelts, 
int sgc,int distinct) {
     simplicity.  set_maxpage is overloaded, and the positioning of its
     call is too fragile.  20050115 CM*/
  static int gcl_alloc_initialized;
+#if defined(__linux__) && defined(RLIMIT_STACK)
+extern int before_main;
+struct rlimit rl_stack_saved;
+#endif
object malloc_list=Cnil; @@ -1238,6 +1242,12 @@ gcl_init_alloc(void *cs_start) {
      }
massert(!getrlimit(RLIMIT_STACK, &rl));
+#ifdef __linux__
+    if (before_main)
+      rl_stack_saved = rl;
+    else
+      rl_stack_saved = (struct rlimit){0, 0};
+#endif
      if (rl.rlim_cur!=RLIM_INFINITY && (rl.rlim_max == RLIM_INFINITY || 
rl.rlim_max > rl.rlim_cur)) {
        rl.rlim_cur = rl.rlim_max; /* == RLIM_INFINITY ? rl.rlim_max : 
rl.rlim_max/64; */
        massert(!setrlimit(RLIMIT_STACK,&rl));
diff --git a/gcl/o/main.c b/gcl/o/main.c
index 6621c3a16..be241d1af 100644
--- a/gcl/o/main.c
+++ b/gcl/o/main.c
@@ -574,8 +574,10 @@ 
DEFUN("KCL-SELF",object,fSkcl_self,SI,0,0,NONE,OO,OO,OO,OO,(void),"") {
} +int before_main=1;
  int
  main(int argc, char **argv, char **envp) {
+  before_main=0;
GET_FULL_PATH_SELF(kcl_self);
    *argv=kcl_self;



reply via email to

[Prev in Thread] Current Thread [Next in Thread]