qemu-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Qemu-devel] [PATCH v2] Add AVX, AVX-512, MPX support to x86_cpu_dum


From: Doug Gale
Subject: Re: [Qemu-devel] [PATCH v2] Add AVX, AVX-512, MPX support to x86_cpu_dump_state
Date: Thu, 7 Dec 2017 18:01:29 -0500

On Thu, Dec 7, 2017 at 5:37 PM, Richard Henderson <
address@hidden> wrote:

> On 12/02/2017 10:35 PM, Doug Gale wrote:
> > Signed-off-by: Doug Gale <address@hidden>
> > ---
> > Fix MSB LSB showing when SSE is disabled
> >  target/i386/helper.c | 95 ++++++++++++++++++++++++++++++
> +++++++++++++++-------
> >  1 file changed, 83 insertions(+), 12 deletions(-)
> >
> > diff --git a/target/i386/helper.c b/target/i386/helper.c
> > index f63eb3d3f4..03812b6e87 100644
> > --- a/target/i386/helper.c
> > +++ b/target/i386/helper.c
> > @@ -543,6 +543,7 @@ void x86_cpu_dump_state(CPUState *cs, FILE *f,
> fprintf_function cpu_fprintf,
> >          }
> >      }
> >      cpu_fprintf(f, "EFER=%016" PRIx64 "\n", env->efer);
> > +    cpu_fprintf(f, "XCR0=%016" PRIx64 "\n", env->xcr0);
> >      if (flags & CPU_DUMP_FPU) {
> >          int fptag;
> >          fptag = 0;
> > @@ -565,21 +566,91 @@ void x86_cpu_dump_state(CPUState *cs, FILE *f,
> fprintf_function cpu_fprintf,
> >              else
> >                  cpu_fprintf(f, " ");
> >          }
> > -        if (env->hflags & HF_CS64_MASK)
> > -            nb = 16;
> > -        else
> > +
> > +        if (env->hflags & HF_CS64_MASK) {
> > +            if (env->xcr0 & XSTATE_Hi16_ZMM_MASK) {
> > +                /* AVX-512 32 registers enabled */
> > +                nb = 32;
> > +            } else {
> > +                /* 64-bit mode, 16 registers */
> > +                nb = 16;
> > +            }
> > +        } else {
> > +            /* 32 bit mode, 8 registers */
> >              nb = 8;
> > -        for(i=0;i<nb;i++) {
> > -            cpu_fprintf(f, "XMM%02d=%08x%08x%08x%08x",
> > -                        i,
> > -                        env->xmm_regs[i].ZMM_L(3),
> > -                        env->xmm_regs[i].ZMM_L(2),
> > -                        env->xmm_regs[i].ZMM_L(1),
> > -                        env->xmm_regs[i].ZMM_L(0));
> > -            if ((i & 1) == 1)
> > +        }
> > +
> > +        /* sse register width in units of 64 bits */
> > +        int zmm_width;
> > +        char zmm_name;
> > +        if (env->xcr0 & XSTATE_ZMM_Hi256_MASK) {
> > +            /* 512-bit "ZMM" - AVX-512 registers enabled */
> > +            zmm_width = 8;
> > +            zmm_name = 'Z';
> > +        } else if (env->xcr0 & XSTATE_YMM_MASK) {
> > +            /* 256-bit "YMM" - AVX enabled */
> > +            zmm_width = 4;
> > +            zmm_name = 'Y';
> > +        } else if (env->cr[4] & CR4_OSFXSR_MASK) {
> > +            /* 128-bit "XMM" - SSE enabled */
> > +            zmm_width = 2;
> > +            zmm_name = 'X';
> > +        } else {
> > +            /* SSE not enabled */
> > +            zmm_width = 0;
> > +            zmm_name = 0;
>
> You should exit the function here ...
>
> > +        }
> > +
> > +        if (zmm_width > 0) {
> > +            cpu_fprintf(f, "      MSB%*sLSB\n",
> > +                        -(zmm_width * 16 + zmm_width - 7), "");
> > +        }
> > +
> > +        for (i = 0; i < nb; i++) {
> > +            if (zmm_width == 0) {
> > +                cpu_fprintf(f, "SSE not enabled\n");
> > +                break;
> > +            }
>
> ... rather than performing this test in a loop.
> Or indeed, printing anything at all.
>
> I assume this is in order to examine registers in the monitor under kvm?
>
>
> r~
>

​
Yes, to examine the registers in the monitor and improve the output of
things like -d int. I ran into an issue with
​ guest​
code and I wanted to check XCR0 to see if AVX was enabled, and I didn't see
any AVX registers
​, which was very misleading​
.
​ I couldn't check XCR0 to see if it was enabled either.​
I investigated and found that AVX was enabled, and the monitor output was
false. I added XCR0 and the entire AVX
​/AVX-512​
and MPX state to the
​qemu ​
output.

Why would it exit the function there? Are you sure the other states I
further down won't be enabled? It prints the MPX state further down. I put
the test in the loop so I wouldn't have to indent that whole `for` loop
another level, to improve readability. Is the performance of that loop
really relevant? The fprintf's will take at least thousands of times more
CPU time than that predictable branch.​


reply via email to

[Prev in Thread] Current Thread [Next in Thread]