[PATCH 4/5] x86-64: Intel64 adjustments for conditional jumps

Jan Beulich jbeulich@suse.com
Thu Jul 16 11:57:32 GMT 2020


On 16.07.2020 13:38, H.J. Lu wrote:
> On Thu, Jul 16, 2020 at 2:53 AM Jan Beulich <jbeulich@suse.com> wrote:
>>
>> On 15.07.2020 16:02, H.J. Lu wrote:
>>> On Tue, Jul 14, 2020 at 11:08 PM Jan Beulich <jbeulich@suse.com> wrote:
>>>>
>>>> On 14.07.2020 14:59, H.J. Lu wrote:
>>>>> On Tue, Jul 14, 2020 at 5:47 AM Jan Beulich <jbeulich@suse.com> wrote:
>>>>>>
>>>>>> On 14.07.2020 14:36, H.J. Lu wrote:
>>>>>>> On Tue, Jul 14, 2020 at 5:20 AM Jan Beulich <jbeulich@suse.com> wrote:
>>>>>>>>
>>>>>>>> On 14.07.2020 14:18, Jan Beulich wrote:
>>>>>>>>> On 14.07.2020 14:00, H.J. Lu wrote:
>>>>>>>>>> On Tue, Jul 14, 2020 at 3:13 AM Jan Beulich <jbeulich@suse.com> wrote:
>>>>>>>>>>> --- a/gas/testsuite/gas/i386/opcode-suffix.d
>>>>>>>>>>> +++ b/gas/testsuite/gas/i386/opcode-suffix.d
>>>>>>>>>>> @@ -305,22 +305,22 @@ Disassembly of section .text:
>>>>>>>>>>>   *[0-9a-f]+:   0f 77[  ]+emms[         ]+
>>>>>>>>>>>   *[0-9a-f]+:   0f 7e 90 90 90 90 90[   ]+movd[         ]+%mm2,-0x6f6f6f70\(%eax\)
>>>>>>>>>>>   *[0-9a-f]+:   0f 7f 90 90 90 90 90[   ]+movq[         ]+%mm2,-0x6f6f6f70\(%eax\)
>>>>>>>>>>> - *[0-9a-f]+:   0f 80 90 90 90 90[      ]+jo[   ]+909094e2 <foo\+0x909094e2>
>>>>>>>>>>> - *[0-9a-f]+:   0f 81 90 90 90 90[      ]+jno[  ]+909094e8 <foo\+0x909094e8>
>>>>>>>>>>> - *[0-9a-f]+:   0f 82 90 90 90 90[      ]+jb[   ]+909094ee <foo\+0x909094ee>
>>>>>>>>>>> - *[0-9a-f]+:   0f 83 90 90 90 90[      ]+jae[  ]+909094f4 <foo\+0x909094f4>
>>>>>>>>>>> - *[0-9a-f]+:   0f 84 90 90 90 90[      ]+je[   ]+909094fa <foo\+0x909094fa>
>>>>>>>>>>> - *[0-9a-f]+:   0f 85 90 90 90 90[      ]+jne[  ]+90909500 <foo\+0x90909500>
>>>>>>>>>>> - *[0-9a-f]+:   0f 86 90 90 90 90[      ]+jbe[  ]+90909506 <foo\+0x90909506>
>>>>>>>>>>> - *[0-9a-f]+:   0f 87 90 90 90 90[      ]+ja[   ]+9090950c <foo\+0x9090950c>
>>>>>>>>>>> - *[0-9a-f]+:   0f 88 90 90 90 90[      ]+js[   ]+90909512 <foo\+0x90909512>
>>>>>>>>>>> - *[0-9a-f]+:   0f 89 90 90 90 90[      ]+jns[  ]+90909518 <foo\+0x90909518>
>>>>>>>>>>> - *[0-9a-f]+:   0f 8a 90 90 90 90[      ]+jp[   ]+9090951e <foo\+0x9090951e>
>>>>>>>>>>> - *[0-9a-f]+:   0f 8b 90 90 90 90[      ]+jnp[  ]+90909524 <foo\+0x90909524>
>>>>>>>>>>> - *[0-9a-f]+:   0f 8c 90 90 90 90[      ]+jl[   ]+9090952a <foo\+0x9090952a>
>>>>>>>>>>> - *[0-9a-f]+:   0f 8d 90 90 90 90[      ]+jge[  ]+90909530 <foo\+0x90909530>
>>>>>>>>>>> - *[0-9a-f]+:   0f 8e 90 90 90 90[      ]+jle[  ]+90909536 <foo\+0x90909536>
>>>>>>>>>>> - *[0-9a-f]+:   0f 8f 90 90 90 90[      ]+jg[   ]+9090953c <foo\+0x9090953c>
>>>>>>>>>>> + *[0-9a-f]+:   0f 80 90 90 90 90[      ]+jol[  ]+909094e2 <foo\+0x909094e2>
>>>>>>>>>>> + *[0-9a-f]+:   0f 81 90 90 90 90[      ]+jnol[         ]+909094e8 <foo\+0x909094e8>
>>>>>>>>>>> + *[0-9a-f]+:   0f 82 90 90 90 90[      ]+jbl[  ]+909094ee <foo\+0x909094ee>
>>>>>>>>>>> + *[0-9a-f]+:   0f 83 90 90 90 90[      ]+jael[         ]+909094f4 <foo\+0x909094f4>
>>>>>>>>>>> + *[0-9a-f]+:   0f 84 90 90 90 90[      ]+jel[  ]+909094fa <foo\+0x909094fa>
>>>>>>>>>>> + *[0-9a-f]+:   0f 85 90 90 90 90[      ]+jnel[         ]+90909500 <foo\+0x90909500>
>>>>>>>>>>> + *[0-9a-f]+:   0f 86 90 90 90 90[      ]+jbel[         ]+90909506 <foo\+0x90909506>
>>>>>>>>>>> + *[0-9a-f]+:   0f 87 90 90 90 90[      ]+jal[  ]+9090950c <foo\+0x9090950c>
>>>>>>>>>>> + *[0-9a-f]+:   0f 88 90 90 90 90[      ]+jsl[  ]+90909512 <foo\+0x90909512>
>>>>>>>>>>> + *[0-9a-f]+:   0f 89 90 90 90 90[      ]+jnsl[         ]+90909518 <foo\+0x90909518>
>>>>>>>>>>> + *[0-9a-f]+:   0f 8a 90 90 90 90[      ]+jpl[  ]+9090951e <foo\+0x9090951e>
>>>>>>>>>>> + *[0-9a-f]+:   0f 8b 90 90 90 90[      ]+jnpl[         ]+90909524 <foo\+0x90909524>
>>>>>>>>>>> + *[0-9a-f]+:   0f 8c 90 90 90 90[      ]+jll[  ]+9090952a <foo\+0x9090952a>
>>>>>>>>>>> + *[0-9a-f]+:   0f 8d 90 90 90 90[      ]+jgel[         ]+90909530 <foo\+0x90909530>
>>>>>>>>>>> + *[0-9a-f]+:   0f 8e 90 90 90 90[      ]+jlel[         ]+90909536 <foo\+0x90909536>
>>>>>>>>>>> + *[0-9a-f]+:   0f 8f 90 90 90 90[      ]+jgl[  ]+9090953c <foo\+0x9090953c>
>>>>>>>>>>>   *[0-9a-f]+:   0f 90 80 90 90 90 90[   ]+seto[         ]+-0x6f6f6f70\(%eax\)
>>>>>>>>>>>   *[0-9a-f]+:   0f 91 80 90 90 90 90[   ]+setno[        ]+-0x6f6f6f70\(%eax\)
>>>>>>>>>>>   *[0-9a-f]+:   0f 92 80 90 90 90 90[   ]+setb[         ]+-0x6f6f6f70\(%eax\)
>>>>>>>>>>
>>>>>>>>>> There are instructions like jl and jnl.  Will assembler properly
>>>>>>>>>> handle `l' as a suffix here?
>>>>>>>>>
>>>>>>>>> j<cc> as well as jmp (with displacement) have No_lSuf set, so won't
>>>>>>>>> accept l suffixes (same for the w one). Nevertheless already prior
>>>>>>>>> to this change the disassembler will produce "jmpl" (and "jmpw").
>>>>>>>>> IOW a disagreement between disassembler and assembler already exists.
>>>>>>>
>>>>>>> We should avoid it as much as we can.
>>>>>>>
>>>>>>>>>> If we do need to distinguish them, can we generate {disp32} pseudo prefix
>>>>>>>>>> instead?
>>>>>>>>>
>>>>>>>>> We could, but then consistently for Jcc, JMP, and CALL. But how is
>>>>>>>>> emitting a pseudo-prefix in line with the name of the controlling
>>>>>>>>> command line option "-Msuffix"?
>>>>>>>
>>>>>>> That works for me.
>>>>>>
>>>>>> Okay, but you didn't answer my question.
>>>>>
>>>>> We can always generate pseudo prefix.
>>>>
>>>> But my question was towards "prefix" != "suffix".
>>>>
>>>>>>>> FAOD to achieve consistency I think the preferred route would then
>>>>>>>> be for the assembler to accept l and w suffixes for Jcc and JMP.
>>>>>>>> Not sure though what fallout this may mean.
>>>>>>>
>>>>>>> That could be quite messy.
>>>>>>
>>>>>> I guess I'd still prefer to try that first, and resort to the
>>>>>> alternative only if it really turns out to be.
>>>>>
>>>>> Please give it a try.
>>>>>
>>>>>>>  I think pseudo prefix is much less invasive.
>>>>>>
>>>>>> Maybe to the disassembler; I'm less sure about the testsuites of both
>>>>>> gas and ld.
>>>>>>
>>>>>> Actually - if we were to go this route, then why pseudo prefixes in
>>>>>> the first place? We already emit data{16,32} prefixes for other
>>>>>> reasons, so we could do so here as well in place of the suffixes.
>>>>>
>>>>> There is data32 prefix.  But there is no disp32 prefix.  I call it
>>>>> pseudo prefix.
>>>>
>>>> But the prefix _is_ a data size override, i.e. data{16,32} _is_ what
>>>> we want to express. My question here is why to invent yet another
>>>> prefix when we already have a suitable one.
>>>>
>>>
>>> We should use a prefix which can be processed by assembler.
>>> {dispXX} tells assembler to prefer a specific displacement size.
>>
>> Funny you should say this: There's no {disp16}. And the assembler
>> understands (to a certain degree, but this got improved recently)
>> data16 / data32. IOW yet another argument towards data16 / data32,
>> if we are to go this route in the first place (which right now is
>> only the fallback in case allowing the suffixes in the assembler
>> causes overly difficult problems, and which continues to pend your
>> response to me pointing out that emitting _prefixes_ is not in
>> line with _suffix_ in -Msuffix).
>>
> 
> If needed, we can add {disp16}.

I've been having this on my todo list for quite some time already,
but not because of the considerations here.

What I don't understand is why you _still_ don't answer the question
raised: Do you really think -Msuffix should lead to output like

	{disp32} jmp label

? I could see {dispNN} to have advantages over data16/data32, but
that's independent of -Msuffix, i.e. the prefix would be printed
only if a non-default displacement width was in effect. Printing
{dispNN} for all JMP / CALL / Jcc in suffix-always mode seems like
an extreme form of clutter to me. (As a follow-on to this, we then
ought to also always print {dispNN} for memory accesses with a
word-sized displacement [albeit presumably limited to the no-base-
and-no-index case] - yet more clutter.)

Jan


More information about the Binutils mailing list