This is the mail archive of the binutils@sourceware.org mailing list for the binutils project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [PATCH] x86: Encode 256-bit/512-bit VEX/EVEX insns with 128-bit VEX

From: "Jan Beulich" <JBeulich at suse dot com>
To: "H.J. Lu" <hjl dot tools at gmail dot com>
Cc: <binutils at sourceware dot org>
Date: Mon, 18 Mar 2019 05:42:15 -0600
Subject: Re: [PATCH] x86: Encode 256-bit/512-bit VEX/EVEX insns with 128-bit VEX
References: <20190317100804.22160-1-hjl.tools@gmail.com>

>>> On 17.03.19 at 11:08, <hjl.tools@gmail.com> wrote:
> --- a/gas/config/tc-i386.c
> +++ b/gas/config/tc-i386.c
> @@ -3977,8 +3977,7 @@ optimize_encoding (void)
>  	    }
>  	}
>      }
> -  else if (optimize > 1
> -	   && i.reg_operands == 3
> +  else if (i.reg_operands == 3
>  	   && i.op[0].regs == i.op[1].regs
>  	   && !i.types[2].bitfield.xmmword
>  	   && (i.tm.opcode_modifier.vex
> @@ -4009,15 +4008,15 @@ optimize_encoding (void)
>  		|| i.tm.base_opcode == 0x6647)
>  	       && i.tm.extension_opcode == None))
>      {
> -      /* Optimize: -O2:
> +      /* Optimize: -O1:
>  	   VOP, one of vandnps, vandnpd, vxorps, vxorpd, vpsubb, vpsubd,
>  	   vpsubq and vpsubw:
>  	     EVEX VOP %zmmM, %zmmM, %zmmN
>  	       -> VEX VOP %xmmM, %xmmM, %xmmN (M and N < 16)
> -	       -> EVEX VOP %xmmM, %xmmM, %xmmN (M || N >= 16)
> +	       -> EVEX VOP %xmmM, %xmmM, %xmmN (M || N >= 16) (-O2)
>  	     EVEX VOP %ymmM, %ymmM, %ymmN
>  	       -> VEX VOP %xmmM, %xmmM, %xmmN (M and N < 16)
> -	       -> EVEX VOP %xmmM, %xmmM, %xmmN (M || N >= 16)
> +	       -> EVEX VOP %xmmM, %xmmM, %xmmN (M || N >= 16) (-O2)
>  	     VEX VOP %ymmM, %ymmM, %ymmN
>  	       -> VEX VOP %xmmM, %xmmM, %xmmN
>  	   VOP, one of vpandn and vpxor:
> @@ -4026,17 +4025,17 @@ optimize_encoding (void)
>  	   VOP, one of vpandnd and vpandnq:
>  	     EVEX VOP %zmmM, %zmmM, %zmmN
>  	       -> VEX vpandn %xmmM, %xmmM, %xmmN (M and N < 16)
> -	       -> EVEX VOP %xmmM, %xmmM, %xmmN (M || N >= 16)
> +	       -> EVEX VOP %xmmM, %xmmM, %xmmN (M || N >= 16) (-O2)
>  	     EVEX VOP %ymmM, %ymmM, %ymmN
>  	       -> VEX vpandn %xmmM, %xmmM, %xmmN (M and N < 16)
> -	       -> EVEX VOP %xmmM, %xmmM, %xmmN (M || N >= 16)
> +	       -> EVEX VOP %xmmM, %xmmM, %xmmN (M || N >= 16) (-O2)
>  	   VOP, one of vpxord and vpxorq:
>  	     EVEX VOP %zmmM, %zmmM, %zmmN
>  	       -> VEX vpxor %xmmM, %xmmM, %xmmN (M and N < 16)
> -	       -> EVEX VOP %xmmM, %xmmM, %xmmN (M || N >= 16)
> +	       -> EVEX VOP %xmmM, %xmmM, %xmmN (M || N >= 16) (-O2)
>  	     EVEX VOP %ymmM, %ymmM, %ymmN
>  	       -> VEX vpxor %xmmM, %xmmM, %xmmN (M and N < 16)
> -	       -> EVEX VOP %xmmM, %xmmM, %xmmN (M || N >= 16)
> +	       -> EVEX VOP %xmmM, %xmmM, %xmmN (M || N >= 16) (-O2)
>  	   VOP, one of kxord and kxorq:
>  	     VEX VOP %kM, %kM, %kN
>  	       -> VEX kxorw %kM, %kM, %kN

I disagree - as per my earlier reply to another patch there shouldn't
be any use of cpu_arch_flags here, and -O2 should not all of the
sudden imply AVX512VL to be available when it wasn't explicitly
enabled. Effectively this will make it impossible to add any other,
ISA-independent optimization to -O2 later on.

Jan

Follow-Ups:
- Re: [PATCH] x86: Encode 256-bit/512-bit VEX/EVEX insns with 128-bit VEX
  - From: H.J. Lu

References:
- [PATCH] x86: Encode 256-bit/512-bit VEX/EVEX insns with 128-bit VEX
  - From: H.J. Lu

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]