This is the mail archive of the cygwin-patches mailing list for the Cygwin project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [PATCH v2 1/4] Cygwin: console: Add workaround for broken IL/DL in xterm mode.

From: Takashi Yano <takashi dot yano at nifty dot ne dot jp>
To: cygwin-patches at cygwin dot com
Date: Mon, 2 Mar 2020 09:44:59 +0900
Subject: Re: [PATCH v2 1/4] Cygwin: console: Add workaround for broken IL/DL in xterm mode.
Dkim-filter: OpenDKIM Filter v2.10.3 conssluserg-03.nifty.com 0220itDJ029519
References: <20200226153302.584-1-takashi.yano@nifty.ne.jp> <20200226153302.584-2-takashi.yano@nifty.ne.jp> <05cca441-eb83-4600-90f3-bf82ec7a0190@dronecode.org.uk> <20200228111409.149929dcf710cabf99a879b3@nifty.ne.jp> <20200228133122.GG4045@calimero.vinschen.de> <cc657f02-e3a4-1880-34a2-dcf04d6e902a@t-online.de> <20200301153342.cbc54c2b14687b71679f993a@nifty.ne.jp> <49f801fe-3c86-d2ab-1ea2-b7eea7ad7c5f@t-online.de>

On Sun, 1 Mar 2020 14:56:31 +0100
Hans-Bernhard Bröker wrote:
> Am 01.03.2020 um 07:33 schrieb Takashi Yano:
> 
> > However, from the view point of performance, just inline
> > static function is better. 
> 
> I don't see how that could be the case.  Inline methods of a static C++ 
> object should not suffer any perfomance penalty compared to inline 
> functions operating on static variables.
> 
> > Attached code measures the
> > performance of access speed for wpbuf.
> > I compiled it by g++ 7.4.0 with -O2 option.
> > 
> > The result is as follows.
> > 
> > Total1: 2.315627 second
> > Total2: 1.588511 second
> > Total3: 1.571572 second
> 
> Strange.  The result here (with GCC 9.2) is rather different:
> 
> $ g++ -O2 -o tt wpbuf-bench.cc && ./tt
> Total1: 0.753815 second
> Total2: 0.757444 second
> Total3: 1.217352 second
> 
> And on inspection, all three bench*() functions do appear to have 
> exactly the same machine code, too.  They may be inlined and mixed into 
> main() somewhat differently, though.  That might explain the difference 
> more readily than any actual difference in speed between the three 
> implementations.

I looked into the code generated by g++ 7.4.0 with -O2. The codes
generated are different.

With 32bit compiler,

bench1():
L3:
    cmpl    $255, %edx
    jg  L2
    movb    $65, _wpbuf(%edx)
    movl    $1, %ecx
    addl    $1, %edx
L2:
    subl    $1, %eax
    [...]

bench2(), bench3():
L22:
    cmpl    $255, %edx
    jg  L21
    movb    $65, _wpbuf2(%edx)
    addl    $1, %edx
L21:
    subl    $1, %eax
    [...]

With 64bit compiler,

bench1():
.L3:
    cmpl    $255, %edx
    jg  .L2
    movslq  %edx, %rcx
    addl    $1, %edx
    movb    $65, (%r8,%rcx)
    movl    $1, %ecx
.L2:
    subl    $1, %eax
    [...]

bench2(), bench3():
.L15:
    cmpl    $255, %edx
    jg  .L14
    movslq  %edx, %rcx
    addl    $1, %edx
    movb    $65, (%r8,%rcx)
.L14:
    subl    $1, %eax
    [...]

Obviously, code for bench2() and bench3() is shorter than
bench1().

However, with g++ 9.2.0 with -O2,

bench1(), bench2(), bench3():
L3:
    cmpl    $255, %edx
    jg  L2
    movb    $65, _wpbuf(%edx)
    addl    $1, %edx
L2:
    subl    $1, %eax
    [...]

all the codes are exactly the same, as you mentioned.

So, if we assume g++ 9.2.0, please forget the previous remarks
about speed.

-- 
Takashi Yano <takashi.yano@nifty.ne.jp>

References:
- [PATCH v2 0/4] Modify handling of several ESC sequences in xterm mode.
  - From: Takashi Yano
- [PATCH v2 1/4] Cygwin: console: Add workaround for broken IL/DL in xterm mode.
  - From: Takashi Yano
- Re: [PATCH v2 1/4] Cygwin: console: Add workaround for broken IL/DL in xterm mode.
  - From: Jon Turney
- Re: [PATCH v2 1/4] Cygwin: console: Add workaround for broken IL/DL in xterm mode.
  - From: Takashi Yano
- Re: [PATCH v2 1/4] Cygwin: console: Add workaround for broken IL/DL in xterm mode.
  - From: Corinna Vinschen
- Re: [PATCH v2 1/4] Cygwin: console: Add workaround for broken IL/DL in xterm mode.
  - From: Hans-Bernhard Bröker
- Re: [PATCH v2 1/4] Cygwin: console: Add workaround for broken IL/DL in xterm mode.
  - From: Takashi Yano
- Re: [PATCH v2 1/4] Cygwin: console: Add workaround for broken IL/DL in xterm mode.
  - From: Hans-Bernhard Bröker

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]