This is the mail archive of the
libc-alpha@sourceware.org
mailing list for the glibc project.
Re: [PATCH v2] aarch64: Optimized memset for Kunpeng processor.
- From: "Zhangxuelei (Derek)" <zhangxuelei4 at huawei dot com>
- To: Wilco Dijkstra <Wilco dot Dijkstra at arm dot com>, "libc-alpha at sourceware dot org" <libc-alpha at sourceware dot org>, "siddhesh at gotplt dot org" <siddhesh at gotplt dot org>, Szabolcs Nagy <Szabolcs dot Nagy at arm dot com>, jiangyikun <jiangyikun at huawei dot com>, "yikunkero at gmail dot com" <yikunkero at gmail dot com>
- Cc: nd <nd at arm dot com>
- Date: Mon, 4 Nov 2019 08:54:04 +0000
- Subject: Re: [PATCH v2] aarch64: Optimized memset for Kunpeng processor.
Hi Wilco,
> This won't work, when you align like that you need an extra
> unaligned store at the end.
> The easiest solution is not to align and just keep the stp
> and str using dstin.
>
> +2: stp q0, q0, [dst, 32]
> + stp q0, q0, [dstend, -32]
> + ret
>
> Both of these must use dstend since this is the 1-64 bytes for
> the tail part.
Sorry for the mistakes; I have corrected them in the new patch.
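
For anyone following the thread, the point about the tail can be sketched in C. This is only an illustration under stated assumptions, not the code in the patch: memset_sketch and store16 are hypothetical names, store16 stands in for a single 16-byte "str q0", and the 64-byte loop and the exact offsets are assumed for the example. The idea it shows is that once dst has been aligned, the last 1-64 bytes are written with stores addressed backwards from dstend, which may overlap bytes that were already set.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for one 16-byte vector store ("str q0, [p]"). */
static void store16(unsigned char *p, int c)
{
    unsigned char v[16];
    for (int i = 0; i < 16; i++)
        v[i] = (unsigned char)c;
    memcpy(p, v, sizeof v);
}

/* Hypothetical sketch: set count bytes at dstin to c. */
void *memset_sketch(void *dstin, int c, size_t count)
{
    unsigned char *start = dstin;
    unsigned char *dstend = start + count;

    /* Small sizes: plain byte loop, so the paths below may assume
       count >= 64. */
    if (count < 64) {
        for (size_t i = 0; i < count; i++)
            start[i] = (unsigned char)c;
        return dstin;
    }

    /* Head: one unaligned 16-byte store at dstin, then advance dst to
       the next 16-byte boundary (1..16 bytes past dstin). */
    store16(start, c);
    unsigned char *dst = start + (16 - ((uintptr_t)start & 15));

    /* Bulk: aligned 64-byte blocks while more than 64 bytes remain. */
    while ((size_t)(dstend - dst) > 64) {
        store16(dst, c);
        store16(dst + 16, c);
        store16(dst + 32, c);
        store16(dst + 48, c);
        dst += 64;
    }

    /* Tail: 1..64 bytes remain. All four stores are addressed from
       dstend, not from the aligned dst, so the final unaligned bytes
       are covered; they may overlap bytes already written. */
    assert(dstend - dst >= 1 && dstend - dst <= 64);
    store16(dstend - 64, c);
    store16(dstend - 48, c);
    store16(dstend - 32, c);
    store16(dstend - 16, c);

    return dstin;
}
```

Addressing the tail from the aligned dst instead would leave up to 15 bytes before dstend unwritten, or write past dstend, depending on the count.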
Cheers
Xuelei