
Re: Alternative nSelectors patch (Was: bzip2 1.0.7 released)




Hi Mark,

This seems to me like a better patch than my proposal, so I retract my
proposal and vote for this one instead.

The one thing that concerned me was that it would be a disaster -- having
ignored all selectors above 18002 -- if subsequent decoding actually *did*
somehow manage to read more than 18002 selectors out of s->selectorMtf,
because we'd be reading uninitialised memory.  But it seems to me this can't
happen because, after the selector-reading loop, you added

+      if (nSelectors > BZ_MAX_SELECTORS)
+        nSelectors = BZ_MAX_SELECTORS;

and the following loop:

      /*--- Undo the MTF values for the selectors. ---*/
      ...

is the only place that reads s->selectorMtf, and then only for the range
0 .. nSelectors-1.
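
To make the safety argument concrete, here is a small standalone model of
the patched logic (a sketch only: read_selectors and the stand-in MTF value
are mine, not the actual decompress.c code, and the constant mirrors
bzip2's 2 + 900000/50 = 18002):

    /* Standalone model of the patched selector-reading loop.
       Assumption: names mimic bzip2's decompress.c; the MTF value here
       is a stand-in for what would be decoded from the bit stream. */
    #include <assert.h>
    #include <stdio.h>

    #define BZ_MAX_SELECTORS 18002  /* 2 + 900000/50 */

    /* Accept a stream that *claims* `claimed` selectors, but store only
       the first BZ_MAX_SELECTORS; excess values are read and discarded
       so the bit stream stays in sync.  Returns the clamped count. */
    static int read_selectors(int claimed, unsigned char *selectorMtf)
    {
        int nSelectors = claimed;
        for (int i = 0; i < nSelectors; i++) {
            unsigned char j = (unsigned char)(i % 6); /* stand-in MTF value */
            if (i < BZ_MAX_SELECTORS)
                selectorMtf[i] = j;   /* only in-bounds entries are stored */
            /* entries at i >= BZ_MAX_SELECTORS are ignored */
        }
        /* The clamp from the patch: later loops read
           selectorMtf[0 .. nSelectors-1], so nSelectors must not exceed
           what was actually stored. */
        if (nSelectors > BZ_MAX_SELECTORS)
            nSelectors = BZ_MAX_SELECTORS;
        return nSelectors;
    }

    int main(void)
    {
        static unsigned char selectorMtf[BZ_MAX_SELECTORS];

        /* A stream claiming more selectors than the format allows is
           clamped, so the undo-MTF loop below never reads
           uninitialised memory. */
        int n = read_selectors(BZ_MAX_SELECTORS + 761, selectorMtf);
        assert(n == BZ_MAX_SELECTORS);

        /* The only reader of selectorMtf: range 0 .. nSelectors-1. */
        for (int i = 0; i < n; i++)
            assert(selectorMtf[i] == i % 6);

        printf("clamped nSelectors = %d\n", n);
        return 0;
    }

Because the clamp happens before the undo-MTF loop, every index that loop
touches was written during the reading loop, which is the whole point.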

So it seems good to me.  Does this sync with your analysis?

J


On 01/07/2019 01:36, Mark Wielaard wrote:
Hi,

On Fri, 2019-06-28 at 13:10 +0200, Mark Wielaard wrote:
It seems to me to be important to now split BZ_MAX_SELECTORS into these two
parts so as to make it clear to everybody that we're accepting (decompressing)
a slightly larger set of inputs than we create (a la that old saying about
network protocol implementations), so as to tolerate other compressors.

That seems good. The attached patch does this and makes it possible to
decode the problematic bz2 file.

Sorry, it is a bit too late here to properly document this patch and
explain why I think it is a better one than the "split-max-selectors"
fix. But hopefully the new testsuite example and the comment in the
patch make clear what my thinking is.

This resolves both the issue with the reported large file and the new
testsuite file (lbzip2/32767.bz2). The whole testsuite passes now,
even under valgrind and with gcc -fsanitize=undefined.

Comments on the patch idea more than welcome.

Thanks,

Mark