Re: Automated testing for users' LilyPond collections with new development versions
From: Jean Abou Samra
Subject: Re: Automated testing for users' LilyPond collections with new development versions
Date: Wed, 30 Nov 2022 23:44:02 +0100
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Thunderbird/102.5.0
On 28/11/2022 at 23:49, Karlin High wrote:
This message intrigued me:
<https://lists.gnu.org/archive/html/lilypond-devel/2022-11/msg00222.html>
In it, Eric Benson reported a setup that allows testing new versions
of LilyPond on a sizable body of work in a somewhat automated fashion.
Now, could automation like that also make use of the infrastructure
for LilyPond's regression tests?
<http://lilypond.org/doc/v2.23/Documentation/contributor/regtest-comparison>
What effort/value would there be in making an enhanced convert-ly tool
that tests a new version of LilyPond on a user's entire collection of
work, reporting differences between old and new versions in
performance and output?
Enabling something like this:
* A new release of LilyPond comes out. Please test.
* Advanced users with large collections of LilyPond files do the
equivalent of "make test-baseline," but for their collection instead
of LilyPond's regtests. Elapsed time is recorded, along with CPU and
RAM information as appropriate.
* The new LilyPond gets installed.
* An upgrade script runs convert-ly on the collection, first offering
a backup via convert-ly options or tarball-style (see the sketch after
this list).
* The equivalent of "make check" runs.
* A report is generated, optionally as email to lilypond-devel, with a
summary of regression test differences and old-vs-new elapsed time.
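To make the idea concrete, here is a minimal, hypothetical sketch in
Python of what such an upgrade-and-retime script might look like. The
binary names (lilypond-2.24.0, lilypond-2.25.0) and the ~/scores path
are placeholders, not anything LilyPond ships; the sketch only backs up
the sources, runs convert-ly, and compares total compile time, leaving
aside the harder image-by-image comparison that "make check" performs.

#!/usr/bin/env python3
# Hypothetical sketch of the proposed workflow: back up a collection,
# run convert-ly, compile with an old and a new LilyPond, compare timings.
# Binary names and paths below are placeholders for illustration only.
import shutil
import subprocess
import time
from pathlib import Path

COLLECTION = Path("~/scores").expanduser()
BACKUP = COLLECTION.with_name(COLLECTION.name + ".bak")
OLD_LILYPOND = "lilypond-2.24.0"   # assumed install names
NEW_LILYPOND = "lilypond-2.25.0"

def compile_all(binary: str, source_dir: Path) -> float:
    """Compile every .ly file with the given binary; return elapsed seconds."""
    start = time.monotonic()
    for ly in sorted(source_dir.rglob("*.ly")):
        subprocess.run([binary, "-o", str(ly.parent), str(ly)], check=False)
    return time.monotonic() - start

# 1. Baseline: copy the sources aside and time the old version on the copy.
shutil.copytree(COLLECTION, BACKUP, dirs_exist_ok=True)
old_time = compile_all(OLD_LILYPOND, BACKUP)

# 2. Upgrade the working copy in place with convert-ly.
for ly in sorted(COLLECTION.rglob("*.ly")):
    subprocess.run(["convert-ly", "-e", str(ly)], check=False)

# 3. Time the new version and report old-vs-new elapsed time.
new_time = compile_all(NEW_LILYPOND, COLLECTION)
print(f"old: {old_time:.1f}s  new: {new_time:.1f}s")

A real tool would presumably reuse the regression-test comparison
machinery to report visual differences in the output as well, rather
than just timings.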
Ideally, this could quickly produce lots of good testing information for
development versions of LilyPond, in a way that encourages user
participation.
How much work: I don't know. Nonzero, probably not big.
Keep in mind, however, that changes which generate lots of small
differences land on a regular basis, so you are likely to get mostly
noise from a comparison like this. You can only really do it between
consecutive unstable releases: if you compare the last stable release
with the current unstable release (assuming a few unstable releases
have passed since the stable one), the noise will likely be
overwhelming. For this reason, the testers need to be really dedicated.
Best,
Jean