Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add new daffodil-tdml-junit library #1389

Merged
merged 2 commits into from
Jan 13, 2025

Conversation

stevedlawrence
Copy link
Member

There is currently a large amount of boilerplate and duplication associated with defining TDML tests with JUnit. To fix this, this adds two new traits that can be mixed in to the companion object and class associated with a JUnit test suite to greatly simplify test suite definitions. Test suites can now be defined with something like this:

object MyTests extends TdmlSuite {
  val tdmlResource = "/resource/path/to/tests.tdml"
}

class MyTests extends TdmlTests {
  val tdmlSuite = MyTests

  @Test def test1 = test
  @Test def test2 = test
  @Test def test3 = test
}

To allow use with older versions of Daffodil, this defaults to using creating a Runner with only the path parameter. This is sufficient for all known schema projects, since most Runner parameters can either be set directly in a TDML file (which is preferred), or are intended for use by Daffodil tests and should not be changed by schema projects.

The sbt-plugin project will eventually be updated to include this as a dependency for DFDL Schema projects, but until that change is released, projects that want to use this new syntax should add the following to dependency to build.sbt file, in addition to any existing daffodil dependencies:

libraryDependencies ++= Seq(
  "org.apache.daffodil" %% "daffodil-tdml-junit" % "<version>" % "test" intransitive()
)

Where "" is hardcoded to the first version that daffodil-tdml-junit is released for. The new library works with older versions of Daffodil, so by hardcoding the version and setting it as an intransitive dependency, it allows the library the be used with TDML tests even when testing older daffodil versions that do not publish the library. As we no longer need to support older versions of Daffodil and sbt-daffodil supports the new library, this will not be needed.

DAFFODIL-2958

@stevedlawrence stevedlawrence force-pushed the daffodil-2958-tdml-junit branch from f98a49b to a2c2eee Compare December 10, 2024 16:09
@stevedlawrence
Copy link
Member Author

Note, I've converted about 75% of existing TDML tests in Daffodil to use this so I'm pretty confident this API covers our needs. All the DFDL Schema projects are much simplier that Daffodil tests, so I'm pretty confident that if we can update all Daffodil TDML test to use this, updating DFDL Schema projects will be straightforward.

Once this API is deemed suitable and is merged I'll create a separate PR that updates everything to use the new API.

Copy link
Contributor

@mbeckerle mbeckerle left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1

@stevedlawrence
Copy link
Member Author

I have almost finished converting tests to the new API, and the only issue I've found so far is that it can't be used in Java. Scala adds an abstract equality function that cannot be implemented in a Java class due to the mangled names Scala uses.

We might be able to support Java if TdmlSuite and TdmlTests were written in Java--is that something worth supporting?

@stevedlawrence
Copy link
Member Author

In fact, implementing it in Java might clean things up more by removing the need for the companion object. I think the only thing that must be static is the @AfterClass function and a Java abstract class could implement that and Scala will implicitly create the companion object.

@stevedlawrence
Copy link
Member Author

Actually, Java limitations of static methods makes it pretty difficult to make a reasonable API for this. I think we'll just have to stick with Scala-only support for now.

/**
* Mixin for a JUnit test suite companion class
*/
trait TdmlTests {
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

While I am making modifications to all our test files, I wonder if it's worth considering how we identify tests as broken?

Currently we just comment them out, e.g.

//@Test def foo = test

One alternative to this might be to use the @Ignore annotation, and optionally provide a bug number in the description, e.g.

@Ignore("DAFFODIL-1234") @Test def foo = test

Benefits:

  1. It ensures the "foo" function is valid and isn't duplicated or something.
  2. When running sbt test, it adds a line for each ignored test and at the end of testing it shows the total count of ignored tests. So there's a metric of how many tests are failing. However, this does make the test output a bit more verbose and if you're not aware of why a test might be ignored it might be confusing to see a bunch of ignored tests.
  3. There is a clear place for bug number information, a convention that we could enforce

Here's what it ignored tests look like in SBT:

image

Another approach is to add a new "knownToFail" method, which could also take an optional bug number, e.g.

@Test def foo = knownToFail("DAFFODIL-123")

This has similar benefits to @Ignore, but the knownToFail method could make sure the test actually exists in a TDML file, so there aren't concerns of accidentally moving/deleting a failing test. A downside of this approach is the new function would need to cause an assumption failure to prevent it from actually causing a failure, and in SBT these assumption failures look kindof fatal even though they really aren't. So this might be confusing to new users to see a bunch of red "assumption failed" messages when running tests. We could make it so the error message would include something like "known to fail" but it still could be a bit intimidating.

For reference, here's what this looks like in SBT:

image

Note that @Ignore'd tests show up in stats as "ignored", but "assumption failed" show up as skipped.

Another option, similar to the above, is to add a new attribute to TDML test cases, e.g.

<tdml:parserTestCase ... knownToFail="DAFFODIL-123">

If a test were run with this attribute set it would throw an exception that would be turned into an assumption failure. So it gives the same result as the "knownToFail" method approach, but the Scala files for a failed test are exactly the same as for passing tests. This makes it really easy to generate test scala files from a TDML file since there are no differences between working or known-to-fail test. But otherwise this is the same as the knownToFail method, including the same drawbacks about verbosity. It's just now controlled by the TDML file and the TDML Runner instead of the scala file.

Or, we just keep it as is. We've gone this far by just commenting out tests and that's been sufficient. Just thought this was a good time to consider alternatives since I'm already changing all these files and it potentially affects the TdmlTests API.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I really like the idea of adding knownToFail='DAFFODIL-123 to the TDML files and being able to just generate the scala driver files which then become something of an IDE-support tool to enable running individual tests conveniently from an IDE.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

So far knownToFail is pretty straightforward to implement. I did find an another drackback with it--you can't use it with older versions of the TDML runner since it errors if it sees an attribute it doesn't understand. So if we do add support for this you can't use it if you want to maintain compatibility with older daffodil versions.

This is probably okay? We can still use it internally for Daffodil tests and most DFDL schema projects can probably either be update to the latest version of Daffodil, or if they do need to maintain compatibility with older versions they can just stick to commenting out tests (or using @Ignore).

Though, my biggest concern is still that testing Daffodil with this change shows 100+ red "assumption violated" messages, making it look like a lot of things are broken, even though we have 4000+ working tests. I imagine that noise will also make it hard to more difficult to see actual errors among the sea of red messages.

Makes me consider that using @ignore or just keeping with commenting out is the best method.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I vote for the '@ignore' annnotation.

@stevedlawrence stevedlawrence force-pushed the daffodil-2958-tdml-junit branch 2 times, most recently from 29bc34b to 67a5dd4 Compare December 19, 2024 16:08
@stevedlawrence
Copy link
Member Author

I've rebased to the latest master, and added a second commit that converts all of our TDML tests to use the new format. It's big change but makes the tests much less verbose. It's getting difficult to deal with merge conflicts each time a new commit adds tests, so if this second commit could get two +1's I would prefer if this be merged before anything else.

Copy link
Contributor

@mbeckerle mbeckerle left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1

Copy link
Contributor

@jadams-tresys jadams-tresys left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1

Definitely cuts down on the redundancy when declaring tests in the scala files. The "test_name = test" thing will probably need some explanation if someone new tries to add a test though.

There is currently a large amount of boilerplate and duplication
associated with defining TDML tests with JUnit. To fix this, this adds
two new traits that can be mixed in to the companion object and class
associated with a JUnit test suite to greatly simplify test suite
definitions. Test suites can now be defined with something like this:

    object MyTests extends TdmlSuite {
      val tdmlResource = "/resource/path/to/tests.tdml"
    }

    class MyTests extends TdmlTests {
      val tdmlSuite = MyTests

      @test def test1 = test
      @test def test2 = test
      @test def test3 = test
    }

To allow use with older versions of Daffodil, this defaults to using
creating a Runner with only the path parameter. This is sufficient for
all known schema projects, since most Runner parameters can either be
set directly in a TDML file (which is preferred), or are intended for
use by Daffodil tests and should not be changed by schema projects.

The sbt-plugin project will eventually be updated to include this as a
dependency for DFDL Schema projects, but until that change is released,
projects that want to use this new syntax should add the following to
dependency to build.sbt file, in addition to any existing daffodil
dependencies:

    libraryDependencies ++= Seq(
      "org.apache.daffodil" %% "daffodil-tdml-junit" % "<version>" % "test" intransitive()
    )

Where "<version>" is hardcoded to the first version that
daffodil-tdml-junit is released for. The new library works with older
versions of Daffodil, so by hardcoding the version and setting it as an
intransitive dependency, it allows the library the be used with TDML
tests even when testing older daffodil versions that do not publish the
library. As we no longer need to support older versions of Daffodil and
sbt-daffodil supports the new library, this will not be needed.

DAFFODIL-2958
@stevedlawrence stevedlawrence force-pushed the daffodil-2958-tdml-junit branch from 1b3b57c to e62d8f9 Compare January 13, 2025 14:56
The changes neccessary to use the new API are:

1. Modify imports
2. Extend the new TdmlSuite/TdmlTests traits
3. Split test suites with multiple runners into separate test
   suites--this API only supports a single runner per suite
4. Override createRunner() for suites that need tweaks to the runner,
   usually disabling validate
5. Change names of all test to match the TDML test name
6. Change tests to run by calling the "test" function
7. Change TDML test names that contain characger that are not valid in
   Scala function names, needed since function names are now used to
   find the TDML name
8. Switch to using @ignore to disable tests that are known to fail

Additionally, a few very old tests uses TDML XML embedded in the Scala
source. These were moved into TDML files.

DAFFODIL-2958
@stevedlawrence stevedlawrence force-pushed the daffodil-2958-tdml-junit branch from e62d8f9 to 011ee94 Compare January 13, 2025 15:10
@stevedlawrence stevedlawrence merged commit 3e0e65b into apache:main Jan 13, 2025
10 of 12 checks passed
@stevedlawrence stevedlawrence deleted the daffodil-2958-tdml-junit branch January 13, 2025 15:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants