Evaluation of our current QA Pipeline

jayasanka · November 15, 2022, 2:50pm

Over the last week, I evaluated our automation pipeline, technical debt, and opportunities. I focused on E2E tests because we already have mechanisms in place for unit and integration testing, but E2E tests aren’t well established yet. Another reason is that, given the OpenMRS architecture, E2E tests provide more accurate feedback than unit and integration tests because they run in a full real-life environment rather than a mock environment.

I reviewed our current approach as well as the documentation of other product teams on E2E test automation, such as Hackney, Wix, and Single SPA, to determine what might be useful for the OMRS application. There, I discovered the following flaws in the current workflow.

E2E tests are not a part of the devs’ workflow.

Developer engagement is extremely low with the current setup. One reason is that we don’t give developers any immediate feedback, so they have no idea when their changes break tests, i.e. break the product. There are countless cases where we have identified critical breakdowns in components weeks after PRs have been merged.

Unclear Guidance

Best practices, and our expectations around testing, are not clearly documented.

Cucumber is slowing down O3 test automation

Gherkin syntax is really good when a less technical team implements the tests. A team of developers can create a library of steps and less technical users can then write the scenarios, but that is not how we work. Everything we’re getting from Cucumber can be achieved with a good naming convention for tests and by organising tests properly into suites. Making sure that non-technical users can read a test report and understand which parts failed is a crucial component of effective test design. In O3 we use a separate library as a bridge to integrate Cucumber with Cypress. To keep that bridge working, we are sacrificing some Cypress functionality, especially as Cypress upgrades, and those libraries are not well maintained.
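To illustrate the naming-convention point, here is a minimal sketch of how suite and test names alone can produce a report that reads like a list of scenarios. The tiny runner (`runSuite`, `formatReport`) and the example suite are hypothetical and exist only for illustration; a real setup would rely on the grouping and reporting of whichever test runner is chosen.

```typescript
// Hypothetical miniature runner: just enough to show that descriptive suite
// and test names produce a readable report without a Gherkin layer.
type TestCase = { name: string; body: () => void };
type Outcome = { title: string; passed: boolean };

function runSuite(suite: string, tests: TestCase[]): Outcome[] {
  return tests.map((t) => {
    const title = suite + ' > ' + t.name;
    try {
      t.body();
      return { title, passed: true };
    } catch (e) {
      // Any thrown error marks the test as failed.
      return { title, passed: false };
    }
  });
}

function formatReport(outcomes: Outcome[]): string[] {
  return outcomes.map((o) => (o.passed ? 'PASS' : 'FAIL') + '  ' + o.title);
}

// Hypothetical suite: the names carry the same information a scenario title would.
const report = formatReport(
  runSuite('Patient registration', [
    { name: 'registers a new patient with the required fields', body: () => {} },
    {
      name: 'shows an error when the date of birth is missing',
      body: () => {
        throw new Error('validation message not shown');
      },
    },
  ]),
);

console.log(report.join('\n'));
```

Each report line combines the suite name with a sentence-like test name, which is the part a non-technical reader needs in order to see what failed.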

Yesterday, @ibacher, @dkigen, @jnsereko and I had a call, discussed the above further, and identified the following as the next steps:

Fix failing tests and enable tests in the deployment pipeline

We already have a workflow to run E2E tests on deployment. In order to get a better result, first we need to make sure all the tests are up-to-date with the current implementation.

Improve DevX

In order to make the E2E framework sustainable we need to improve the developer experience.
- Evaluate the current tools and technologies we use for E2E testing.
- Provide immediate feedback to developers
- Store tests in a convenient place. The farther the tests are from the codebase, the less attention developers pay to them. We need to keep tests where developers can easily update them alongside their changes. One possible solution is to keep tests in each micro frontend repository itself, so that developers can update tests as part of their PRs. This is how wix.com addresses this issue.

Thanks for reading, let me know your feedback!

cc: @dkayiwa @grace @jennifer @christine

dkayiwa · November 16, 2022, 9:26pm

@jayasanka, this is a very useful evaluation!

FWIW, with openmrs-core 2.6.0, we have introduced Testcontainers such that one is able to run all existing tests on MySQL with: mvn test -DuseInMemoryDatabase=false -Ddatabase=mysql or on PostgreSQL with: mvn test -DuseInMemoryDatabase=false -Ddatabase=postgres

As part of the next steps, do you also plan to improve the documentation?

This is one of the best paragraphs that you have ever written for my consumption!!! Ever since we started using cucumber, i have never seen its practical benefit for our community. And i already emphasised this from the very beginning of our cucumber integration:

I like this. But in order to start simple, i would deal with it after all the others are done.

jnsereko · November 17, 2022, 4:20am

dkayiwa:

jayasanka:

What Gherkin syntax is really good for is if we have a less technical team to implement tests. You can have a team of developers create a library of steps and then have less technical users write the scenarios, which is not how we work. Everything we’re getting with Cucumber can be done with a good naming convention for tests and having tests properly organised into suites. Making sure that non-technical users can read a report of tests and understand what parts failed; is a crucial component of effective test design.

This is one of the best paragraphs that you have ever written for my consumption!!! Ever since we started using cucumber, i have never seen its practical benefit for our community. And i already emphasised this from the very beginning of our cucumber integration:

Created a Jira issue for this.

thembo42 · November 17, 2022, 12:28pm

The overhead and unnecessary dependency of Cucumber will ultimately pose a risk of slowing down the CI/CD and QA framework process. Most QA engineers have actually been trying to override the aim and goal of Cucumber’s creator, Aslak Hellesøy. He said in one post:

If you think Cucumber is a testing tool, please read on, because you are wrong.

Cucumber was born out of the frustration with ambiguous requirements and misunderstandings between the people who order the software and those who deliver it.

At OpenMRS, we need to embrace automation best practices.

jayasanka · November 27, 2022, 1:58pm

Thanks for all of your responses!

Yes that would be a priority too.

Here is a quick update on the current state:

Tests are fixed. Distribution 3.x E2E and scheduled tests on GitHub are passing now. The next step is to enable the pipeline to run on commits.

I was able to run E2E tests against a PR. Link


However, the GitHub action took 38 mins to run (16 mins to spin up the container, 17 mins to run tests, and 5 mins for other tasks). We can decrease the time in one of these ways:
1. run it on Bamboo (not sure how to run a Bamboo plan with a PR)
2. configure tests to run without a container in GitHub
3. run on Bamboo, with tests in parallel
4. run on GitHub without a container, with tests in parallel
My rough time estimates for each of the options above are:
1. 17 min (the container spins up immediately)
2. 17 min (no need for a container)
3. 4 min (tests run in parallel; est. time = max(time taken by a spec))
4. 4 min (same as above)
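The estimates above follow from simple arithmetic, sketched below: sequential runs cost the sum of the spec durations, parallel runs cost roughly the longest spec, and the container startup is added on top when there is one. The function name `estimateTestMinutes` and the per-spec durations are hypothetical; the thread only gives the 17 min total and the 4 min parallel estimate, so the split into five specs is an assumption chosen to match those two numbers. Like the estimates above, the sketch leaves out the 5 mins of other tasks.

```typescript
// All durations are in minutes.
interface RunPlan {
  containerStartup: number; // time spent waiting for the backend container
  specDurations: number[];  // duration of each spec file
  parallel: boolean;        // whether the specs run side by side
}

function estimateTestMinutes(plan: RunPlan): number {
  const total = plan.specDurations.reduce((sum, d) => sum + d, 0);
  const longest = Math.max(0, ...plan.specDurations);
  // In parallel, the run is bounded by the slowest spec; otherwise by the sum.
  return plan.containerStartup + (plan.parallel ? longest : total);
}

// Hypothetical per-spec split: sums to the measured 17 min, longest spec is 4 min.
const specs = [4, 3, 4, 3, 3];

const currentRun = estimateTestMinutes({ containerStartup: 16, specDurations: specs, parallel: false });
const noStartup = estimateTestMinutes({ containerStartup: 0, specDurations: specs, parallel: false });
const noStartupParallel = estimateTestMinutes({ containerStartup: 0, specDurations: specs, parallel: true });

console.log({ currentRun, noStartup, noStartupParallel });
```

The parallel figure assumes enough workers to run every spec at once; with fewer workers the real time would fall somewhere between the two estimates.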
Currently I’m evaluating Cypress and Playwright. We are using both tools right now: Playwright for offline tests and Cypress for the other tests. Both tools provide the same functionality, but Cypress has some issues with offline testing. Tests were already implemented with Cypress when the offline testing requirement came up, which is why we have two tools. Btw, I found a nice comparison:

https://www.youtube.com/watch?v=RwNZTjwhgXc

raff · November 29, 2022, 9:22am

Thanks for your work on this!

Do we create the DB from scratch? If we import a DB dump instead, we should significantly reduce the time needed to start up.

jayasanka · December 1, 2022, 3:54pm

Yeah, it’s creating the DB from scratch. Maybe that’s what’s slowing down the startup. I’ll give it a try.

jayasanka · December 9, 2022, 10:40am

Hi all,

As a first step towards migrating tests to relevant repositories, the following PR aims to ensure that tests are located in the same repository as the code they are testing and can be run automatically as part of pull requests or individual commits. This will provide a more streamlined and efficient testing process, allowing tests to be run automatically as part of the development workflow. The PR addresses the requirements outlined in the associated Jira issue.

github.com/openmrs/openmrs-esm-patient-management

O3-1690: Setup E2E testing

openmrs:main ← jayasanka-sack:O3-1690

opened 01:46PM - 08 Dec 22 UTC

jayasanka-sack

+624 -6

## Requirements

- [x] This PR has a title that briefly describes the work done, including the ticket number if there is a ticket.
- [x] My work conforms to the [**OpenMRS 3.0 Styleguide**](https://om.rs/styleguide).
- [x] I checked for feature overlap with [**existing widgets**](https://om.rs/directory).

Fix: https://issues.openmrs.org/browse/O3-1690

## Summary

As a first step towards migrating tests to relevant repositories, this PR aims to ensure that tests are located in the same repository as the code they are testing and can be run automatically as part of pull requests or individual commits. This will provide a more streamlined and efficient testing process, allowing tests to be run automatically as part of the development workflow. The PR addresses the requirements outlined in the associated Jira issue.

### [▶️ Watch Demo!](https://www.youtube.com/watch?v=CinijlHwdaA&ab_channel=JayasankaWeerasinghe)

## Running tests

Once everything is set up,

```sh
# Run all e2e tests on chromium in headed mode:
yarn test-e2e --headed
```

To run a specific test by title:

```sh
yarn test-e2e --headed -g "title of the test"
```

Check [this documentation](https://playwright.dev/docs/running-tests#command-line) for more running options.

## Writing New Tests

In general, it is recommended to read through the official [Playwright docs](https://playwright.dev/docs/intro) before writing new test cases. The project uses the official Playwright test runner and, generally, follows a very simple project structure:

```
e2e
|__ commands
|   ^ Contains "commands" (simple reusable functions) that can be used in test cases/specs.
|__ core
|   ^ Contains code related to the test runner itself, e.g. setting up the custom fixtures.
|     You probably need to touch this infrequently.
|__ fixtures
|   ^ Contains fixtures (https://playwright.dev/docs/test-fixtures) which are used
|     to run reusable setup/teardown tasks
|__ pages
|   ^ Contains page object model classes for interacting with the frontend.
|     See https://playwright.dev/docs/test-pom for details.
|__ specs
    ^ Contains the actual test cases/specs. New tests should be placed in this folder.
```

When you want to write a new test case, start by creating a new spec in `./specs`. Depending on what you want to achieve, you might want to create new fixtures and/or page object models. To see examples, have a look at the existing code to see how these different concepts play together.

## Github Action integration

The e2e.yml workflow is made up of two jobs: one for running on pull requests (PRs) and one for running on commits.

1. When running on PRs, the workflow will start the dev server, use dev3.openmrs.org as the backend, and run tests only on chromium. This is done in order to quickly provide feedback to the developer. The tests are designed to generate their own data and clean up after themselves once they are finished. This ensures that the tests will have minimum effect from changes made to dev3 by other developers. In the future, we plan to use a docker container to run the tests in an isolated environment once we figure out a way to spin up the container within a small amount of time.
2. When running on commits, the workflow will spin up a docker container and run the dev server against it in order to provide a known and isolated environment. In addition, tests will be run on multiple browsers (chromium, firefox, and WebKit) to ensure compatibility.

## Open reports from GitHub Actions / Bamboo

To download the report from the GitHub action/Bamboo plan, follow these steps:

1. Go to the artifact section of the action/plan and locate the report file.
2. Download the report file and unzip it using a tool of your choice.
3. Open the index.html file in a web browser to view the report.

The report will show you a full summary of your tests, including information on which tests passed, failed, were skipped, or were flaky. You can filter the report by browser and explore the details of individual tests, including any errors or failures, video recordings, and the steps involved in each test. Simply click on a test to view its details.

Sample runs:

1. [On PR](https://github.com/openmrs/openmrs-esm-patient-management/actions/runs/3656120450)
2. [On commit](https://github.com/jayasanka-sack/openmrs-esm-patient-management/actions/runs/3655623243)

### 🎃 Sample Report: [Click Me!!!](http://sample-playwright-report-jayasanka.surge.sh/) 🎃

Check [retry #1 of the failing test](http://sample-playwright-report-jayasanka.surge.sh/#?testId=790ef85a432fb5fce601-8977c99b141efac5f1c4) for video and trace.

## Debugging Tests

Refer to [this documentation](https://playwright.dev/docs/debug) on how to debug a test.

## Efforts to improve the execution time

1. Using the `dev3` server as a backend for the PR runs

   The docker container takes 12 - 20 mins to spin up. Therefore the PR runs use the dev3 server as the backend, which saves the time taken to spin up a server.

2. Save the signed-in state

   Before running tests, the system will sign in using the API only once and store the signed-in state for reuse among tests. For more information, see the following link: https://playwright.dev/docs/auth#reuse-signed-in-state

3. Run specs in parallel

   Tests will be run in parallel to save time.

Thanks so much to @rbuisson! The offline testing implementation really helped out a lot! ❤️
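The two workflow modes described in the PR (fast feedback on PRs, isolated multi-browser runs on commits) boil down to a small decision table. Here is a minimal sketch of that decision as plain data, not the actual contents of e2e.yml or the Playwright config. The names `Trigger`, `E2ERunSettings` and `settingsFor`, the `https` scheme for dev3, and the local container URL are all assumptions for illustration.

```typescript
// Hypothetical model of the two jobs described in the PR.
type Trigger = 'pull_request' | 'commit';

interface E2ERunSettings {
  backendUrl: string;         // server the dev server talks to
  usesDockerBackend: boolean; // whether a container must be started first
  browsers: string[];         // browsers the specs run on
}

function settingsFor(trigger: Trigger): E2ERunSettings {
  if (trigger === 'pull_request') {
    // Quick feedback: shared dev3 backend, single browser, no container startup.
    return {
      backendUrl: 'https://dev3.openmrs.org', // scheme is an assumption
      usesDockerBackend: false,
      browsers: ['chromium'],
    };
  }
  // Known, isolated environment: local container, all three browsers.
  return {
    backendUrl: 'http://localhost:8080', // hypothetical container address
    usesDockerBackend: true,
    browsers: ['chromium', 'firefox', 'webkit'],
  };
}

console.log(settingsFor('pull_request'));
console.log(settingsFor('commit'));
```

Keeping the decision in one place like this makes the trade-off explicit: PR runs give up isolation and browser coverage in exchange for skipping the container startup.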

Demo: