Replace Thread.sleep() with proper waitFor() conditions and explicitly
dismiss notifications to prevent test timeouts.
The testTrackActionId test was failing because the IdeaVim notification
balloon remained visible after clicking "Stop Tracking", causing a 5-second
timeout when waiting for "Copy Action Id" text to disappear.
Changes:
- Replace Thread.sleep(1000) after clicking "Copy Action Id" with waitFor
condition that checks for "Stop Tracking" button to appear
- Remove Thread.sleep(1000) after clicking "Stop Tracking"
- Add explicit notification dismissal using Escape key
- Increase timeout for final waitFor to 3 seconds to allow dismissal
This follows the cause-effect principle for UI tests by waiting for
specific UI conditions rather than arbitrary timeouts.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
PyCharm now opens with a project directly instead of showing the Welcome Frame. Removed the project creation logic and unused imports.
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Add explicit cache-read-only: false to ensure Gradle cache
is properly written and read on macOS builds, matching the
configuration used in the unified IntelliJ workflow.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
The RiderUiTest was failing because it didn't handle the .NET SDK
installation dialog that appears when creating a new Rider project
in a fresh environment.
Root cause:
- When the "New Solution" dialog opens, Rider detects that no .NET SDK
is installed and shows an "Install" button
- The "Create" button remains disabled until the SDK is installed
- The test was trying to click the disabled "Create" button, which
failed silently, leaving the dialog open
- The test then timed out waiting for the main IDE frame to appear
Changes:
1. Added SDK installation handling in startNewProject():
- Detects and clicks the "Install" button if present
- Waits 10 seconds for SDK installation to complete
- Waits additional 2 seconds before clicking Create
2. Made license frame handling optional:
- Wrapped manageLicensesFrame in try-catch
- Prevents failure if license is already activated
3. Increased timeouts for project creation:
- Extended IDE frame lookup timeout from 10s to 60s
- Added 5-second wait after clicking Create button
- These changes accommodate the time needed for project creation
Test results:
- Test now passes successfully (100% pass rate)
- Duration: ~1 minute (previously timed out at ~20 seconds)
- All test assertions pass
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
The test was failing because both the "Activation code" radio button
and the text input field share the same accessiblename attribute.
The XPath selector "//div[@accessiblename='Activation code']" was
matching both elements, causing the test to fail when trying to set
text on the first match (the radio button, which is not a JTextComponent).
Solution: Add a classhierarchy filter to the XPath to ensure we only
match the text input field component (JTextComponent), not the radio button.
This fix makes the selector more specific and prevents the ambiguity
that was causing the test failure.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
The UI test was failing because it relied on an obfuscated class name 'W'
to locate the activation code text field. The class name changed to 'Y' in
a recent version of JetBrains Rider, causing the test to fail with
NoSuchElementException.
This change uses the stable 'accessiblename' attribute instead, which:
- Is more stable across UI updates
- Is semantically meaningful and self-documenting
- Follows accessibility best practices
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Added --configuration-cache flag to buildPlugin task in all Linux
UI test workflows to significantly improve build performance.
The configuration cache stores the result of the configuration phase
and reuses it for subsequent builds, avoiding the need to re-evaluate
build scripts when inputs haven't changed. This is particularly
beneficial for UI tests that run frequently on schedule.
Note on "Gradle User Home cache not found" message: This is expected
when the commit SHA changes, as the gradle-home cache key includes it
for security. However, the more important dependency and transform
caches use content-based hashing and are restored correctly across
commits, which is why builds still benefit from caching.
Updated workflows:
- runUiTestsLinux.yml
- runUiRdTestsLinux.yml
- runUiPyTestsLinux.yml
- runUiTestsUnified.yml (both jobs)
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Added explicit instruction to all UI test failure analysis prompts:
"IMPORTANT: If you have a concrete suggestion for fixing the test,
ALWAYS proceed with creating a branch and PR. Never ask for
permission - just do it."
This ensures Claude Code will automatically create fix PRs when it
identifies concrete solutions, eliminating the need for user
confirmation and enabling fully automated test maintenance.
Updated workflows:
- runUiTests.yml (IntelliJ macOS)
- runUiTestsLinux.yml (IntelliJ Linux)
- runUiTestsUnified.yml (IntelliJ unified)
- runUiRdTests.yml (Rider macOS)
- runUiRdTestsLinux.yml (Rider Linux)
- runUiPyTests.yml (PyCharm macOS)
- runUiPyTestsLinux.yml (PyCharm Linux)
- runUiOctopusTests.yml (Octopus macOS)
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Created a unified workflow that runs macOS and Linux tests in parallel,
then performs a single AI analysis of failures from both platforms.
Key features:
1. Parallel execution: test-macos and test-linux run simultaneously
2. Separate artifact uploads: macos-reports and linux-reports
3. Unified analysis job that:
- Triggers if either platform fails
- Downloads artifacts from both platforms
- Provides Claude Code with context from both test runs
- Identifies common vs platform-specific issues
- Creates a single PR for common issues
- Clearly labels platform-specific fixes
Benefits:
- Single unified fix for issues affecting both platforms
- Better context for AI analysis by comparing across platforms
- Reduced PR noise (one PR instead of two for common issues)
- Cost efficiency (one AI analysis instead of two)
The analyze-failures job has git and gh tools enabled to allow
automatic branch creation and PR submission when fixes are identified.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Windows UI tests have been removed. The project will continue to run
UI tests on macOS and Linux platforms, which provide sufficient coverage
for UI testing across different operating systems.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Added the "Auto-click Allow button for screen recording permission" step
to all macOS UI test workflows that were missing it:
- runUiRdTests.yml (Rider macOS)
- runUiPyTests.yml (PyCharm macOS)
- runUiOctopusTests.yml (Octopus macOS)
This step automatically dismisses the macOS screen recording permission
dialog that appears when ffmpeg starts recording. Without this automation,
the dialog blocks the test execution and causes timeouts.
The step tries multiple coordinate positions using both cliclick and
AppleScript fallback to handle different screen resolutions and dialog
positions.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Updated all UI test workflow prompts to instruct Claude Code to
automatically create fixes when concrete solutions are identified:
1. Create a branch with descriptive name
2. Apply the suggested fix to the codebase
3. Run the specific failing test to verify the fix works
4. Create a PR if the test passes with clear documentation
Each workflow includes the appropriate test command for its IDE type:
- IntelliJ/Octopus: gradle :tests:ui-ij-tests:testUi --tests "..."
- Rider: gradle :tests:ui-rd-tests:testUi --tests "..."
- PyCharm: gradle :tests:ui-py-tests:testUi --tests "..."
This enables fully automated test fix proposals without manual
intervention, reducing the feedback loop for fixing flaky or broken
UI tests caused by platform changes.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Clarified that existing UI tests without OS specification run on macOS
by updating workflow names to include "macOS" suffix.
Created Linux versions of Rider and PyCharm UI tests:
- runUiRdTestsLinux.yml: Rider tests on Linux with Xvfb setup
- runUiPyTestsLinux.yml: PyCharm tests on Linux with Xvfb and Python 3.10
Both new workflows follow the same Linux setup pattern as the existing
runUiTestsLinux.yml workflow, using x11grab for screen recording and
appropriate IDE type parameters (-PideaType=RD/PC).
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
IntelliJ Platform now appends file type suffixes to accessible names
for improved accessibility (e.g., "MyTest.java, Java file" instead of
just "MyTest.java"). This caused UI tests to fail when trying to find
and click on Java file tabs.
Updated the XPath selector in Editor.kt to use a flexible matcher that
handles both:
- Exact match for older IntelliJ versions or files without suffixes
- Prefix match with comma for files with type suffixes
The comma in 'starts-with(@accessiblename,'$title,')' ensures we don't
accidentally match unintended tabs that happen to start with the same
characters.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
When UI tests fail due to timeouts but the element is visible in the
video/screenshot, the failure may be caused by renamed properties or
changed class names. The hierarchy HTML file contains the actual UI
structure and can help identify these issues.
Updated all UI test workflows (macOS, Linux, Windows, Rider, PyCharm,
and Octopus) to instruct Claude Code to check build/reports/hierarchy-ideaVimTest.html
and suggest updated queries when this scenario occurs.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
This test is currently failing and needs investigation.
See AI analysis in CI for details.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Allows the AI to remove temporary files when creating thumbnail grids
from screen recordings for better video analysis.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Creates a separate workflow for running UI tests on Ubuntu with:
- Xvfb virtual display for headless GUI testing
- FFmpeg screen recording using x11grab
- Claude Code AI analysis of test failures
- Artifact upload for test reports and recordings
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Changed Windows UI test workflow from PowerShell to bash:
- Run Idea: Now properly runs in background with `&`
- Screen recording: Uses bash syntax for background process and PID capture
- Stop recording: Uses standard bash kill command
Problem: PowerShell Start-Process completed immediately instead of
keeping gradle running in background, causing IntelliJ to never start
and health check to fail.
Solution: Use bash shell (available via Git Bash on Windows runners)
which properly handles background processes with `&` syntax.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Added AppleScript automation to automatically grant screen recording
permission when the macOS permission dialog appears:
- Waits 2 seconds after ffmpeg starts
- Tries up to 10 times (1 second intervals) to click Allow button
- Uses SecurityAgent process to interact with system permission dialog
- Fails gracefully if dialog doesn't appear
This eliminates the manual permission prompt that was blocking
automated screen recording on macOS GitHub Actions runners.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Fixed Windows build failure where KSP plugin couldn't be resolved.
Added mavenCentral() to pluginManagement repositories to ensure
plugins like com.google.devtools.ksp can be found on all platforms.
The KSP plugin is published to Maven Central and wasn't being found
on Windows runners because only Sonatype snapshots and Gradle Plugin
Portal were configured.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Created separate workflow for Windows UI tests:
- Runs on windows-latest
- FFmpeg installation via Chocolatey
- Screen recording using gdigrab (Windows GDI Grabber)
- No permission dialogs on Windows (unlike macOS)
- PowerShell for process management
- Gradle caching enabled
- Only runs IJ tests for now
- AI analysis on test failures
- Records at 30fps with H.264 codec
This provides an alternative platform for UI testing with
working screen recording capabilities.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Removed --no-configuration-cache flag from all runIdeForUiTests commands
to enable Gradle configuration cache and improve build performance.
This works together with the gradle/actions/setup-gradle@v4 action
to provide optimal caching.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Enabled gradle/actions/setup-gradle@v4 for all UI test workflows:
- runUiTests.yml
- runUiPyTests.yml
- runUiRdTests.yml
- runUiOctopusTests.yml
This action automatically caches:
- Gradle wrapper
- Dependencies
- Gradle build cache
This should significantly speed up the "Build Plugin" step on
subsequent runs.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Extended macOS CI screen recording to all UI test workflows:
- Added FFmpeg screen recording to PyCharm, Rider, and Octopus tests
- Removed Linux UI tests workflow (runUiTestsLinux.yml)
Removed video-recorder-junit5 library from codebase:
- Removed dependency from all UI test modules
- Removed video recorder system properties from Gradle configs
- Removed @Video annotations and imports from all test files
- Removed "Move video" steps from all workflows
Updated Claude AI analysis prompts:
- Changed from dual video sources to single CI recording
- Added helpful ffmpeg commands for video analysis:
* Extract frames at specific times
* Create thumbnail grids
* Get video duration
* Extract last N seconds
This simplifies the video recording setup by relying solely on
CI-level screen capture, which provides complete coverage of the
entire test run.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Changed FFmpeg device from "1:0" to "0:none" for proper screen capture.
Added device listing step for debugging.
The previous device index caused "Invalid device index" error.
Using "0:none" captures screen 0 without audio, which is the correct
format for avfoundation screen capture on macOS runners.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Added FFmpeg screen recording at CI level using avfoundation:
- Captures full screen at 30fps starting when IntelliJ launches
- Saves to build/reports/ci-screen-recording/screen-recording.mp4
- Complements Gradle test video with full session recording
- Updated AI analysis prompt to reference both video sources
This provides a complete recording of the entire test run, which may
catch issues that occur outside the focused Gradle test recording.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Fixes repeatedly calling repaint while trying to paint the caret. This is due to a combination of the modelToView translation returning a location instead of a bounding box (the width can be zero) and a check that one rectangle contains another returning false when one rectangle has a width of zero.
Also fixes the caret not redrawing properly while flashing when first shown. This is due to the width and height not being properly initialised.
Created new runUiTestsLinux.yml with modern Linux configuration:
- Uses ubuntu-latest with Xvfb for headless display
- FFmpeg for video recording
- Same test suite as macOS (ui-ij-tests only)
- AI analysis on test failures
- Updated to Java 21 and latest action versions
Removed old commented-out Linux job from runUiTests.yml.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
The Monte recorder uses tscc (Camtasia) codec by default, which has
significant decoding limitations. When attempting to use Claude Code
and ffmpeg to analyze test failure videos, ffmpeg was unable to
properly extract frames beyond the first 7 seconds, even though the
full 27-second video was recorded.
The FFmpeg recorder produces standard video formats that can be fully
decoded and analyzed, enabling proper AI-assisted test failure analysis.
Configuration added to all UI test projects:
- tests/ui-ij-tests
- tests/ui-py-tests
- tests/ui-rd-tests
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>