By using Midscene.js playground, you can quickly experience the main features of Midscene on iOS devices, without needing to write any code.
The playground shares the same codebase as the @midscene/ios
package, so you can consider it as a playground or debugging tool for the Midscene iOS SDK.
Install Node.js 18 or higher.
Prepare an API Key for a visual language (VL) model.
You can find supported models and configurations for Midscene.js in the Choose a Model documentation.
Before getting started, you need to set up the iOS development environment:
Before using Midscene iOS, you need to prepare the WebDriverAgent service. Please refer to the official documentation for setup:
After completing the configuration, you can verify whether the service is working properly by accessing WebDriverAgent's status endpoint:
Access URL: http://localhost:8100/status
Correct Response Example:
If you can successfully access this endpoint and receive a similar JSON response as shown above, it indicates that WebDriverAgent is properly configured and running.
Click the gear button to enter the configuration page and paste your API key config.
Refer to Config Model and Provider document, config the API Key.
After the configuration, you can immediately experience Midscene. It provides multiple key operation tabs, including but not limited to:
All Agent APIs can be directly debugged and run in the Playground! Interactive, extraction, and verification methods are fully covered, with visual operations and verification that boost your automation development efficiency!
Enjoy !
For the different between "Auto Planning" and "Instant Action", please refer to the API document.
After experiencing, you may want to write some code to integrate Midscene. There are multiple ways to do that. Please refer to the documents below: