Platform Monitoring
The Platform Monitoring module allows users to track the real-time health and deployment activity of the Gesund.ai platform. It provides a comprehensive view of system services, model servers, and model deployment statuses, supporting proactive maintenance and troubleshooting.
System Services Status
A live dashboard shows the operational status of essential backend services.
- Service Name: Identifies the service (e.g., MongoDB, Validation).
- Status: Displays whether the service is operational (
UP
) or down (DOWN
). - Last Update: Timestamp of the most recent status check, if available.
Typical services include:
- Autodeploy
- Validation
- MongoDB
- NodeJS Tool
- DB Manager
- Deployment Manager
Tip: Regularly checking service health helps maintain platform stability and fast issue detection.
Model Server List
This section lists the available model servers that are currently connected to the platform.
Each row shows:
- No: The server index.
- Name: The name assigned to the model server.
- Address: The server's IP and port.
- Delete: A button to remove the server from the platform.
No | Name | Address | Delete |
---|---|---|---|
1 | modelserver-1 | [http://34.57.xxx.xx:xxxx] | 🗑️ |
You can add a new server using the Add new model server button.
Model Deployment Status
This section displays the current deployment status of models registered on the platform.
- Model Name: The descriptive name of the deployed model.
- Model ID and Deployment ID: Unique identifiers used for referencing.
- Status:
UP
: Model is live and functioning.INPROGRESS
: Deployment is ongoing.FAILED
: Deployment failed and needs attention.
A Stop Models button is available in the upper right corner to terminate all running or in-progress deployments in one click.
How to Use Platform Monitoring
- Navigate to Platform Monitoring via the main menu.
- Review the System Services section to ensure core infrastructure is functioning.
- Scroll to the Model Server List to manage server connections.
- Continue to Model Deployment Status to observe current deployment activity.
- Click on a model or deployment ID for more details or diagnostics.
- Use the Stop Models button if emergency intervention is needed.
Access and Security
- This page should be restricted to administrators or authorized DevOps users.
- Unauthorized access may expose infrastructure-level controls and sensitive information.
Troubleshooting
- DOWN service: Restart using platform tools or contact the system administrator.
- FAILED deployment: Click the deployment ID to review logs and resolve errors.
- Missing model server: Re-register using Add new model server.