1 of 1 people found this helpful
- To track if a subsystem is up and running, the best way is to take a look at the \\<Server>\Process(<subsystem>)\Elapsed Time. If the value of the counter has not updated in X amount of time, that would indicate that it might not be running.
- To monitor the health of the archive subsystem (answering the "is the data being archived at this time" question), take a look at the following counters:
PI Archive Subsystem\Archived Events/sec (Rate of successful event addition to the archive)
PI Archive Subsystem\Events Read/sec (Rate of archive events read)
PI Snapshot Subsytem\Snapshots/sec (Events sent to the snapshot) -> If this counter is non-zero and the Archived Events/sec it would possibly point at an issue with the Event Queue.
- The PI System Tray can be used for exactly that, but I can't recall if we do have a counters tracking errors / error gravity. Someone else might be able to chime in on this.
- KB00597 - PI ACE performance monitoring does a good job at detailing some ways to monitor PI ACE health.
You may also find the this thread useful
- The slightly outdated but still very good White paper on PI Performance Monitoring is available
- The great IT Asset Monitoring Toolkit and its related PISquare post
Thank you very much for the update.
We have tried implementing the above checks (point 1 &2), and it is working fine. We are also trying to get a perfmon interface installed for ACE server.
Meanwhile I have few queries w.r.to monitoring of PI Backup. We are trying to use '*_PI Backup Subsystem_Last Backup Failed' tag to determine the status of last backup. We have three kinds of backup configured in our server (Incremental, Differential and Full backup).
I would like to know which backup status, this PI tag corresponds to? Is it the incremental one?
Thanks for your help in advance.