Monitoring and Debugging of High Performance Distributed Heterogeneous Cloud Applications
The research project is decomposed into 4 tracks.
- Distributed Applications in the Cloud, Edge and 5G network
- Heterogeneous Multi-Core Coprocessors
- Machine Learning and performance analysis Tools
- New Architecture for Tool integration
Automated monitoring and debugging of large scale manycore heterogeneous systems
The AMDLS research project is decomposed into 4 tracks.
- Data collection through the whole hardware/software stack
- Architecture for real-time and scalable system-level and cloud-level monitoring and analysis
- Anomaly Detection and diagnosis with Machine Learning
- Tracing and debugging support for advanced programming environments
Software Debugging and Monitoring for Heterogeneous Many-Core Telecom Systems (HSDM)
HSDM project is decomposed into 6 sub-projects which are detailed in this page. Tracks 1 to 5 are part of the base NSERC Cooperative Research and Development project, while track 6 is to be financed by Prompt:
- Tracing, Debugging and Profiling Mechanisms and Architecture on Many-Core Systems
- Cloud Debugging and Monitoring
- Advanced analysis
- Parallel and Incremental Analysis
- Model-Driven Engineering support
- Case Studies
Diagnostics for Real Time Distributed Multi-core Architecture in Avionics
This project has four research axes:
- Tracing and Sampling for Real-Time partially simulated Avionics Systems
- Analysis of Real-Time Avionics Systems from Tracing and Sampling data
- Trace abstraction and correlation techniques for real-time avionic systems
- Visualization of Avionic System Traces
Online surveillance of critical computer systems through advanced host-based detection
This project has four research axes:
- Scalable Observation infrastructure - Low disturbance multi-level observation and production of enhanced data
- Scalable Observation infrastructure - Advanced host-based Centralized data store and software pattern identification
- Scalable Detection infrastructure - Harmonized Anomaly Detection Techniques
- Scalable Detection infrastructure - Knowledge base for the Linux kernel
Integrated tracing, profiling and debugging for tuning large heterogeneous clusters
This project has four research axes:
- Tracing the whole hardware infrastructure, from network processors to application-specific many-core processors
- Coordinating multiple sources of monitoring information in the cluster
- Cluster level modeling and analysis
- Integration of Tracer in Cloud Computing Environment
Tracing and Monitoring Tools for Distributed Multi-Core Systems
This three-year project is jointly funded by Ericsson, Defence Research and Development Canada et le National Sciences and Engineering Research Council of Canada. Lead by École Polytechnique's DORSAL, it is being done is collaboration with Concordia University, Laval University and the Université of Ottawa. It is composed of seven research axes:
- Adaptative fault probing
- Multi-level, multi-core distributed traces synchronization
- Trace Abstraction, Analysis and Correlation
- Automated Fault Identification
- Trace Directed Modeling
- System Health Monitoring and Reactive measures activation
- Tracing and Monitoring Framework Impact Prediction