ITCAM for Web Resources (WR) ITCAM for Response Time (RT) Brent Dorenkamp IT Specialist bdorenka@us.ibm.com TIVOLI BOARD OF ADVISORS © 2007 IBM Corporation Tivoli Software ITCAM – Comprehensive Application Management Solution Monitor End User Response Time Monitoring Monitor Transactions Root-Cause Problem Analysis & Resolution Application Tracking & Topology Analysis L3 SME and Diagnostics Tivoli 2 Application Resource Monitoring LOB / Operations Buyer Problem Identification Health Monitoring Basic SLA reporting Application Owner / Operations Buyer Problem Isolation Response time monitoring across multiple components SME Buyer Root Problem Determination Response time within application server Take action to resolve problem Composite Application Manager © 2007 IBM Corporation Tivoli Software ITCAM for Web Resources Introduction IT Operations via TEP Monitor OS Monitor ITM Monitor App Resources & J2EE Applications ITCAM for Web Resources (J2EE App & Web Servers) SME via Web UI Deep Dive & Fix ITCAM for Websphere/J2EE ITCAM for WR provides a more affordable and less complex J2EE application monitoring solution for IT Operations who want: To quickly identify, isolate problems and route to the appropriate SME. Resource monitoring to be proactive in problem identification and eliminate the need for cross-SME teams to resolve issues. To centralize monitoring and reduce reliance on SMEs. ITCAM for WR provides resource and application monitoring with the Tivoli Enterprise Portal (TEP) user interface. ITCAM for WR has easy out of the box installation - installs and deploys in a few hours! ITCAM for WR has improved operations dashboards with needed data to quickly identify the source of the problem. 3 © 2007 IBM Corporation Tivoli Software ITCAM for Web Resources – New Feature Overview Simplified Installation –Installation launch pad for all supported resources and silent install of infrastructure (TEP/DB2) Auto Learning of Thresholds –Used to benchmark performance for a new application (or a changed environment) to understand where thresholds should be established Application dashboard –Summary view of applications to give operators and administrators, at a glance, the ability to see where the problem is located (client tier, application tier or backend tier). These out of the box views are per application server; however, logical views can be customized to include two or more application servers, and can be customized to include a view of the entire application stack, to include databases, web servers and OS resources. Java Standard Edition (J2SE) Workspaces –monitors stand alone java applications Best practices documentation for logical views (Delivered via OPAL) –Logical views allow disparate resources to be grouped together –Resources can be viewed by geographic, functional, or relationship-based groups. –Scalable background images/bitmaps Situation Scripting: – Predefined situations that include preset thresholds, sampling intervals, boolean logic, and expert advice (Delivered via OPAL) –Creation of new situations available for customers to correlate any related data that will alert operations to a potential problem. For example: Correlate average response time alerts with an increase in CPU usage, to alert operations to a possible memory leak. Correlate growth in HTTP sessions with an increase in average response time to an increase in JVM memory usage. Alert operations to check timeout sessions for HTTP sessions 4 © 2007 IBM Corporation Tivoli Software Supported Platforms & Workspaces Application Servers – WebSphere – Tomcat – JBOSS – Weblogic – Oracle – SAP Netweaver – Websphere ESB – WebSphere Portal Server – WebSphere Process Server – Lotus Workplace Server Web Servers – Apache – Internet Information Services (IIS) – iPlanet Application Server TEP Workspaces – – – – – – – – Application Health Summary –Cache Analysis Client Tier Analysis –Workload Management Application Tier Analysis –Web Services Backend Tier Analysis (WebSphere only) Application Health History –J2SE Application Configuration Server Health Summary Request Analysis Datasources JMS Summary Web Applications EJB Containers – Pool Analysis DB2 Connection Pools J2C Connection Pools Thread Pools – Garbage Collection and Allocation Failure Analysis Web Server TEP Workspaces – Web Server Summary – Active Server Pages (ASP) – Web Sites 5 © 2007 IBM Corporation Tivoli Software ITCAM for Web Resources Workspaces 6 © 2007 IBM Corporation Tivoli Software Server Health Summary Enhanced workspace to include paging rate, GC rate, pool size, thread pool usage Metric summary per application server. Create logical views to compare/contrast clusters. Summary statistics indicate overall health of the application server. 7 © 2007 IBM Corporation Tivoli Software Application Health Summary Dashboard NEW workspace for views by application Mouse over to view events At a glance, operators can see what is going wrong with each application. Quick indication if the problem is in the application tier (EJB/JMS/ORB), client tier (servlet) or backend tier (JBDC). 8 Thresholds fire situations to indicated good, fair, poor status. Sampling rates can be configured per application. © 2007 IBM Corporation Tivoli Software Drill Down to the Client Tier (Servlet) NEW workspace for views by client tier per application Top 5 delays and completion rates per application JMS summary by app. server HTTP Session and Web Container per app. server 9 © 2007 IBM Corporation Tivoli Software Drill Down to the Application Tier (EJB, ORB, JMS) NEW workspace for views by application tier per application Top 5 delays and completion rates per application JMS summary by app. server ORB Container and Transactions by app. sever 10 © 2007 IBM Corporation Tivoli Software Drill Down to the Backend Tier (JDBC) NEW workspace for views by backend tier per application Worst delays and most used JDBC and JMS Resources by Application JMS summary by app. server JDBC and JCA Pool Usage by Application Server 11 © 2007 IBM Corporation Tivoli Software Baseline Threshholds NEW workspace for setting thresholds based on historical analysis Use out of the box thresholds for good, fair and poor response times or set thresholds based on the application baseline, i.e., what is normal for that application. 12 © 2007 IBM Corporation Tivoli Software Request Analysis Operations teams track the average response time for requests processed on the app server, and can quickly detect issues when delays increase over time or spike. This workspace shows the worst average response times broken down by Java Component Response Times: Application, JCA, JMS, JNDI, SQL connection/query/update Tabular data set with drill-down response time values for JCA, JMS, JDBC 13 © 2007 IBM Corporation Tivoli Software Pool Analysis J2EE resource pools are critical in terms of providing availability to commonly accessed services such as database access and other container pool types. This workspace enhances PMI data with configuration data to provide a comprehensive overview of requests flowing through WebSphere “funnel”. Comparison of recent active threads in ORB pool Visual correlation of CPU utilization vs. pool consumption Web container pool statistics showing # times at maximum capacity DB2 and J2C connection pools at full saturation 14 © 2007 IBM Corporation Tivoli Software Garbage Collection Analysis Garbage Collection (GC) metrics such as frequency and time to complete can have a large effect on application server performance (during this time no other application processing can take place). This workspace shows a detailed breakdown of GC behavior and provides an complete analysis of GC performance metrics. ` Collection Rate # GC’s per Minute 15 Recent JVM Heap Usage Trend Detailed Analysis of Recent GC Performance % Time Spent in GC Cycle © 2007 IBM Corporation Tivoli Software Cache Analysis Highlights in-memory cache sizes, a shows cache templates with highest miss rates Detailed tabular views of cache metrics for analysis and tuning Miss rates correlated with recent cache size trends 16 IBM Confidential © 2007 IBM Corporation Tivoli Software Single Console Allows Faster Incident Management DB2 Activity and System Memory charts WAS Response Time, Throughput, and JVM and System CPU Usage charts WAS Request Breakdown, Database Wait and Processing Times, and Heap Usage charts –In the TEP you can customize workspaces to view OS, Database, Messaging, and Application server resources to quickly pinpoint common resource problems. 17 © 2007 IBM Corporation Tivoli Software Correlation of events/automate take action using scripts Correlate situations, and assign expert advice and actions on a situation. 18 Correlation of situations – how to & example situations – will be available on OPAL in May. Examples of situation correlation include: – Correlate average response time alerts with and increase in CPU usage, to alert operators to investigate a a memory leak. – Correlate alerts for increased activity on the server to ensure JVM memory usage is not impacted. Correlate growth in HTTP sessions with an increase in average response time to an increase in JVM memory usage. Alert to check timeout sessions for HTTP sessions set to appropriate level to help reduce impact on memory. © 2007 IBM Corporation ITCAM for Response Time v6.2 TIVOLI BOARD OF ADVISORS © 2007 IBM Corporation Tivoli Software Current State Of IT Application Management IT organizations are under tremendous pressure to deliver results. IT Challenge Business Impact Majority of IT problems are still being identified by end user complaints Lost business when problems go undetected; impacts customer satisfaction Composite application performance problems can take many hours to isolate Support budget spent isolating problems instead of fixing them Once problem is identified a large team of specialists is required to resolve the problem Large amount of money from both development and support spent solving problems Most application problems go back to development group, which cannot recreate the problem Development budget spent fixing problems instead of building new applications … What we need is comprehensive application management 20 © 2007 IBM Corporation Tivoli Software End User Monitoring is the Best Way to Determine what IT Problems need Attention What is the Customer Experience?... Web server thread pools are full - is it impacting the customer response time? … Do you know what your customers are experiencing? Or are they calling your help desk to tell you that you have an availability or response time problem. Effective application management requires monitoring both application resources and response times to find the real problems that are impacting your business 21 © 2007 IBM Corporation Tivoli Software Workflow for Managing Composite Applications Problems Sense Detect that a threshold has been breached and that a problem occurred, or is about to happen Isolate Diagnose Repair Pinpoint the problem to a specific part of the environment and hand-off to the appropriate specialist Drill down into the details and get to the root cause of the problem Fix the faulty component, validate the fix and roll back into production ITCAM for RT: End-User Response Monitoring 22 © 2007 IBM Corporation Tivoli Software ITCAM for Response Time What is it? and Why I Need it! What is ITCAM for Response Time (RT) – 23 Tivoli Enterprise Portal (TEP) based solution that provides IT Operations with both real time and robotic monitoring of the end user response time experience to help quickly identify SLA breaches and to help proactively prevent future violations. Highlights – Provides comprehensive response time coverage for both Web and Windows applications using a variety of robotic and real-time analysis – Integrates with the Tivoli Enterprise Portal (TEP), a portal-based customizable user interface that can bring together response time data from ITCAM for RT and IBM Tivoli Monitor (ITM) resource data in an easy-touse interface to quickly identify what resource bottlenecks are impacting the end user experience – Integrates with other IBM Service Management (ISM) and Tivoli products to provide complete automated end-to-end management of your applications © 2007 IBM Corporation Tivoli Software ITCAM for Response Time The Best of Both Types of Response Time Monitoring in One Integrated UI Synthetic Transactions – Synthetic playback of all robotic scripts Via Rational Robot, RPT, Mercury LoadRunner, Command Line Interface Real End User Transactions – 24 Robotic Response Time Monitoring Web Response Time Monitoring Monitors real end user web transactions (HTTP/S) – Client Response Time Monitoring Monitor real end user client Windows application transactions i.e. Lotus Notes, Microsoft Outlook, SAP, 3270, etc © 2007 IBM Corporation Tivoli Software Availability Dashboard Shows overall enterprise status for all Clients and Applications Quickly identify the Top 5 worst performing Clients and Applications, and drill down to identify the problem… 25 © 2007 IBM Corporation Tivoli Software Application Service Level Metrics View availability over time to quickly see when a problem started Compare with application load to see if a spike in volume could have contributed to the problem Is your Application’s Uptime and Downtime improving or getting worse? 26 © 2007 IBM Corporation Tivoli Software Applications Quickly identify the Top 5 worst performing applications Drill into a specific Application for more details… 27 © 2007 IBM Corporation Tivoli Software Application Summary How has the application been performing over time? 28 © 2007 IBM Corporation Tivoli Software Transaction Summary View the application’s business transactions and how they have been performing over time Drill into the worst performing transaction for more details… 29 © 2007 IBM Corporation Tivoli Software ITCAM for Response Time v6.2 Release Highlights Unified Infrastructure and User Interface • Single infrastructure built on ITM • Single, consolidated user interface built on Tivoli Enterprise Portal (TEP) Improved Consumability to Enhance Ease of Use and Time to Value • Fully customizable dashboard, reports and workspaces • Simplified configuration, including default Situations • Simplified installation • Intelligent alerting based on ITM powerful situations editor • Configurable data aggregation as low as every 5 minutes Enhanced Response Time Monitoring • Report & alert on any real time or historical response time metric • Identify response time bottlenecks by Client, Network or Server times • Identify, report & alert on individual clients or locations • Discover, report & alert backend server resources • Improved robotic monitoring w/ Rational Performance Tester (RPT) • Immediate playback of robotic scripts • Custom ARM application response time monitoring • Improved CLI functions to edit configuration Deliver IBM Service Management Foundation Elements • CCMDB discovery & real time status of Business Processes & Business Activities 30 You can install ITCAM for RT and have it show real web response time data within minutes! (Really!) ITCAM for RT is so easy to run on top of ITM that even a (you guessed it) A Neanderthal could do it ITCAM for Response Time © 2007 IBM Corporation Tivoli Software Questions? 31 © 2007 IBM Corporation
© Copyright 2024