ITIL - Introducing Service Design PDF
ITIL - Introducing Service Design PDF
ITIL - Introducing Service Design PDF
The service design stage of the lifecycle starts with a set of new or changed business requirements and ends with the development of a service solution designed to meet the documented needs of the business. This developed solution is then passed to service transition to evaluate, build, test and deploy the new or changed service. Other objectives include: Design services to satisfy business objectives, based on the quality, compliance, risk and security requirements, delivering more effective and efficient IT and organisation solutions and services aligned to organisational needs Design services that can be easily and efficiently developed and enhanced within appropriate timescales and costs and, wherever possible, reduce, minimise or constrain the long term costs of service provision Design efficient and effective processes for the design, transition, operation and improvement of high quality IT services, together with the supporting tools, systems and information, especially the service portfolio, to manage services through their lifecycle Design secure and resilient IT infrastructures, environments, applications and data/information resources, and capability, that meet the current and future needs of the organisation Design measurement methods and metrics for assessing the effectiveness and efficiency of the design processes and their deliverables Produce and maintain IT plans, processes, policies, architectures, frameworks and documents for the design of quality IT solutions, to meet current and future agreed organisation needs Assist in the development of policies and standards in all areas of design Develop the skills and capability within IT by moving strategy and design activities into operational tasks, making effective and efficient use of all IT service resources Contribute to the improvement of the overall quality of IT service within the imposed design constraints, especially by reducing the need for reworking and enhancing services, once they have been implemented in the live environment
U C I S A
I T I L :
I N T R O D U C I N G
S E R V I C E
D E S I G N
Reduced Total Cost of Ownership (TCO) Improved quality of service Improved consistency of service Easier implementation of new or changed services Improved service alignment More effective service performance Improved IT governance More effective service management and IT process Improved information and decision making
In order to deliver the benefits of service management and ITIL, these 4 Ps need to overlap each other, a popular misconception is that 1 P will fix all the rest, and many organisations believe this is the product. There has to be a balance of all 4 Ps to ensure the right mix for the appropriate design. Many designs, plans and projects fail through a lack of preparation and management. The implementation of ITIL service management as a practice is about preparing and planning the effective and efficient use of the four Ps: the People, the Processes, the Products (services, technology and tools) and the Partners (suppliers, manufacturers and vendors).
U C I S A
I T I L :
I N T R O D U C I N G
S E R V I C E
D E S I G N
U C I S A
I T I L :
I N T R O D U C I N G
S E R V I C E
D E S I G N
Definition Matching and customising (with the customer) of the right service provision against the right costs: Service catalogue Demands of the customer (service level requirements)
Agreement (defining and signing SLAs) Service Level Agreements, supported by: Operational Level Agreements (OLAs) and underpinning contracts
Monitoring Measuring the actual service levels against the agreed service levels
Reporting Reporting on the service provision (to the customer and the IT organisation)
Evaluation (review) Evaluate the service provision with the customer Match and customise: adjust service provision if required? (service improvement plan/programme, service quality plan) Match and customise: adjust SLA if required?
U C I S A
I T I L :
I N T R O D U C I N G
S E R V I C E
D E S I G N
Service spec sheets (service specifications) Connection between functionality (externally/customer focused) and technicalities (internally/IT organisation focused)
Service catalogue Service level management must ensure that a service catalogue is produced, maintained and contains accurate information on all operational services and those ready for deployment. A service catalogue is a written statement of all current and available IT services, default levels and options. SLA (Service Level Agreement) The written agreement between the provider and the customer (organisation representative). Service level achievements The service levels that are realised. SIP (Service Improvement Plan) This is a formal plan or program that is developed when the IT service provider is not currently delivering a service that meets the legitimate Service Level Requirements (SLRs) of the business representative or when greater cost effectiveness is achievable. The SIP should include clear milestones, which will enable the business representative to judge whether or not timely progress is being made. SQP (Service Quality Plan) not specifically an SLM term, but strategically linked This plan underlies the service strategy, detailing the internal targets to be achieved within an agreed period, typically one to two years, to improve agreed service levels and the business perception of service quality: Management information for steering the IT organisation Process parameters of the service management processes and the operational management Key performance indicators:
zz zz
Incident management resolution times for levels of impact Change management processing times and costs of routine changes
OLA (Operational Level Agreement) A written agreement with another internal IT department to support the SLA. UC (Underpinning Contract) A written agreement with an external IT supplier.
U C I S A
I T I L :
I N T R O D U C I N G
S E R V I C E
D E S I G N
Capacity management
Why have capacity management?
Every application makes its own demands on the IT environment. Some are unavoidable, such as the continuing spread of applications for Enterprise Resource Planning (ERP), supply chain management or human resources management. Also, new applications are emerging (e.g. those employing multimedia content) that will impact IT with their heavy demands for bandwidth. Finally, additional applications are required to support the growing IT infrastructure of an organisation (e.g. remote storage of back up data). Failure to consider these issues will lead to negative effects on the business, as the capacity of the IT environment simply does not match the requirements of the business. Balancing costs and supply against demand Balancing costs against resources needed
U C I S A
I T I L :
I N T R O D U C I N G
S E R V I C E
D E S I G N
Analysis Tuning Modifications made for better utilisations of current infrastructure (under the control of change management) Implementation Monitoring
U C I S A
I T I L :
I N T R O D U C I N G
S E R V I C E
D E S I G N
Availability management
Why have availability management?
Gartner research shows that people and/or process failures directly cause an average of 80% of mission critical application service downtime. The other 20% is caused by technology failure, environmental failure or a disaster. The complexity of todays IT infrastructure and applications makes high availability systems management difficult. Applications requiring high levels of availability must be managed with operational disciplines (including network monitoring, systems management activities etc.) to avoid unnecessary and potentially devastating outages.
Reactive activities Monitoring, measuring, analysis and management of all events, incidents and problems involving unavailability Continually optimise and improve availability of IT infrastructure services Assisting security and ITSCM in the assessment and management of risk Attending CAB as required
Determining availability requirements Input from service level management. The Service Level Manager discusses with the client what their needs for service are (service level requirements). Based on these requirements, the Availability Manager can determine the availability requirements. Determining vital business functions (VBFs) Some organisations processes are more critical than others. IT systems that support VBFs should have higher availability expectations and the appropriate systems and support to achieve and sustain these higher levels at critical times which have been agreed and documented in the SLA. Business impact analysis A formal analysis of the affect on the business, if a specific set of IT services are not available. It will also identify the minimum set of services that an organisation will require to continue operating. Risk analysis management (input for IT service continuity management) this activity involves trying to identify the impact to the business of service unavailability. It is part of our risk analysis and risk management activities. The outcomes of these activities are utilised in various processes: availability management, IT service continuity management and information security management.
Defining availability, reliability and maintainability targets Input for Service Level Agreements (SLAs) and other contracts. Based on the first three steps, we are now ready to create the achievable and sustainable availability targets. In order to do this, we need to get input from all technical areas within the IT group. (as well as suppliers).
U C I S A
I T I L :
I N T R O D U C I N G
S E R V I C E
D E S I G N
Monitoring and trend analysis MTBF (Mean Time between Failures) MTBSI (Mean Time between System Incidents) MTRS (Mean Time to Restore Services) Planned downtime, unscheduled downtime, extended (excess) downtime Frequent (scheduled) backups
Root cause analysis of low availability Relationship with the problem management process
Producing and maintaining an availability plan This plan has to be updated at least once every year! Reporting
Maintainability: The ability of the IT group to maintain the IT infrastructure in operational state and available according to the agreed service levels. Security: Resilience: Confidentiality, Integrity and Availability (CIA) of data. The ability of individual components to absorb or be flexible in times of stress.
U C I S A
I T I L :
I N T R O D U C I N G
S E R V I C E
D E S I G N
The following metrics are commonly used in availability management: Mean Time to Restore Services MTTRS: average time between the occurrence of a fault and service recovery (or the downtime) Mean Time Between Failures MTBF: mean time between the recovery from one incident and the occurrence of the next incident Mean Time Between System Incidents MTBSI: mean time between the occurrences of two consecutive incidents. The MTBSI = MTTR + MTBF
The ratio of MTBF to MTBSI shows if there are many minor faults or just a few major faults. Availability reports may include the following metrics: Rate of availability (or unavailability) in terms of MTRS, MTBF and MTBSI Over all uptime and downtime, number of faults Additional information about faults which actually, or potentially, result in a higher than agreed unavailability
U C I S A
I T I L :
I N T R O D U C I N G
S E R V I C E
D E S I G N
10
The diagram, on the previous page, shows the four stages of ITSCM, incorporating each of the activities that take place to ensure that IT organisations are as prepared and organised as possible in the event of a disaster situation. Stage 1 is really an activity that has to be done by the business, so IT can figure out what it is that BCM do? The IT organisation has to provide details with how they will support this BCM by the continuation of delivery of IT services in times of crisis and disaster. Two of the major data sources for ITSCM are developed within requirement and strategy, including business impact analysis and risk assessment. Stage 1: initiation Link with business continuity plan Policy setting Terms of reference and scope Allocate resources
Stage 2: requirements and strategy Input from availability management and security management (risk assessment) Business impact analysis Discuss recovery options (link to SLM)
Emergency response plan Damage assessment plan Salvage plan Crisis management and PR plan
Implement standby arrangements Implement recovery options Test the plans Develop and implement procedures and working instructions
Stage 4: operational management Link ITSCM to change management to keep plans and recovery options up to date IT staff need to be aware and trained to use the plans Continuous improvement of the process through review and testing
11
Gradual recovery (cold standby): an empty room available (in house or outsourced service), mobile or fixed, where IT infrastructure can be rebuilt. (Takes longer than 72 hours to recover). Intermediate recovery (warm standby): a contract with a third party/supplier recovery organisation to use their infrastructure in a contingency situation. Backup tapes should be available at the crisis site at all times. (Takes 24 to 72 hours to recover). Fast recovery (hot standby): this option (sometimes referred to as hot standby) provides for fast recovery and restoration of services, and is sometimes provided as an extension to the intermediate recovery provided by a third party recovery provider. Where there is a need for a fast restoration of a service, it is possible to rent floor space at the recovery site and install servers or systems with application systems and communications already available, and data mirrored from the operational servers. In the event of a system failure, the customers can then recover and switch over to the backup facility with little loss of service. (This typically involves the re-establishment of the critical systems and services within a 24 hour period). Immediate recovery (also known as hot standby): a full duplication of system (minus components) for instantaneous recovery. This option (also often referred to as hot standby, mirroring, and load balancing or split site) provides for immediate restoration of services, with no loss of service. For business critical services, organisations requiring continuous operation will provide their own facilities within the organisation, but not on the same site as the normal operations. Sufficient IT equipment will be dual located in either an owned or hosted location to run the complete service from either location in the event of loss of one facility, with no loss of service to the customer. The second site can then be recovered whilst the service is provided from the single operable location. This is an expensive option, but may be justified for critical business processes or VBFs where non-availability for a short period could result in a significant impact, or where it would not be appropriate to be running IT services on a third partys premises for security or other reasons. The facility needs to be located separately and far enough away from the home site, that it will not be affected by a disaster affecting that location. However, these mirrored servers and sites options should be implemented in close liaison with availability management, as they support services with high levels of availability.
U C I S A
I T I L :
I N T R O D U C I N G
S E R V I C E
D E S I G N
12
Security controls
Preventive Preventing security incidents from occurring
U C I S A
I T I L :
I N T R O D U C I N G
S E R V I C E
D E S I G N
13
U C I S A
I T I L :
I N T R O D U C I N G
S E R V I C E
D E S I G N
14
Supplier management
Why have supplier management?
The supplier management process ensures that suppliers and the services they provide are managed to support IT service targets and organisation expectations. It is essential that supplier management processes and planning are involved in all stages of the service lifecycle, from strategy and design, through transition and operation, to improvement. The complex organisation demands require the complete breadth of skills and capability to support provision of a comprehensive set of IT services to a business. Therefore, the use of value networks and the suppliers (and the services they provide) are an integral part of any end to end solution. Suppliers and the management of suppliers and partners are essential to the provision of quality IT services. The purpose of the supplier management process is to obtain value for money from suppliers and to ensure that suppliers perform to the targets contained within their contracts.
15
other information and relationships with other associated CIs. This will also contribute to the information held in the service portfolio and service catalogue. The information within the SCD will provide a complete set of reference information for all supplier management procedures and activities.
Although supplier management is firmly placed within the service design Phase of the lifecycle, some of the activities are carried out in the other lifecycle phases too. Supplier categorisation and maintenance of the SCD (occurs within the service design phase) Evaluation and setup of new Suppliers and contracts (occurs within the service design phase) Establishing new suppliers (occurs within the service transition phase) Supplier and contract management and performance (occurs within the service operation phase) Contract renewal and termination (occurs within the service operation phase)
U C I S A
I T I L :
I N T R O D U C I N G
S E R V I C E
D E S I G N
16
U C I S A
I T I L :
I N T R O D U C I N G
S E R V I C E
D E S I G N
17