POC Wrapup

Views:
 
Category: Entertainment
     
 

Presentation Description

No description available.

Comments

Presentation Transcript

Informatica Data Archive :

Informatica Data Archive Solution Overview, POC Summary & Results 25 th – 29 th July 2011

Informatica at a Glance:

Informatica at a Glance Founded: 1993 Headquarters: Redwood City, CA Employees: > 2,000 Offices: Americas, EMEA, Asia-Pacific (in 26 countries) Revenue: $650 million ( 2010) 5-year Compound Annual Growth Rate: 20% per year Customers: > 4,200 84 of Fortune 100 87%+ of Dow Jones Partners : Over 400 major SI, ISV, OEM and On Demand Singular focus on Information Management, Products and Services

The Data Growth Problem:

The Data Growth Problem Source: IDC Study on the Changing Enterprise Data Profile Growing storage and database license costs Increasing effort spent on maintenance/operations Diminishing performance Data retention/Compliance policies hard to implement Inactive data Active data Performance T I M E D A T A B A S E S I Z E

Slide 5:

Oracle MS SQLServer DB2 etc. Informatica Data Archive Application Archiving and Retirement Value Proposition Improve Application Performance Reduce Costs Improve Operational Efficiency Reduce Infrastructure Complexity

Managing Data Growth:

Managing Data Growth AFTER SOLUTION Improved, stable performance Predictable manageable growth Reduced maintenance & compliance work

Turkcell Centralised Archiving:

Turkcell Centralised Archiving Reduce costs and problems in managing data growth at Turkcell Complete solution for the Enterprise Single solution for all databases / applications ( eg . Siebel) Live archiving and retirement C entralised methodology for Turkcell Quickly manage all data growth issues Remove data management headaches Implement a data retention policy

Long Term Data Retention:

Current Data Long Term Data Retention Production Database Online Archive Database Seamless Access Layer Archived Data Current Data Current Data EMC File Archive Repository Manage the Information Lifecycle Manage retention at record level Relocate data as it loses business value Purge Data to ensure compliance Legal/Audit tagging and holds

Data Growth Analysis:

Data G rowth Analysis

Slide 10:

Development View

Slide 11:

Data Profiling

Data Archive Accelerators:

Data Archive Accelerators Comprehensive out-of-the-box content for leading packaged applications Functional entity definitions Future version support Business rule validation History upgrade when upgrading production Seamless access to archived data Application specific functionality

Informatica Data Archive Simple, Wizard-based Interfaces to Manage Archive Cycles:

Informatica Data Archive Simple, Wizard-based Interfaces to Manage Archive Cycles

Seamless Access Layer:

Seamless Access Layer Uses existing application user interface Dynamically generated database layer Leverages existing security Handles customizations and extensions Does not modify application code Supports third party query/report tools that currently access application info directly from the database Standard functionality for Oracle Applications, PeopleSoft, and Siebel

Typical maintenance costs:

Typical maintenance costs 2 OF 3 CIOs SAY THEIR ORGANIZATIONS DO NOT HAVE A SINGLE VIEW OF LEGACY SYSTEM DATA FOR COMPLIANCE REPORTING SOURCE: NCC survey companies with over 50 IT staff >50% OF APPLICATIONS ARE Legacy in Typical Enterprise Portfolios 70/30 IT BUDGET SPEND ON EXISTING VS NEW PROJECTS

Application Retirement Solution:

Application Retirement Solution Application Data Reports & data discovery portal BEFORE RETIRED Database Thin Client Reports Thick Client Operating System Hardware Application IT Staff Maintenance

File Archive Repository:

File Archive Repository Storage Optimized file format stored to disk Highly compressed (up to 40:1 ) Support for storage platforms like EMC Centera Access Supports native SQL Query without restore Supports leading Business Intelligence platforms Optimised query performance Supports Query Point in Time Security Data and structure are immutable Tamper detection, fully audited access Support data retention, expiration , legal holds

Customers:

Customers Avea invested in Informatica ILM Solutions to archive Siebel and other applications. They are also reducing costs in managing DEV/TEST environments for their DSF and BSCS Billing Systems. Vodafone Turkey recently selected ILM to manage growth with their CDR - CRM and billing data and applications

POC Results :

POC Results

POC Timeline:

POC Timeline Day 1 - Installation and Configuration Discuss scenarios Day 2 - Build Entities and business rules Complete Load to DB Archive Large Load to File Archive Day 3 - Complete criteria Run benchmarks Day 4 - Product Presentation Complete benchmarks Document results Day 5 - Wrap-up Meeting

POC Criteria: 100% Pass:

POC Criteria: 100% Pass

POC Highlights: File Archive Test:

POC Highlights: File Archive Test 26,906,510 records Total transfer time: 2:42 hours 13 GB in Oracle, compressed to 576 MB 95% Compression Maintained good query performance, a transaction select completes in ~3 seconds

POC Highlights: Performance:

POC Highlights: Performance Scenario Rule Archived Records Time CONTR_SERVICES Co_id between 1 and 1000 4673 0:02:51 CONTR_SERVICES Co_id between 1001 and 2000 15880 0:03:30 CONTR_SERVICES Co_id between 2001 and 3000 17749 0:04:27 CONTR_SERVICES Co_id between 3001 and 4000 16139 0:03:41 AEH No restriction rule 1406 0:00:14 Invoice CO_ID between 1020000 and 1021000 1021 0:05:57 Invoice CO_ID between 1000000 and 1010000 13449 0:06:10 Invoice CO_ID between 1000000 and 1010000 13449 0:06:10 Rateplan_hist CO_ID between 310000 and 311000 128 0:00:27 Rateplan_hist CO_ID between 10000 and 20000 1485 0:02:11 Rateplan_hist CO_ID between 10000 and 20000 1485 0:02:11 Tickler CO_ID between 10000 and 11000 1622 0:00:30 Tickler CO_ID between 110000 and 120000 16867 0:03:19 Tickler CO_ID between 110000 and 120000 16867 0:03:19

POC Highlights: Auto Discovery:

POC Highlights: Auto Discovery

Summary:

Summary Solution meets all Turkcell requirements Proven product and methodology suitable for a centralised enterprise solution ILM is a major focus area of investment by Informatica Local experience with technology and Turkcell applications like BSCS Software platform to compliment existing EMC hardware partner

Screenshot: Auto-Discovery of parent/child relationship:

Screenshot: Auto-Discovery of parent/child relationship

Screenshot: Scheduler:

Screenshot: Scheduler

Screenshot: Archive Workflow:

Screenshot: Archive Workflow

Screenshot: Granting / Revoking Access Rights and User Roles:

Screenshot: Granting / Revoking Access Rights and User Roles

Screenshot: Search/Browse of Archived Data:

Screenshot: Search/Browse of Archived Data

Screenshot: Manually defined parent/child relationships of archive:

Screenshot: Manually defined parent/child relationships of archive

Screenshot: Accessing Archive through JDBC/ODBC:

Screenshot: Accessing Archive through JDBC/ODBC

Backup Slides:

Backup Slides

Informatica at a Glance:

Informatica at a Glance Founded: 1993 Headquarters: Redwood City, CA Employees: > 2,000 Offices: Americas, EMEA, Asia-Pacific (in 26 countries) Revenue: $650 million ( 2010) 5-year Compound Annual Growth Rate: 20% per year Customers: > 4,200 84 of Fortune 100 87%+ of Dow Jones Partners : Over 400 major SI, ISV, OEM and On Demand Singular focus on Information Management, Products and Services

Slide 36:

ACTIVE Archive Process INACTIVE DB Entity Biz Rule 1 Biz Rule 2 Biz Rule 3 Archive ID Date Status Type 1 01-Jan-09 Open Prem N 2 04-Feb-05 Closed Econ Y 3 12-Apr-08 Open Econ N 4 27-Dec-06 Open Prem N 5 12-Feb-05 Closed Econ Y Build Interim Tables and Test Rules Copy Data to Staging Purge Data from Production Copy Data to Archive Drop Inactive Data Production Archive INACTIVE INACTIVE

Identifying Data to Archive:

Identifying Data to Archive Business Rules Transaction chaining Within an entity To other applications Testing of Fields, Flags & Codes Entity Definition Logical unit to archive Database and application level relationships Policy scoping criteria CONFIDENTIAL

Archive Process (vs. ‘Manual’ Archiving):

Archive Process (vs. ‘Manual’ Archiving) “Live” Process Uses dynamically generated SQL Runs on the production database server Single engine relocates to multiple formats Complete logging and audit trails Schedulable, repeatable process Re- startable in case of interruption Data structure synchronization process CONFIDENTIAL

Restore Process - Features:

Restore Process - Features Standard functionality Multiple options All archived data Criteria based Archive Cycle Single transaction “Undo” functionality Always tested, rarely used CONFIDENTIAL

Seamless Data Access – Database Layer:

COMBINED History Transactional Tables ARCHIVE_ONLY Seamless Data Access – Database Layer Production Archive / History Current Only – Majority of Users Archive Only – Archived transactional data and current master data Combined – Current + archived transactional data Data Access Options Seamless Access Layer Union View Applications

Slide 41:

Informatica Data Archive ILM Archive Repository ILM Engine File Archive Service BI Tools DataDiscovery JDBC ODBC/ JDBC PROD Data Adapters Native Connectivity ILM Engine is a J2EE Application Server (using Apache Tomcat) ILM Archive Repository is a database (typically Oracle or SQL Server) ILM File Archive Server is 64-bit Application Server running on Windows/Linux/Unix EMC Centera ARCH Data Adapters Native Connectivity

Slide 42:

SYSADM_COMB AM_STAGE SYSADM AM_STAGE SYSADM3 ILM Engine and UI File Archive Service Optimised File Archive BSCS Oracle 10G Database SYSADM: Source Data AM_STAGE: Temporary area SYSADM_COMB: Seamless Access Schema BSSARCH Oracle 11G Database SYSADM3: Archived Data Schema AM_STAGE: Temporary area for restore ILM Engine : J2EE Web Application Server Optimised File Access: Files in C:\INFA\ARCHIVE WINDOWS 2K8 R2 Machine