NEW - IDC WebcastFree E-NewslettersRSS Feeds | Site Map
Security Resource CentreBusiness Value of TechnologyMunicipal Centre
SearchSearch
Tips
Program
Slice by Program

EXCLUSIVE: What went wrong at Canada Revenue Agency

By: Lisa Williams, senior writer, InterGovWorld(04-02-2007)

Treasury Board Secretariat has launched a full investigation with Canada Revenue Agency into the software bug that brought down the national e-tax filing system for 10 days earlier this month.

It was the longest stretch of time that a mission-critical system at CRA had been out of service. Canada Revenue says it had an army of 700 employees working to get the system back up and running, including IT staff who worked around the clock.

CRA has confirmed the system crash was caused by a vendor-issued software patch for its database management system, Advantage CA-IDMS (Integrated Database Management System), manufactured by CA Inc.

A spokesperson for CA did not give details on why or how the patch malfunctioned, but he did say the company helped to restore system availability. "CA worked closely with CRA to resolve the problem and restore service," said Fabrice Zambito, regional vice-president, CA Canada.

Amidst the crisis came scathing criticism from Liberal Revenue critic Judy Sgro, who accused federal Revenue Minister Carol Skelton of ignoring the system crash while away in Saskatchewan. Sgro called for an extension to the tax filing deadline if the system remained down for much longer.

Treasury Board confirmed it was investigating the case with CRA officials. "The Chief Information Officer Branch has worked closely with the Canada Revenue Agency, as it does with all departments, to support the Agency in working through this event," said a spokesperson. "One of Treasury Board Secretariat's roles is to assist departments in monitoring and addressing arising issues."

Gordon O'Grady, deputy assistant commissioner with the IT branch at CRA, talked this week with InterGovWorld senior writer Lisa Williams about the main culprit of the glitch, why it took so long to fix the faulty patch, and the processes behind its post mortem with Treasury Board.

Q) The Commissioner of CRA (Michel Dorais) had said that it was a malfunctioning software patch that was the cause of the service disruption. Can you give me the details on the patch itself?

A) The patch was created by the vendor (CA Inc.) to address a problem related to their database management software, which if encountered would cause database processing to stop, resulting in a service disruption.

Q) What was the type of software that was being used?

A) It was database management system software called Advantage CA-IDMS (Integrated Database Management System).

Q) How was the malfunction noticed?

A) On March 5, (the day after the patch was applied), there were sporadic anomalies that were reported, occurring in the Efile system and other integrated management system databases, but as identified earlier, all were predominantly individual taxpayer databases.

Q) What was the specific testing procedure that was in place?

A) At CRA, patches are thoroughly tested in multiple test environments; so they're progressively released through each environment, until they're finally released in production. We usually allow two weeks between each release on each test environment before we go into production.

Q) When you say two weeks is allowed, do you mean in terms of the testing itself?

A) There are various test environments or test states. Each testing that's done spans the two-week period before it's then released to the next test environment. So there's a period of stabilization before it moves to the next stage in testing.

Q) This is viewed as a mission-critical service and is the time of year when people are normally filing their taxes. It's also the longest time online services have been down on the CRA Web site. Why did it take so long to get things up and running?

A) Due to the unprecedented scale and scope of this incident, it had a significant impact on the level of effort and time required to get all IT services back to normal production status. As we indicated earlier, it was mostly impacting individual IT services that included more than 76 computer applications, comprising more than 7.5 million lines of code and interfacing with 91 different databases.

Q) How big of a team was working on this?

A) Our IT staff worked around the clock, as you would expect, to resolve the problem. Due to the complexity and size of databases and systems, this was the amount of time (10 days) required to restore services. We estimate that 700 CRA employees worked on this problem.

Q) Were there other departments that were involved or that you had to consult with?

A) The IT part of the branch in the agency did of course work with business branches in the agency to make sure that the recovery plan did include business impact. We called upon the expertise and advice of key CIOs and other federal agencies and departments who are business partners. Together with our business partners, we worked in cooperation to manage the possible impact on other government organizations and coordinate the implementation sequence and all implementation of key services back into production.

Q) Now that the systems are up and running, do you think that your branch will be looking at amending best practices or changing procedures?

A) We're currently working with Treasury Board on a post-mortem document as a result of the outage. And I'd just like to say that CRA's IT branch follows a set of well-established, rigorous frameworks that ensure risk is proactively managed and that process and quality in data integrity are respected and safeguarded across all our operating environments. These management processes are derived from the ITIL (Information Technology Infrastructure Library) framework, which is a framework of industry best practices in IT.

Related content:

E-tax glitch has Canada Revenue stumped

Revenue Minister Skelton lambasted over e-tax failure

E-tax systems up and running, says Canada Revenue Agency

Bookmark on:del.icio.us| Digg it| Furl| Google| Technorati| StumbleIt| Yahoo!

Have something to say about this article?
Add a new commentLetter to the Editor
Find an inappropriate comment? You can notify the moderator by clicking the Report an innapropriate comment icon.
ADD A COMMENT
Name:*Your email address will not appear online and will be used only in the event that the editor wishes to contact you personally for additional comment.
City:
Email:
Title:*
Comment:*
* required fields
Blog Spotlight: Sandford Borins
Sandford Borins

As Professor of Strategic Management at the University of Toronto, Sandford Borins brings InterGovWorld.com readers exclusive insights into how and why the public sector is changing. You'll find new perspectives and questions, observations and objectives, lessons and answers. Cover to Cover, the blog by Prof. Sandford Borins, appears every Thursday.

Inside Cover to Cover

Unified Communications
Data Defence

Unity is a word often heard in the public sector, with myriad agencies and departments looking to foster collective thinking around some of today's most pressing issues. The word, however, doesn't usually get mentioned in the same breath as technology. That's a situation, though, that might soon be changing, thanks to a new software platform known as unified communications.

Inside the latest issue of CGR

More Resources
Driving innovation through effective service management
This white paper discusses how a service-oriented governance framework can help ensure that IT decisions are consistent with business vision, values and strategies-and that IT delivers maximum value to the business. Complimentary with registration.
IT Service Management Solutions and the service desk
This white paper presents the capabilities of IBM Tivoli CCMDB, and describes how Tivoli CCMDB extends the value of the service desk and integrates other essential ITIL processes in support of IBM Service Management. Complimentary with registration.
Stalled PCI DSS compliance efforts put Canadian organizations in limbo: Hereb�s how to get back on track
You might have long ago abandoned your efforts to achieve full PCI DSS compliance, but herebs a report that offers some helpful ideas to get back on track again. It highlights the five bsticking pointsb that typically hinders PCI DSS compliance progress and suggests how to get unglued from the mess.
Advertisement
2007 Salary Calculator
Knowledge Centres at a Glance
White Papers
read more white papers
New blog entries
Thoughts of the day
This week's top stories
Most popular stories of the week
Readers write back
Comments from Intergovworld readers
Government to government
Inside the public sector machine
Government to business
P3: Public-private partnerships
Government to citizen
e-Government service transformation
Blogs
Browse Blogs By:
WiFi Hot Spot Finder
Upload Centre
Upload Your Documents
Contribute and share with your peers by uploading:
- Initiative updates
- White Papers
- Job Links
- Events
- Other
Download Centre
Most popular downloads:
Download More Documents
Download:
- Initiative updates
- White Papers
- Job Links
Subscription Services
Manage your InterGovWorld.com account!
Change your account information, password, e-mail address, and existing e-newsletter subscriptions.
Site Feedback Survey
Tell us what you think of InterGovWorld.com!
FUN SurveyFUN Survey
Take the one-minute Family Unit Networking survey!
IT Salary Survey IT Salary Survey
Take the IT Salary Survey '06 Today
Career Resources
InterGovWorld provides links to resources for government job seekers and current employees, including: current job postings, job search strategies, career options and training, and employee rights, provided by all levels of government from everywhere across Canada.

Public Service Commission of Canada
Service Canada
Jobs in Canada
Service Canada
Public Service Human Resources Management Agency of Canada