IMG-OUR
OPINION

Cloud Migration Challenges posed by the legacy technical Debt

IMG-INTRO
IMG-INTRO
IMG-OUR

Rajendra Chaudhari

16 April 2021

The demands of the competitive nature of business in terms of data driven decision making related both to the customer reach and the operational efficiency are driving majority of businesses to cloud migration for data analytics and insights. The legacy environments are not able to offer the scale needed and the agility to scale, the flexibility needed to respond to the constantly changing velocity and veracity of custom access needs, and of course, the infrastructure costs are ballooning. On the other hand, the cloud environment offers the scalability, the agility and the flexibility at significantly lower costs.

Legacy system complexity is driven by proprietary and custom EDWs and ETLs

On the cloud end of this migration, the cloud ecosystem is brim with offerings to help make this journey smooth for businesses, specifically when it comes to applications and data. The biggest challenges the businesses face, however, are on the legacy side where the legacy systems are being ever so difficult to manage, especially the custom built proprietary ETLs and EDWs of the past. First of all, these systems offer the puzzling duality of being mission critical even today for our business users while at the same time being extremely fragile to change or enhancements. The resulting technical debt has truly become the Achilles heel for the organizations wanting to break free and move the richer cloud lands.

Legacy did not evolve in a structured manageable manner

There are several reasons for these challenges, of course hardly any of which can be attributed to any errors in their management, but rather to the nature of evolution of these legacy systems. For example,

  • Heterogeneity of the systems and workloads across business functions: Typical examples are proprietary ETLs such as Informatica or DataStage, Databases and Data Warehouse such as Teradata or Netezza, scripts such as BTEQ scripts, specialized jobs, stored procedures, proprietary reporting scripts etc.

  • Isolated and distributed nature of the systems: Most organizations will have their data, reporting and analytics scripts evolve in different systems over different time periods. E.g. sales leads and logistics reports can be in different systems and accessed through different batch processes. This also leads to issues related to non-standardization, disparity of terminologies related to business semantics as well as lineage issues.

  • Customizations over the years: On multiple occasions, the systems are customized to suit the needs of the business, for example, business logic is embedded into scripts and stored procedures.

  • Lack of documentation of value: More often than naught, either these implementations and customization are out of date on documentation or have documentation that does not really help explain the behavior of the systems

  • Non availability of the resources responsible for building these systems: Most resource that implemented these systems are either not with the organization anymore or have been reassigned to other important business within.

  • Limitations in accessing new resources who can decipher these systems: The resource issue is not so easy to resolve either, as such resources are not available for hire in this age. For example, engineers or developers who can work on VSAM or ISAM scripts on mainframes.

IMG-INTRO
IMG-INTRO

Concerns migrating an unknown

These challenges pose several difficulties and lead to questions in the minds of the business executives as well as the data architects, viz.

  • Would we be able to migrate all of our legacy to the cloud successfully? Can we understand the complexities involved in this migration? What level of success can we expect?

  • Would we be able to get some information on each of the workloads to understand the ROI and prioritize which workloads we migrate first?

  • Would we be able to optimize the workloads for the new systems for long term savings as we use the migrated workloads in future?

  • Can we estimate the time, effort and cost needed to migrate from these legacy systems?

  • How would we cost effectively validate that the migration is indeed successful?

  • Where would we find the resources needed to perform this migration?

It is not that this spells doom to the migration initiatives that the businesses would like to undertake, though this remains the single most important challenge to address and it surely is the very first one the businesses will come across.

We are not helpless

In the next article, we will take a look at the potential solutions that can reduce this technical debt by providing detailed insights about these legacy systems and help the migration initiatives of the businesses get off to a solid start.