Skip to main content

Posts

Showing posts from May, 2022

Building a Data Lake on Amazon Simple Storage Service

Amazon Simple Storage Service (S3) is a cloud-based data storage service that stores data in its native format. Data durability of S3 is always at a high of 99.999999999 (11 9s), and the data regardless of the volume is stored in a fully secured and safe ecosystem. In Amazon S3, data files that contain metadata and objects are stored in buckets for uploading. For metadata and files, the object is to be uploaded to S3. After this step, permissions can be granted on the metadata or related objects stored in the buckets. Many competencies can be used when an S 3 data lake   is built on Amazon S3. These include media data processing applications, Artificial Intelligence (AI), Machine Learning (ML), big data analytics, and high-performance computing (HPC). When all these are used in conjunction, businesses get access to critical data, business intelligence, and analytics from S3 data lake and unstructured data sets. There are several benefits of the S3 data lake. The first is different c

The Evolution of Technology of Oracle Change Data Capture

Oracle change data capture ( CDC) was first launched with the 9i version as an in-built tool of the Oracle database. It was a tool that recorded and monitored all changes made in the user tables in a database. These changes were then stored in change tables and used in ETL applications for later processing and transferring to other data warehouses and databases. The release version of Oracle change data capture   had triggers placed in the source database. However, database administrators found this technology very invasive and did not favor it. Ultimately, Oracle changed the Oracle change data capture   technology and released it with the 10g version after naming it Oracle Streams.  The working of this release was different. Oracle change data capture   used the redo logs of the source database along with a replication tool of Oracle Streams. This technology turned out to be very successful and a highly optimized method to identify and move change data to a target database without af