Elvis Davidtechml.hashnode.net·Feb 23, 2023Apache Flink 101: Understanding the ArchitectureTime = value Introduction Data is generated from many sources, including financial transactions, location-tracking feeds, measurements from Internet of Things (IoT) devices, and web user activity. Formerly, batch processing was used to manage these ...Discuss·11 likesapache-flink
Faithful Adeda forAdematics Blogadematics.hashnode.net·Aug 1, 2022Data warehouse FundamentalsData warehouse gives businesses the ability to consolidate data from various sources, such as transactional systems, operational databases, and flat files. Through data integration, bad data removal, duplication elimination, and standardization to a ...Discuss·5 likes·132 reads#datawarehouse
Alvin Endratnoalvinend.hashnode.net·Sep 9, 2022Setup Jupyter in EC2 and Apache Spark with Delta Lake connection to S3Delta lake has been booming for the last two years after Databricks announce it as "New Generation Data Lakehouse," but behind the boom, there are not enough examples and posts of it. I want to change it by adding one article about it. This time we w...Discuss·2 likes·203 readsDelta Lakelakehouse
Deekshith Reddydeekshithreddy.hashnode.net·Mar 15, 2023Managing Redshift Data: Unloading and Restoring with EaseIn this blog post, we will explore how to unload a Redshift table to S3 and restore the CSV into the Redshift table. Introduction:Amazon Redshift is a fully-managed data warehouse service in the cloud, designed for enterprise-level data storage and...Discuss#datawarehouse
Constantin Lungucnstlungu.hashnode.net·Feb 27, 2023Filling in missing data in BigQueryData comes in a vast range of shapes and sizes for Data Engineers, with common categories being structured data (such as a database table), semi-structured (such as JSON or XML), and unstructured data (such as text or images). Naturally, the requirem...DiscussPractical BigQuerybigquery
Elvis Davidtechml.hashnode.net·Feb 23, 2023Apache Flink 101: Understanding the ArchitectureTime = value Introduction Data is generated from many sources, including financial transactions, location-tracking feeds, measurements from Internet of Things (IoT) devices, and web user activity. Formerly, batch processing was used to manage these ...Discuss·11 likesapache-flink
Richmondmachiavelli.hashnode.net·Feb 14, 2023Data Warehouse Implementation in Oracle APEX.Oracle APEX is a powerful platform that can be used for building data warehouses and performing ETL operations. A data warehouse is a large repository of data that is collected from various sources and then transformed into a structure that is optimi...Discuss·1.3K reads#oracle-apex
George Githiriggithiri.hashnode.net·Jan 29, 2023AWS RedshiftAn Overview AWS Redshift is a fast, scalable and fully managed data warehousing service provided by Amazon Web Services (AWS). It enables organizations to store and analyze petabyte-scale data at low costs and with ease. With AWS Redshift, you can qu...Discuss·40 readsAWS Redshif
Jeremiah Laniermachinesintheclouds.hashnode.net·Jan 28, 2023ETL Overview with PythonETL, or Extract, Transform, and Load, is a process used to collect and prepare data for analysis and reporting. The process involves extracting data from various sources, transforming it to fit the needs of the analysis, and loading it into a target ...Discuss·45 readsETL
A Gokulnibunar.hashnode.net·Jan 26, 2023The Evolution of Data Warehouses: From Inmon to the CloudA data warehouse is a centralized repository that allows organizations to store and analyze large amounts of data. It is designed to support the efficient querying and analysis of data and is often used in business intelligence and data mining applic...Discuss#datawarehouse
Harry Patelpatelsaheb.hashnode.net·Jan 20, 2023How to copy data from s3 to AWS Redshift table?Prerequisite AWS account AWS CLI setup in local redshift cluster created s3 bucket Please follow the below link to create a redshift cluster, setup redshift setup data into an s3 bucket download the file in your local system from this link. No...DiscussDatabases
Jeremiah Laniermachinesintheclouds.hashnode.net·Jan 20, 2023Brief Overview OF AWS Database ServicesThis will be a continuation of part one. If you haven’t read the first article please click here to read it and then come back, as there is a lot of introductory information missing here. As a quick recap, we established there are usually two types o...DiscussAWS
bhawesh mehtabhawesh.hashnode.net·Jan 19, 2023What is Grain in Data ModellingAccording to The Data Warehouse Toolkit by Kimball "The grain must be declared before choosing dimensions or facts because every candidate dimension or fact must be consistent with the grain." Some examples: The sales table has a grain of DAY, STO...Discussdata-engineering