Publishing a Customized Docker Image with Apache Hop, Workflows, and Pipelines Introduction In my previous blog on Docker container customization ( https://saikatsrecentworks.blogspot.com/2026/03/custom-hop-docker.html ), I discussed making the customized environment easily accessible. This post completes that effort by sharing the fully prepared Docker image. Docker Image Contents This Docker image is built on Alpine Linux and comes pre-installed with Java and Apache Hop. In addition, it includes all components required for my project: automated information ingestion from Microsoft Office 365 using Microsoft Graph APIs ( https://saikatsrecentworks.blogspot.com/2026/03/apache-hop-office365.html ) , powered by Python, Selenium, and Apache Hop. The purpose and implementation details of these components are explained in my earlier articles. Included components: Python 3 pip (Py3-pip) wget dpkg Chromium & Chromium Chromedriver Git Selenium and BeautifulSoup...
Posts
Customizing an Apache Hop Docker Container
- Get link
- X
- Other Apps
Cu stomizing an Apache Hop Docker Container for Automated Office 365 Authentication Introduction In my previous blog, Apache Hop implementation for selectively downloading email-attachments from MS Office365 Cloud , I focused on explaining Azure Entra Oath2 implementation using Apache hop and python. We chose to use Apache hop docker container for execution of the automated task via scheduler, following a ‘wake-up, do task, shutdown’ approach, as explained in hop documentation as a short-lived execution pattern. In this article, I describe the steps for customizing docker container, which was necessary to support the execution of Bash and Python scripts integrated within Hop pipelines and workflows Hop Container with External Dependencies Apache hop container required following external packages - Python3 Py3-pip Wget Dpkg Chromium-chromedriver Chromium Git Selenium (and bs4) for python3 Making Hop Container Ready The Apache Hop Docker image is built on a lightweight Alpin...