Workflow Scheduling

Internal scheduling is a mission-critical process, managed with enterprise-grade tools designed for reliability, observability, and dependency management. The industry-standard tool for this in modern data engineering is Apache Airflow; similar tools include Prefect, Dagster, and enterprise software such as Control-M.
Dependency Management: This is the #1 reason to use such a tool. An index rebalance isn't one script; it's a sequence of tasks (a Directed …
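To make the dependency-management point concrete, here is a minimal sketch that uses only the Python standard library (`graphlib`, Python 3.9+). It is not Airflow code, and the task names are hypothetical stand-ins for the steps of a rebalance. It assumes the dependencies contain no cycles.

```python
from graphlib import TopologicalSorter

# Hypothetical tasks: each key maps to the set of tasks it depends on.
dependencies = {
    "fetch_prices": set(),
    "fetch_corporate_actions": set(),
    "validate_data": {"fetch_prices", "fetch_corporate_actions"},
    "calculate_weights": {"validate_data"},
    "publish_index": {"calculate_weights"},
}


def run_order(deps):
    """Return one valid execution order in which every task follows its dependencies."""
    return list(TopologicalSorter(deps).static_order())


order = run_order(dependencies)
for task in order:
    print("running", task)
```

A real scheduler adds retries, alerting, and run history on top of this ordering, which is the part a hand-rolled script chain usually lacks.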

Advancing Python Knowledge_2

Project Packaging & Metadata
setup.py: Used to package and distribute Python projects.
Build tool: Use setuptools to define package name, version, dependencies, etc.
Semantic Versioning: Follows MAJOR.MINOR.PATCH, e.g., 1.0.0.
Special package attributes:
Logging
Import logger directly: from logging import Logger
Common methods of a logging.Logger instance: debug(), info(), warning(), error(), critical()
Logger setup best practices (setup_logging) …
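The excerpt names a `setup_logging` helper without showing it. The following is only a sketch of what such a helper might look like, not the post's actual implementation. The parameter names, the format string, and the choice to disable propagation are all assumptions.

```python
import logging
import sys


def setup_logging(name: str = "app", level: int = logging.INFO) -> logging.Logger:
    """Configure and return a named logger (hypothetical helper)."""
    logger = logging.getLogger(name)
    logger.setLevel(level)
    # Attach a handler only once so repeated calls do not duplicate output.
    if not logger.handlers:
        handler = logging.StreamHandler(sys.stderr)
        handler.setFormatter(
            logging.Formatter("%(asctime)s %(levelname)s %(name)s: %(message)s")
        )
        logger.addHandler(handler)
    # Assumption: keep messages from also reaching the root logger's handlers.
    logger.propagate = False
    return logger


logger = setup_logging("demo", logging.DEBUG)
logger.debug("debug message")
logger.info("info message")
logger.warning("warning message")
logger.error("error message")
logger.critical("critical message")
```

Guarding on `logger.handlers` is one common way to keep the function safe to call from several modules.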

Future Working Relationship: From Employment to Partnership

The traditional employer-employee dynamic is quietly but meaningfully evolving. Power is shifting, tilting away from corporations and toward elite individual contributors. In a world increasingly shaped by innovation and intellectual leverage, top-tier talent is beginning to resemble Hollywood stars more than factory line workers.
The Rise of the “Star” Engineer and Creator
Much like how movie …

Architecture Diagram of Indexing Engine

A high-level architectural diagram to visualize the system
/index-platform/
├── 📁 docs/
│   ├── methodology_guides/
│   ├── api_documentation.md
│   └── architecture.md
│
├── 📁 libs/ (Shared Libraries & Core Logic)
│   ├── 📁 data-connectors/ (Code to talk to Bloomberg, Refinitiv, etc.)
│   │   ├── bloomberg.py
│   │   └── refinitiv.py
│   ├── 📁 index-methodologies/ (The "Rules …

Design Patterns Content Outline

2. CREATIONAL DESIGN PATTERNS
0:07:22 1. Singleton
0:22:24 2. Factory method
0:32:20 3. Abstract factory
0:43:04 4. Builder
0:54:46 5. Prototype
3. STRUCTURAL DESIGN PATTERNS
1:04:05 1. Adapter
1:15:23 2. Bridge
1:24:00 3. Composite
1:32:38 4. Decorator
1:43:14 5. Facade
1:52:49 6. Flyweight
2:06:00 7. Proxy
4. BEHAVIOURAL DESIGN PATTERNS
2:16:40 1. Chain of responsibility …

Transition from writing scripts to building complex applications

First, what are the key differences between scripts and applications?
Scripts: linear execution flow; single responsibility; minimal error handling; direct database access; hardcoded values; limited user interaction.
Applications: logging and monitoring; modular architecture; multiple components working together; robust error handling; configuration management; user interfaces (CLI, Web, API).
Essential Concepts for Application Development
a. Project Structure …
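As a rough illustration of the contrast above, the sketch below takes a task a script would do inline (counting rows in a file) and gives it three of the application traits listed: configuration instead of hardcoded values, logging, and explicit error handling. The environment variable names, the `Config` fields, and the function names are all hypothetical.

```python
import logging
import os
from dataclasses import dataclass

logger = logging.getLogger("report_app")  # hypothetical application name


@dataclass(frozen=True)
class Config:
    input_path: str
    max_rows: int


def load_config(env=os.environ) -> Config:
    """Read settings from the environment instead of hardcoding them."""
    return Config(
        input_path=env.get("APP_INPUT_PATH", "data.csv"),
        max_rows=int(env.get("APP_MAX_ROWS", "100")),
    )


def count_rows(lines, max_rows: int) -> int:
    """Count non-blank lines, capped at max_rows."""
    return min(sum(1 for line in lines if line.strip()), max_rows)


def main(env=os.environ) -> int:
    """Return a process exit code: 0 on success, 1 on a read failure."""
    logging.basicConfig(level=logging.INFO)
    config = load_config(env)
    try:
        with open(config.input_path, encoding="utf-8") as handle:
            rows = count_rows(handle, config.max_rows)
    except OSError as exc:
        logger.error("Could not read %s: %s", config.input_path, exc)
        return 1
    logger.info("Processed %d rows", rows)
    return 0
```

Passing `env` as a parameter keeps the functions testable without touching the real environment; an entry point would typically finish with `raise SystemExit(main())`.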

NewsSearchServer Diving Deep

What is it? NSS is an HTTP service that takes search criteria, generates results from them, and returns those results to the requester. The search input format: a POST to http://<host>/news/search?params, with the POST data being the search metadata plus the search criteria. Search meta information sample: {"databases":[ {"databases":["BW"], "start_date":"20090901", "end_date":"20090905"}, {"databases":["FRC"]} ], "returned_fields":["headline","story_date","story_time","product"], "search_timeout":28000, "max_docs":200, "sort_spec":"story_date ASC", …
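The request format described above can be sketched with the standard library. This only builds the request object and does not send it. The host name is a placeholder, the `Content-Type` header is an assumption, and the payload contains only the fields visible in the truncated sample; the query parameters and the search-criteria portion are not shown in the excerpt, so they are left out.

```python
import json
from urllib.request import Request

# Only the metadata fields visible in the sample; the rest of it is truncated.
search_meta = {
    "databases": [
        {"databases": ["BW"], "start_date": "20090901", "end_date": "20090905"},
        {"databases": ["FRC"]},
    ],
    "returned_fields": ["headline", "story_date", "story_time", "product"],
    "search_timeout": 28000,
    "max_docs": 200,
    "sort_spec": "story_date ASC",
}


def build_search_request(host: str, meta: dict) -> Request:
    """Build (but do not send) a POST request carrying the search metadata."""
    url = f"http://{host}/news/search"
    body = json.dumps(meta).encode("utf-8")
    # Assumption: the service accepts a JSON body with this content type.
    return Request(
        url,
        data=body,
        method="POST",
        headers={"Content-Type": "application/json"},
    )


request = build_search_request("nss.example.invalid", search_meta)
print(request.get_method(), request.full_url)
```

Sending it would be a call to `urllib.request.urlopen(request)` against a real host.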