Merge pull request activepieces#6830 from activepieces/fix/handbook

docs: engineering playbooks
alan-eu · Feb 10, 2025 · 6090cfa · 6090cfa
2 parents 1e8251a + 6c3f36f
commit 6090cfa
Show file tree

Hide file tree

Showing 12 changed files with 332 additions and 131 deletions.
diff --git a/docs/handbook/engineering/incident-response.mdx b/docs/handbook/engineering/incident-response.mdx
diff --git a/docs/handbook/engineering/on-call.mdx b/docs/handbook/engineering/on-call.mdx
diff --git a/docs/handbook/engineering/how-we-work.mdx → ...ok/engineering/onboarding/how-we-work.mdx b/docs/handbook/engineering/how-we-work.mdx → ...ok/engineering/onboarding/how-we-work.mdx
@@ -1,5 +1,5 @@
 ---
-title: "How We Work?" 
+title: "Engineering Workflow" 
 icon: 'lightbulb'
 ---
 

diff --git a/docs/handbook/engineering/onboarding/on-call.mdx b/docs/handbook/engineering/onboarding/on-call.mdx
@@ -0,0 +1,64 @@
+---
+title: 'On-Call'
+icon: 'phone'
+---
+
+## Prerequisites:
+- [Setup Incident IO](../playbooks/setup-incident-io)
+
+## Why On-Call?
+
+We need to ensure there is **exactly one person** at the same time who is the main point of contact for the users and the **first responder** for the issues. It's also a great way to learn about the product and the users and have some fun.
+
+<Tip>
+  You can listen to [Queen - Under Pressure](https://www.youtube.com/watch?v=a01QQZyl-_I) while on-call, it's fun and motivating.
+</Tip>
+
+<Tip>
+  If you ever feel burn out in middle of your rotation, please reach out to the team and we will help you with the rotation or take over the responsibility.
+</Tip>
+
+## On-Call Schedule
+
+The on-call rotation is managed through Incident.io, with each engineer taking a one-week shift. You can:
+- View the current schedule and upcoming rotations on [Incident.io On-Call Schedule](https://app.incident.io/activepieces/on-call/schedules)
+- Add the schedule to your Google Calendar using [this link](https://calendar.google.com/calendar/r?cid=webcal://app.incident.io/api/schedule_feeds/cc024d13704b618cbec9e2c4b2415666dfc8b1efdc190659ebc5886dfe2a1e4b)
+
+<Warning>
+Make sure to update the on-call schedule in Incident.io if you cannot be available during your assigned rotation. This ensures alerts are routed to the correct person and maintains our incident response coverage.
+
+To modify the schedule:
+1. Go to [Incident.io On-Call Schedule](https://app.incident.io/activepieces/on-call/schedules)
+2. Find your rotation slot
+3. Click "Override schedule" to mark your unavailability
+4. Coordinate with the team to find coverage for your slot
+</Warning>
+
+
+## What it means to be on-call
+
+The primary objective of being on-call is to triage issues and assist users. It is not about fixing the issues or coding missing features. Delegation is key whenever possible.
+
+You are responsible for the following:
+
+* Respond to Slack messages as soon as possible, referring to the [customer support guidelines](./customer-support.mdx).
+
+* Check [community.activepieces.com](https://community.activepieces.com) for any new issues or to learn about existing issues.
+
+* Monitor your Incident.io notifications and respond promptly when paged.
+
+<Tip>
+  **Friendly Tip #1**: always escalate to the team if you are unsure what to do.
+</Tip>
+
+## How do you get paged?
+
+Monitor and respond to incidents that come through these channels:
+
+#### Slack Fire Emoji (🔥)
+When a customer reports an issue in Slack and someone reacts with 🔥, you'll be automatically paged and a dedicated incident channel will be created.
+
+#### Automated Alerts
+Watch for notifications from:
+  - Digital Ocean about CPU, Memory, or Disk outages 
+  - Checkly about e2e test failures or website downtime
diff --git a/docs/handbook/engineering/overview.mdx b/docs/handbook/engineering/overview.mdx
@@ -0,0 +1,8 @@
+---
+title: "Overview"
+icon: "code"
+---
+
+Welcome to the engineering team! This section contains essential information to help you get started, including our development processes, guidelines, and practices. We're excited to have you on board.
+
+
diff --git a/docs/handbook/engineering/playbooks/bullboard.mdx b/docs/handbook/engineering/playbooks/bullboard.mdx
@@ -0,0 +1,91 @@
+---
+title: "Queues Dashboard"
+icon: "gauge-high"
+---
+
+The Bull Board is a tool that allows you to check issues with scheduling and internal flow runs issues.
+
+![BullBoard Overview](https://raw.githubusercontent.com/felixmosh/bull-board/master/screenshots/overview.png)
+
+## Setup BullBoard
+
+To enable the Bull Board UI in your self-hosted installation:
+
+1. Define these environment variables:
+   - `AP_QUEUE_UI_ENABLED`: Set to `true`
+   - `AP_QUEUE_UI_USERNAME`: Set your desired username
+   - `AP_QUEUE_UI_PASSWORD`: Set your desired password
+
+2. Access the UI at `/api/ui`
+
+
+<Tip>
+For cloud installations, please ask your team for access to the internal documentation that explains how to access BullBoard.
+</Tip>
+
+## Common Issues
+
+### Scheduling Issues
+
+If a scheduled flow is not triggering as expected:
+
+1. Check the `repeatableJobs` queue in BullBoard to verify the job exists
+2. Verify the job status is not "failed" or "delayed"
+3. Check that the cron expression or interval is configured correctly
+4. Look for any error messages in the job details
+
+### Flow Stuck in "Running" State
+
+If a flow appears stuck in the running state:
+
+1. Check the `oneTimeJobs` queue for the corresponding job
+2. Look for:
+   - Jobs in "delayed" state (indicates retry attempts)
+   - Jobs in "failed" state (indicates execution errors)
+3. Review the job logs for error messages or timeouts
+4. If needed, you can manually remove stuck jobs through the BullBoard UI
+
+## Queue Overview
+
+We maintain four main queues in our system:
+
+#### Scheduled Queue (`repeatableJobs`)
+
+Contains both polling and delayed jobs.
+
+<Info>
+Failed jobs are not normal and need to be checked right away to find and fix what's causing them.
+</Info>
+
+<Tip>
+Delayed jobs represent either paused flows scheduled for future execution or upcoming polling job iterations.
+</Tip>
+
+#### One-Time Queue (`oneTimeJobs`)
+Handles immediate flow executions that run only once
+
+<Info>
+- Delayed jobs indicate an internal system error occurred and the job will be retried automatically according to the backoff policy
+- Failed jobs require immediate investigation as they represent executions that failed for unknown reasons that could indicate system issues
+</Info>
+
+#### Webhook Queue (`webhookJobs`)
+
+Handles incoming webhook triggers
+
+<Info>
+- Delayed jobs indicate an internal system error occurred and the job will be retried automatically according to the backoff policy
+- Failed jobs require immediate investigation as they represent executions that failed for unknown reasons that could indicate system issues
+</Info>
+
+#### Users Interaction Queue (`usersInteractionJobs`)
+
+Handles operations that are directly initiated by users, including:
+• Installing pieces
+• Testing flows
+• Loading dropdown options
+• Executing triggers
+• Executing actions
+<Info>
+Failed jobs in this queue are not retried since they represent real-time user actions that should either succeed or fail immediately
+</Info>
diff --git a/docs/handbook/engineering/playbooks/database-migration.mdx b/docs/handbook/engineering/playbooks/database-migration.mdx
@@ -0,0 +1,94 @@
+---
+title: "Database Migrations"
+description: "Guide for creating database migrations in Activepieces"
+icon: "database"
+---
+
+Activepieces uses TypeORM as its database driver in Node.js. We support two database types across different editions of our platform.
+
+The database migration files contain both what to do to migrate (up method) and what to do when rolling back (down method).
+
+<Tip>
+Read more about TypeORM migrations here:
+https://orkhan.gitbook.io/typeorm/docs/migrations
+</Tip>
+
+## Database Support
+
+- PostgreSQL
+- SQLite
+
+<Tip>
+**Why Do we have SQLite?**
+We support SQLite to simplify development and self-hosting. It's particularly helpful for:
+
+- Developers creating pieces who want a quick setup
+- Self-hosters using platforms to manage docker images but doesn't support docker compose.
+</Tip>
+
+## Editions
+
+- **Enterprise & Cloud Edition** (Must use PostgreSQL)
+- **Community Edition** (Can use PostgreSQL or SQLite)
+
+<Tip>
+If you are generating a migration for an entity that will only be used in Cloud & Enterprise editions, you only need to create the PostgreSQL migration file. You can skip generating the SQLite migration.
+</Tip>
+
+
+### How To Generate
+
+<Steps>
+  <Step title="Uncomment Database Connection Export">
+    Uncomment the following line in `packages/server/api/src/app/database/database-connection.ts`:
+    ```typescript
+    export const exportedConnection = databaseConnection()
+    ```
+  </Step>
+
+  <Step title="Configure Database Type">
+    Edit your `.env` file to set the database type:
+
+    ```env
+    # For SQLite migrations (default)
+    AP_DATABASE_TYPE=SQLITE
+    ```
+
+    For PostgreSQL migrations:
+    ```env
+    AP_DATABASE_TYPE=POSTGRES
+    AP_POSTGRES_DATABASE=activepieces
+    AP_POSTGRES_HOST=db
+    AP_POSTGRES_PORT=5432
+    AP_POSTGRES_USERNAME=postgres
+    AP_POSTGRES_PASSWORD=password
+    ```
+  </Step>
+
+  <Step title="Generate Migration">
+    Run the migration generation command:
+    ```bash
+    nx db-migration server-api name=<MIGRATION_NAME>
+    ```
+    Replace `<MIGRATION_NAME>` with a descriptive name for your migration.
+  </Step>
+
+  <Step title="Move Migration File">
+    The command will generate a new migration file in `packages/server/api/src/app/database/migrations`. 
+    Review the generated file and:
+
+    - For PostgreSQL migrations: Move it to `postgres-connection.ts`
+    - For SQLite migrations: Move it to `sqlite-connection.ts`
+  </Step>
+
+  <Step title="Re-comment Export">
+    After moving the file, remember to re-comment the line from step 1:
+    ```typescript
+    // export const exportedConnection = databaseConnection()
+    ```
+  </Step>
+</Steps>
+
+<Tip>
+Always test your migrations by running them both up and down to ensure they work as expected.
+</Tip>