Add Google AI Support & Refactor AI Provider Settings by skorphil · Pull Request #101 · Mikodin/obsidian-scribe

skorphil · 2026-05-03T08:37:57Z

@Mikodin I can't test with OpenAI, because i have no money there. I tested only to the moment when error insufficient funds being returned. I re-introduced custom fetcher and it seems working (it fixes CORS issues with custom openAI providers). For gemini - separate langchain adapter is used. However I didn't integrate gemini transcription yet.

related to #81 and #54

Summary

This PR adds Google AI (Gemini) support as an alternative processing provider and refactors the AI provider settings architecture for better maintainability and extensibility. The changes enable users to choose between OpenAI, Google AI, and custom OpenAI-compatible endpoints for transcription and LLM processing.

Key Changes

🆕 Google AI Support

New utility file: src/util/geminiAiUtils.ts
- Implements summarizeTranscriptGemini() for transcript summarization using Google's Gemini models
- Implements llmFixMermaidChartGemini() for mermaid chart repair
- Exports LLM_MODELS enum with available Gemini models (gemini-2.5-pro, gemini-2.5-flash, etc.)
New settings: Added googleAiApiKey and googleModel to ScribePluginSettings
New dependency: Added @langchain/google-genai package

🔧 Obsidian Fetcher (CORS Fix)

New utility file: src/util/obsidianFetch.ts
- Custom fetch implementation wrapping Obsidian's requestUrl() API
- Solves CORS issues with OpenAI-compatible providers (e.g., Fireworks, local LLMs)
- Properly handles form data, data: URLs, and request/response conversion
- Integrated into both OpenAI SDK and LangChain's ChatOpenAI configurations

🏗️ Settings Architecture Refactor

Replaced AiModelSettings.tsx with modular settings structure:
- New ai-provider-settings-tab/ folder with:
  - ProviderSettingsTab.tsx - Main tab with platform selectors
  - ProviderSettingsSections.tsx - Modular sections per provider (OpenAI, Gemini, Custom)
  - index.ts - Re-exports
New platform enums:
- PROCESS_PLATFORM: openAi, google, customOpenAi
- TRANSCRIPT_PLATFORM: Added customOpenAi option
Removed: useCustomOpenAiBaseUrl boolean (replaced by processPlatform enum)
Automatic migration: src/settings/migration.ts detects the old useCustomOpenAiBaseUrl: true flag and automatically sets processPlatform and transcriptPlatform to customOpenAi on first load — no manual reconfiguration needed

⚡ Core Logic Refactoring (`src/index.ts`)

Platform-aware processing: Replaced if/else chains with switch statements for:
- Transcription platform selection (OpenAI, AssemblyAI, Custom OpenAI)
- LLM processing platform selection (OpenAI, Google, Custom OpenAI)
- Mermaid chart fixing
Improved error handling: Better error messages, type-safe error handling, and validation checks
Early validation: Checks for valid platform selection before processing

🛠️ Other Improvements

TypeScript: Added skipLibCheck: true to tsconfig.json to prevent type errors from dependencies
Code organization: Better separation of concerns between platform-specific logic
Type safety: Removed definite assignment assertion by properly initializing controlModal

Testing Checklist

Breaking Changes

Settings schema change: The useCustomOpenAiBaseUrl boolean setting is removed and replaced with the processPlatform enum. Existing settings are automatically migrated on first load — base URL, API key, and custom model names are preserved as-is.

Future plans

Add note to UI, that Currently only models with schema output supported
Add different API keys settings for OpenAI and Custom OpenAI
Add error state to Notice (which closed by X)
Add Notice to obsidianFetch errors
MAYBE separate fixing mermaid AI settings from processing. i.e: mermaidPlatform
Fetch list of models from openAi and gemini. NOT hardcode that list

…emini prompts - Add `obsidianFetch` utility to wrap Obsidian's `requestUrl`, bypassing CORS restrictions for OpenAI-compatible providers. - Integrate `obsidianFetch`

…schemas - Update `llmFixMermaidChartGemini` to use explicit HumanMessage and enforce real newline characters in the output. - Add debug logging to Gemini summarization and mermaid chart fixing functions. - Modify `obsidianFetch` to strip `$schema` and `title` from JSON schemas to ensure compatibility with providers like Groq. - Remove unused

Introduce `migrateSettings` to handle updates to the plugin configuration structure, ensuring backward compatibility when loading saved user data.

skorphil added 13 commits September 30, 2025 11:12

init package

b27b999

Merge remote-tracking branch 'upstream/main'

95d92bf

Merge remote-tracking branch 'upstream/main'

a7783c4

feat: refactored provider settings

ed05c39

refactor: updated langchain version

ff79442

feat: Implemented gemini processing of transcribed text

672c086

feat: Implemented gemini processing of transcribed text

bd5402d

fix: changed inputs in ai provider settings tab

5e41748

Merge remote-tracking branch 'upstream/main'

228fdc7

Merge branch 'main' into gemini-provider

fcf1a5a

feat(ai): implement obsidianFetch to resolve CORS issues and refine G…

763267f

…emini prompts - Add `obsidianFetch` utility to wrap Obsidian's `requestUrl`, bypassing CORS restrictions for OpenAI-compatible providers. - Integrate `obsidianFetch`

chore(settings): implement settings migration logic

47fcd2f

Introduce `migrateSettings` to handle updates to the plugin configuration structure, ensuring backward compatibility when loading saved user data.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Google AI Support & Refactor AI Provider Settings#101

Add Google AI Support & Refactor AI Provider Settings#101
skorphil wants to merge 13 commits into
Mikodin:mainfrom
skorphil:gemini-provider

skorphil commented May 3, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

skorphil commented May 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Key Changes

🆕 Google AI Support

🔧 Obsidian Fetcher (CORS Fix)

🏗️ Settings Architecture Refactor

⚡ Core Logic Refactoring (src/index.ts)

🛠️ Other Improvements

Testing Checklist

Breaking Changes

Future plans

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

skorphil commented May 3, 2026 •

edited

Loading

⚡ Core Logic Refactoring (`src/index.ts`)