Skip to content

gaowatch/veyranova

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Blind Spatial Protocol (BSP)

Pure-Text LLM Desktop GUI Control via ElementMap & UIA Element ID

📄 Preprint Paper

✨ Core Innovations (Eliminate Coordinate Hallucination at Root)

  1. ElementMap Blind Operation Protocol: A structured element mapping mechanism that lets pure-text LLMs directly reference UIA element IDs to locate GUI components
  2. Zero Coordinate Guessing: Fundamentally eliminates the coordinate hallucination problem that plagues all vision-based agents, ensuring 100% operation accuracy
  3. Vision-Free & Privilege-Free: No screenshots, no elevated system privileges, accessible to all ordinary users out of the box
  4. Complete Tool-Call Parsing Pipeline: End-to-end parsing of LLM instructions to ensure accurate execution of operations
  5. Constitutional-Level Security Pre-Check: Built-in inviolable security rules to prevent malicious operations and protect system safety
  6. Pure-Text LLM Native Support: Compatible with any pure-text large language model, no multimodal model required

🎯 Related Project

This paper corresponds to the open-source desktop agent project VeyraNova, which is under active development, with its core implementation based on the BSP protocol.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors