permitio · Taofiqq · Feb 18, 2025 · Feb 17, 2025 · Feb 18, 2025 · Feb 18, 2025
diff --git a/README.md b/README.md
@@ -1,105 +1,193 @@
 # LangChain Permit Integration
 
-Access Control components for LangChain using Permit.io: authenticate users and agents, filter prompts, protect RAG access, secure API interactions, and enforce AI responses with robust, fine-grained permission management across your AI applications.
+Combine [LangChain](https://github.com/langchain-ai/langchain) and [Permit.io](https://permit.io/) to add robust access control and permission logic to your LLM applications. This package offers:
 
-## Key Features
+- **LangChain Tools** for JWT validation and direct permission checks
+- **LangChain Retrievers** that automatically filter or retrieve only the documents your user is allowed to see (Self Query + Ensemble)
+- Simple examples and demos to showcase usage
 
-- 🔒 JWT Token Validation
-- 🛡️ Fine-grained authorization checks
-- 📄 RAG search filtering
-- 🤖 Permissions-aware RAG retrieval
+With this integration, you can:
+
+- Validate user tokens and ensure only authorized requests get access
+- Filter query results and documents by Permit’s policy logic (RBAC, ABAC, ReBAC)
+- Seamlessly embed Permit checks in a RAG pipeline or a chain/agent-based workflow
+
+---
+
+## Features
+
+1. **JWT Validation Tool**  
+   Validate JSON Web Tokens against a JWKS endpoint or an inline JWKS JSON document.
+
+2. **Permissions Check Tool**  
+   Check whether a user may perform an action on a resource, using Permit’s PDP at runtime.
+
+3. **PermitSelfQueryRetriever**  
+   A self-querying retriever that uses an LLM to parse a user’s natural language query, obtains the permitted resource IDs from Permit, and filters the vector store accordingly.
+
+4. **PermitEnsembleRetriever**  
+   Combines multiple underlying retrievers (like BM25 + vector) and then calls Permit to filter out unauthorized results.
+
+---
 
 ## Installation
 
 ```bash
 pip install langchain-permit
 ```
 
-## Configuration
-
-Set up your access control credentials:
+You’ll also need the [Permit](https://docs.permit.io/sdk/python/quickstart-python/) package if it is not already installed:
 
 ```bash
-export PERMIT_API_KEY='your-permit-api-key'
-export PERMIT_PDP_URL='a cloud or local address of your policy decision point'
-export JWKS_URL='a .well-known URL with your JSON Web Keys'  # Optional
+pip install permit
 ```
 
-## AI Access Control Four-Perimeters Framework
+## Environment Variables
 
-The integration covers the four critical perimeters for access control in AI applications:
-
-1. **Prompt Filtering**
+```bash
+PERMIT_API_KEY=your_api_key
+PERMIT_PDP_URL=http://localhost:7766   # or your real PDP
+JWKS_URL=http://localhost:3458/.well-known/jwks.json  # For JWT validation
+OPENAI_API_KEY=sk-...                 # If using OpenAI embeddings or chat models
+```
 
-   - Validate user permissions before processing AI prompts
-   - Prevent unauthorized query generation
+Before using the package, confirm that your PDP is running and that your Permit.io environment is set up with the policy configuration (resource types, roles, etc.) your application expects. See [Permit Docs](https://docs.permit.io/concepts/pdp/overview/) for more on setting up the PDP container and writing policy rules.
 
-2. **RAG Data Protection**
+## Basic Usage Examples
 
-   - Filter document retrieval based on user roles
-   - Ensure users access only permitted documents
+### JWT Validation Tool
 
-3. **Secure External Access**
+```python
+from langchain_permit.tools import LangchainJWTValidationTool
 
-   - Control API endpoint access
-   - Implement approval workflows for sensitive actions
+jwt_validator = LangchainJWTValidationTool(
+    jwks_url="http://localhost:3458/.well-known/jwks.json"  # sample URL; replace with your own JWKS endpoint
+)
 
-4. **Response Enforcement**
-   - Filter and sanitize AI-generated responses
-   - Prevent information leakage
+# In an async context:
+# claims = await jwt_validator._arun(my_jwt_token)
+# print("Decoded claims:", claims)
+```
 
-## Usage Examples
+Check out `examples/demo_jwt_validation.py` for a fully runnable script.
 
-### Permission Checking
+### Permission Check Tool
 
 ```python
-from langchain_permit import LangchainPermitTool
+from permit import Permit
+from langchain_permit.tools import LangchainPermissionsCheckTool
 
-# Initialize permission tool
-permit_tool = LangchainPermitTool(
-    user_id='user123',
-    jwt_token='user_jwt_token'
+permit_client = Permit(
+    token="permit_api_key_here",
+    pdp="http://localhost:7766",  # or your real deployment URL
 )
 
-# Check permission
-is_permitted = permit_tool.run(
-    action='read',
-    resource='financial_documents'
+permissions_checker = LangchainPermissionsCheckTool(
+    name="permission_check",
+    permit=permit_client,
 )
+
+# In an async context:
+# result = await permissions_checker._arun(
+#     user={"key": "user123"},
+#     action="read",
+#     resource={"type": "Document", "key": "doc123", "tenant": "default"}
+# )
+# print("Permission check result:", result)
 ```
 
-### RAG with Permission Filtering
+Check out `examples/demo_permissions_check.py` for a runnable demonstration.
 
-```python
-from langchain_permit import LangchainPermitRetriever
+### PermitSelfQueryRetriever
+
+A custom retriever that:
 
-# Initialize retriever with user context
-retriever = LangchainPermitRetriever(
-    user_id='finance_user',
-    jwt_token='finance_jwt_token'
+1. Fetches permitted document IDs from Permit.
+2. Uses an LLM to parse your user’s query into a structured filter (Self Query).
+3. Applies that ID-based filter to the vector store search.
+
+```python
+from langchain_openai import ChatOpenAI, OpenAIEmbeddings
+from langchain_community.vectorstores import FAISS
+from langchain_permit.retrievers import PermitSelfQueryRetriever
+
+# Suppose we have some documents (a list of LangChain Document objects)
+docs = [...]
+embeddings = OpenAIEmbeddings()
+vectorstore = FAISS.from_documents(docs, embeddings)
+
+retriever = PermitSelfQueryRetriever(
+    api_key="...",
+    pdp_url="...",
+    user={"key": "user_123"},
+    resource_type="my_resource",
+    action="view",
+    llm=ChatOpenAI(),              # a chat model parses the query; an embeddings object cannot
+    vectorstore=vectorstore,
+    enable_limit=False,
+)
+
+query = "Which docs talk about cats?"
+results = retriever.invoke(query)  # invoke() replaces the deprecated get_relevant_documents()
+for doc in results:
+    print(doc.metadata.get("id"), doc.page_content)
 ```
 
-## Advanced Configuration
+See a complete script at `examples/demo_self_query.py`.
+
+### PermitEnsembleRetriever
 
-### Custom PDP URL
+This retriever builds on LangChain's `EnsembleRetriever`: it merges the results of multiple child retrievers, then uses Permit to filter out any unauthorized docs.
 
 ```python
-permit_tool = LangchainPermitTool(
-    api_key='your-api-key',
-    pdp_url='https://custom-pdp.permit.io'
-)
+import os
+import asyncio
+from langchain_core.documents import Document
+from langchain_community.vectorstores import FAISS
+from langchain_openai import OpenAIEmbeddings
+from langchain_permit.retrievers import PermitEnsembleRetriever
+
+async def main():
+    # Sample documents
+    texts = [
+        ("doc_a", "Cats are wonderful creatures..."),
+        ("doc_b", "Dogs are quite loyal..."),
+    ]
+    docs = [Document(page_content=txt, metadata={"id": idx}) for (idx, txt) in texts]
+
+    # Vector store
+    embeddings = OpenAIEmbeddings()
+    vectorstore = FAISS.from_documents(docs, embedding=embeddings)
+    vector_retriever = vectorstore.as_retriever(search_kwargs={"k": 2})
+
+    # Ensemble with just one child retriever for simplicity
+    ensemble_retriever = PermitEnsembleRetriever(
+        api_key=os.getenv("PERMIT_API_KEY", ""),
+        pdp_url=os.getenv("PERMIT_PDP_URL"),
+        user="user_abc",
+        action="view",
+        resource_type="my_resource",
+        retrievers=[vector_retriever],  # Or pass multiple retrievers
+    )
+
+    query = "tell me about cats"
+    results = await ensemble_retriever.ainvoke(query)  # public async API; supplies the run manager itself
+
+    for i, doc in enumerate(results, start=1):
+        print(f"{i}. {doc.metadata.get('id')}: {doc.page_content}")
+
+if __name__ == "__main__":
+    asyncio.run(main())
 ```
 
+Check out `examples/demo_ensemble.py` for a more complete version.
+
 ## Requirements
 
-- Python 3.8+
-- LangChain
-- Permit.io Account
+1. Python 3.8+
+2. [Permit.io](https://app.permit.io/) Account
+3. [LangChain](https://python.langchain.com/docs/introduction/)
 
-## Contributing
+## License
 
-Contributions are welcome! Please read our [Contributing Guidelines](CONTRIBUTING.md) for details.
+This project is MIT Licensed. See [Permit.io Docs](https://docs.permit.io/) for terms related to the Permit PDP and hosted services.