|
1 | 1 | --- |
2 | | -title: GroupDocs.Parser Product Family |
3 | | -additionalTitle: GroupDocs API References |
| 2 | +title: "GroupDocs.Parser API References" |
| 3 | +additionalTitle: "Document Parser and Extractor SDKs" |
4 | 4 | type: docs |
5 | 5 | weight: 10 |
6 | | -description: "Parse, extract images, raw & formatted text with metadata and perform a lot of operations with it using APIs which work on all popular platforms and supported file formats" |
| 6 | +description: "Explore GroupDocs.Parser API references to parse and extract text, images, metadata, and structured data from 50+ document formats (PDF, Word, Excel, PowerPoint, and more) with ease." |
| 7 | +keywords: "GroupDocs.Parser, document parser API, .NET parser, Java parser, Python parser, PDF parser, Word parser, Excel parser, text extraction, image extraction, metadata extraction, document processing SDK" |
7 | 8 | url: / |
| 9 | +date: "2025-08-26" |
| 10 | +lastmod: "2025-08-26" |
| 11 | +categories: ["Document Processing SDKSs"] |
| 12 | +tags: ["groupdocs", "parser", "extraction"] |
8 | 13 | --- |
9 | 14 |
|
10 | | -## GroupDocs.Parser for .NET |
| 15 | +This page serves as the entry point to the **GroupDocs.Parser API references**. |
11 | 16 |
|
12 | | -{{% alert color="primary" %}} |
| 17 | +**GroupDocs.Parser** is a set of powerful APIs that enables you to parse and extract text, images, metadata, and structured data from popular document formats such as **PDF, Word, Excel, PowerPoint, and more** across multiple platforms. |
13 | 18 |
|
14 | | - |
| 19 | +## Available Products |
15 | 20 |
|
16 | | -On Premise Parser APIs for .NET Framework based applications to extract data from the supported document file formats. |
| 21 | +Select your target platform below to access detailed API code documentation. |
17 | 22 |
|
18 | | -{{% /alert %}} |
| 23 | +<div style="display: flex;flex-wrap: wrap;justify-content: left;gap: 10px;padding: 10px;"> |
19 | 24 |
|
20 | | -These are links to some useful resources: |
| 25 | + <a href="/parser/net/" |
| 26 | + style="width: 190px; height: 220px;background-color: #f7f7f7;border: 1px solid #ccc;border-radius: 8px;display: flex;flex-direction: column;align-items: center;justify-content: space-around;padding: 20px;box-shadow: 0 4px 8px rgba(0, 0, 0, 0.1);transition: transform 0.3s ease;text-decoration: none;color: inherit;"> |
| 27 | + <img src="gdocs_net.png" |
| 28 | + alt="GroupDocs.Parser for .NET Icon" style="width: 70px; height: 70px; object-fit: contain;"> |
| 29 | + <h3 style="font-size: 16px; color: #333; margin: 10px 0; text-align: center;">GroupDocs.Parser for .NET</h3> |
| 30 | + <p style="font-size: 12px; color: #666; text-align: center;">Access full API references for .NET developers</p> |
| 31 | + </a> |
21 | 32 |
|
22 | | -- [GroupDocs.Parser for .NET API Reference](/parser/net/) |
| 33 | + <a href="/parser/java/" |
| 34 | + style="width: 190px; height: 220px;background-color: #f7f7f7;border: 1px solid #ccc;border-radius: 8px;display: flex;flex-direction: column;align-items: center;justify-content: space-around;padding: 20px;box-shadow: 0 4px 8px rgba(0, 0, 0, 0.1);transition: transform 0.3s ease;text-decoration: none;color: inherit;"> |
| 35 | + <img src="gdocs_java.png" |
| 36 | + alt="GroupDocs.Parser for Java Icon" style="width: 70px; height: 70px; object-fit: contain;"> |
| 37 | + <h3 style="font-size: 16px; color: #333; margin: 10px 0; text-align: center;">GroupDocs.Parser for Java</h3> |
| 38 | + <p style="font-size: 12px; color: #666; text-align: center;">Access full API references for Java developers</p> |
| 39 | + </a> |
23 | 40 |
|
| 41 | + <a href="/parser/python-net/" |
| 42 | + style="width: 190px; height: 220px;background-color: #f7f7f7;border: 1px solid #ccc;border-radius: 8px;display: flex;flex-direction: column;align-items: center;justify-content: space-around;padding: 20px;box-shadow: 0 4px 8px rgba(0, 0, 0, 0.1);transition: transform 0.3s ease;text-decoration: none;color: inherit;"> |
| 43 | + <img src="gdocs_python_net.png" |
| 44 | + alt="GroupDocs.Parser for Python via .NET Icon" style="width: 70px; height: 70px; object-fit: contain;"> |
| 45 | + <h3 style="font-size: 16px; color: #333; margin: 10px 0; text-align: center;">GroupDocs.Parser for Python via .NET</h3> |
| 46 | + <p style="font-size: 12px; color: #666; text-align: center;">Access full API references for Python developers</p> |
| 47 | + </a> |
24 | 48 |
|
25 | | -## GroupDocs.Parser for Java |
| 49 | +</div> |
26 | 50 |
|
27 | | -{{% alert color="primary" %}} |
| 51 | +## Additional Resources |
28 | 52 |
|
29 | | - |
| 53 | +### Product Overview |
30 | 54 |
|
31 | | -On Premise APIs for Java based applications to parse and extract data from the supported document file formats. |
| 55 | +**GroupDocs.Parser** is a comprehensive document parser and extractor SDKs that provides developers with powerful APIs to extract data from documents without requiring external dependencies or additional software installations. The library supports parsing and extraction from over 50 document formats including PDF, Microsoft Word, Excel, PowerPoint, OneNote, Outlook, and many more. |
32 | 56 |
|
33 | | -{{% /alert %}} |
| 57 | +#### Key Features |
34 | 58 |
|
35 | | -These are links to some useful resources: |
| 59 | +- **Text Extraction**: Extract raw or formatted text from entire documents or specific pages |
| 60 | +- **Image Extraction**: Extract images from documents with support for various image formats |
| 61 | +- **Metadata Extraction**: Retrieve document properties, creation dates, author information, and more |
| 62 | +- **Structured Data Parsing**: Extract tables, forms, and structured data using template-based parsing |
| 63 | +- **Container Extraction**: Extract attachments and embedded documents from container formats |
| 64 | +- **Cross-Platform Support**: Available for .NET, Java, and Python platforms |
| 65 | +- **No External Dependencies**: Parse documents without requiring Microsoft Office, Adobe Acrobat, or other third-party software |
36 | 66 |
|
37 | | -- [GroupDocs.Parser for Java API Reference](/parser/java/) |
| 67 | +#### Supported File Formats |
| 68 | + |
| 69 | +GroupDocs.Parser supports a wide range of document formats: |
| 70 | + |
| 71 | +- **Word Processing**: DOC, DOCX, DOT, DOTX, RTF, ODT, OTT |
| 72 | +- **Spreadsheets**: XLS, XLSX, XLSM, XLSB, CSV, ODS, OTS |
| 73 | +- **Presentations**: PPT, PPTX, PPS, PPSX, ODP, OTP |
| 74 | +- **PDF Documents**: PDF, PDF/A |
| 75 | +- **Email**: MSG, EML, EMLX, PST, OST |
| 76 | +- **Archives**: ZIP, TAR, RAR |
| 77 | +- **Other Formats**: OneNote, Markdown, EPUB, and more |
| 78 | + |
| 79 | +#### Common Use Cases |
| 80 | + |
| 81 | +- Document indexing and search engine integration |
| 82 | +- Content management systems (CMS) |
| 83 | +- Data migration and conversion projects |
| 84 | +- Document analysis and reporting |
| 85 | +- Automated document processing workflows |
| 86 | +- Text mining and content extraction |
| 87 | +- Metadata cataloging and organization |
| 88 | + |
| 89 | +### Documentation and Downloads |
| 90 | + |
| 91 | +- [Product Overview](https://products.groupdocs.com/parser/) - Learn about features, supported formats, and use cases |
| 92 | +- [Developer Documentation](https://docs.groupdocs.com/parser/) - Comprehensive guides, tutorials, and code examples |
| 93 | +- [Blog](https://blog.groupdocs.com/category/parser/) – Latest updates and tutorials |
| 94 | +- [Live Demos](https://products.groupdocs.app/parser/family) – Interactive online demo applications |
| 95 | +- [Releases & Downloads](https://releases.groupdocs.com/parser/) - Download the latest versions and release notes |
0 commit comments