Turn Unstructured Documents into Verifiable AI Insights
Clarisearch™ is a proprietary pipeline that converts very large case datasets into a structured, evidence-linked JSON at a fixed, predictable cost—so any LLM can generate professional reports with verifiable citations back to the original sources. Our patent-pending technology eliminates context window limits and AI hallucinations for good.
A Simple, Asynchronous Workflow
The Clarisearch™ API is designed for secure, large-scale processing using our proprietary fixed-cost pipeline. Submit your documents and get notified when your structured Evidence Packet is ready. Click each step below to see the technical details.
Initiate Job
Securely submit your research query and a list of pre-signed URLs for your documents.
We Process
Our system downloads, analyzes, and synthesizes the data, then permanently deletes its copies of your source files. Your originals stay in your own storage.
Receive Results
Get notified via webhook or polling, and download your completed Evidence Packet.
The Evidence Packet Explorer
The result of every job is a rich, structured JSON Evidence Packet. This becomes your single source of truth, ready for any LLM. Explore the data from a sample packet below.
Packet Analysis
Evidence Strength Distribution
A qualitative assessment of the evidence's probative value.
Filter by Thematic Tags
Click tags to filter the evidence list.
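The two explorer views above can be reproduced on a downloaded packet with a few lines of code. This is a minimal sketch only: the packet schema is not shown on this page, so the `strength` and `tags` field names and the sample items are hypothetical, and only `evidence_id` is a documented field.

```python
from collections import Counter

def strength_distribution(items):
    """Count evidence items per strength label (the "strength" field is assumed)."""
    return Counter(item.get("strength", "unknown") for item in items)

def filter_by_tags(items, selected_tags):
    """Keep items that carry every selected tag; an empty selection keeps all items."""
    wanted = set(selected_tags)
    return [item for item in items if wanted <= set(item.get("tags", []))]

# Hypothetical sample items; a real Evidence Packet may be shaped differently.
sample_items = [
    {"evidence_id": "evidence-demo-0", "strength": "strong", "tags": ["timeline"]},
    {"evidence_id": "evidence-demo-1", "strength": "moderate", "tags": ["testimony", "timeline"]},
    {"evidence_id": "evidence-demo-2", "strength": "strong", "tags": ["training"]},
]

distribution = strength_distribution(sample_items)
timeline_items = filter_by_tags(sample_items, ["timeline"])
```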
Build Powerful Applications
The Clarisearch™ Evidence Packet is a versatile asset. Here's how our clients are using the API to build innovative tools and products powered by our proprietary technology.
Automated Operational Summaries for Firm Management
A large firm needs high-level overviews of active cases. Their internal dashboard uses the API to analyze new documents weekly, generating custom summaries from the Evidence Packet that align with the firm's priorities.
Business Value:
- Efficiency: Standardized, digestible summaries delivered automatically.
- Custom Insights: Reports are tailored to the firm's unique methodology.
- Strategic Oversight: Better decisions based on consistent, evidence-backed reporting.
Verifiable AI in Action
The key to trustworthy AI is verifiable output. The Clarisearch™ system makes this possible through our proprietary evidence-linking technology. Click any [Source:...] tag below to see the underlying evidence.
Case Status Memo
Case Summary
This case involves multiple safety violations by TEST PIPE, INC. at an inspection site located at 1492 PENGUIN WAY, ARMADILLO, LONE STAR [Source: evidence-FlSR067LOJJEaoezVh63-6]. An OSHA citation details serious issues, including failure to provide adequate protection from hazards and a lack of proper employee training [Source: evidence-FlSR067LOJJEaoezVh63-6]. These actions constitute a significant breach of the company's duty of care.
Key Factual Findings
- The incident in question occurred on or about Spring 2021 [Source: evidence-FlSR067LOJJEaoezVh63-0].
- Dr. Flory was deposed and provided testimony regarding the events [Source: evidence-FlSR067LOJJEaoezVh63-1].
- The patient's initial complaint was recorded as a cough [Source: evidence-FlSR067LOJJEaoezVh63-2].
- A key issue is whether the standard of care was met, which is a central point of contention [Source: evidence-FlSR067LOJJEaoezVh63-3].
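One way to check a generated memo like the one above is to confirm that every [Source: ...] tag resolves to an item in the Evidence Packet. The sketch below assumes the packet exposes a list of items that each carry an `evidence_id`; the function names and the list shape are illustrative, not part of the Clarisearch™ API.

```python
import re

# Matches tags such as "[Source: evidence-FlSR067LOJJEaoezVh63-6]".
SOURCE_TAG = re.compile(r"\[Source:\s*([^\]\s]+)\s*\]")

def cited_ids(report_text):
    """Return the evidence IDs cited in the report, in order, without duplicates."""
    seen = []
    for evidence_id in SOURCE_TAG.findall(report_text):
        if evidence_id not in seen:
            seen.append(evidence_id)
    return seen

def unresolved_citations(report_text, packet_items):
    """Return cited IDs that have no matching item in the packet."""
    known = {item["evidence_id"] for item in packet_items}
    return [eid for eid in cited_ids(report_text) if eid not in known]
```

An empty result from `unresolved_citations` means every citation in the memo points at evidence that exists in the packet. It does not by itself show that the cited evidence supports the sentence.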
Core Benefits and Uncompromising Security
The Clarisearch™ API provides the power of large-scale AI analysis through our proprietary, patent-pending pipeline with predictable costs and an uncompromising commitment to data security.
Predictable Cost
Avoid volatile AI bills. Our patent-pending pipeline technology delivers fixed, predictable costs for deep document analysis.
No Context Windows
Our proprietary pre-processing creates an Evidence Packet that eliminates the context window limitations of modern LLMs.
Zero Hallucinations
Every data point in the Evidence Packet is linked to its source with a unique evidence_id, guaranteeing factually grounded output.
Data Custody
You own your data. We use temporary, pre-signed URLs and never store your original files.
Important: Use of the Clarisearch™ API is subject to our Master Service Agreement, which includes confidentiality obligations and prohibitions on reverse engineering, benchmarking, or attempting to derive our proprietary methods. All feedback and suggestions become the property of DepoGenius.
Developer Documentation
Complete technical documentation for integrating with the Clarisearch™ API. Follow these step-by-step instructions to start processing documents and generating Evidence Packets programmatically.
Getting Started
1. Obtain Your API Key
To use the Clarisearch™ API, you'll need an API key. Contact our team to get your unique API key, which will have the format csk_xxxxxxxxx.
Security Note: Store your API key securely and never expose it in client-side code. All API requests must be made from your server environment.
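A common way to follow this note is to keep the key in a server-side environment variable and fail fast if it is missing. The sketch below is one possible approach: the variable name `CLARISEARCH_API_KEY` is a hypothetical choice, and the only check taken from this page is the `csk_` prefix.

```python
import os

def load_api_key(env_var="CLARISEARCH_API_KEY", environ=None):
    """Read the API key from the server environment and sanity-check its prefix.

    env_var is a hypothetical name; use whatever your deployment defines.
    """
    environ = os.environ if environ is None else environ
    key = environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(f"{env_var} is not set")
    if not key.startswith("csk_"):
        raise ValueError("API key does not match the csk_... format")
    return key
```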
2. Secure Your Files (Critical)
Clarisearch™ uses a zero-trust security model. Your files remain in your own secure storage, and you provide temporary pre-signed URLs for access. We never request permanent credentials or ask you to make files public.
✅ Secure Approach: Pre-Signed URLs
- Files remain private in your storage system
- URLs are time-limited (4+ hours recommended)
- Each URL grants access to exactly one file
- Complete audit trail and control
❌ Never Do This:
- Don't make files publicly accessible
- Don't share permanent storage credentials
- Don't use URLs that expire too quickly
Quick Example (Google Cloud Storage):
See the complete security guide with examples for all major cloud providers below.
3. Base URL
All API requests should be made to:
Replace the endpoint suffix (e.g., submitJob, getJobStatus) based on the specific operation you're performing.
Authentication
All API requests require authentication using your API key in the Authorization header:
✅ Correct Format
❌ Incorrect Formats
API Endpoints
Submit Research Job
Initiates a new document analysis job. Provide your research query and a list of pre-signed URLs to your documents.
Endpoint:
Request Body:
Required Fields:
- query: Your research question or analysis request
- files: Array of file objects with fileName and downloadUrl
- files[].downloadUrl: Pre-signed GET URL that expires no sooner than 4 hours
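The required fields above can be assembled and checked before a job is submitted. The following is a minimal sketch that only builds the JSON body from the three documented fields. It does not send the request, and the rule that `files` must be non-empty is an assumption rather than something this page states.

```python
import json

def build_submit_payload(query, files):
    """Build the request body from the documented fields: query, files[].fileName, files[].downloadUrl."""
    if not query or not query.strip():
        raise ValueError("query must be a non-empty string")
    if not files:
        # Assumption: a job with no files is not meaningful.
        raise ValueError("files must contain at least one entry")
    entries = []
    for entry in files:
        name = entry.get("fileName")
        url = entry.get("downloadUrl")
        if not name or not url:
            raise ValueError("each file needs both fileName and downloadUrl")
        entries.append({"fileName": name, "downloadUrl": url})
    return {"query": query.strip(), "files": entries}

# Example with a placeholder URL; a real downloadUrl is a pre-signed GET URL.
payload = build_submit_payload(
    "Summarize the safety violations cited by OSHA.",
    [{"fileName": "citation.pdf", "downloadUrl": "https://storage.example.invalid/citation.pdf?sig=abc"}],
)
body = json.dumps(payload)
```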
Success Response (202 Accepted):
Error Responses:
Get Job Status
Retrieve the current status and progress of a submitted job. Supports efficient polling with ETag caching to minimize unnecessary requests.
Endpoint:
Headers (Recommended for Efficiency):
Include the ETag from your previous response to receive a 304 Not Modified when the job status hasn't changed, saving bandwidth and processing time.
Success Response (200 OK):
Completed Job Response:
Job Status Values:
Job accepted, waiting to start
Files being downloaded and prepared
AI analysis in progress
Evidence Packet ready for download
Polling Best Practices
Efficient Status Monitoring
1. Use ETags for Efficient Polling
Always include the If-None-Match header with the ETag from your previous response. This allows the server to return a 304 Not Modified when nothing has changed.
2. Follow Adaptive Polling Intervals
Use the nextRecommendedPollInSeconds value from the API response to optimize your polling frequency and avoid unnecessary requests.
3-30 seconds
60-600 seconds
Stop polling
3. Handle Rate Limits Gracefully
If you receive a 429 Too Many Requests response, check the Retry-After header and wait the specified number of seconds before retrying.
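The three practices above can be combined into a single polling loop. This is a sketch under stated assumptions, not the official client: `fetch` is a function you supply that performs the status request (sending If-None-Match when an ETag is given) and returns `(status_code, headers, body)`, and `is_finished` is your own test for a terminal job state, since the status names are not reproduced on this page.

```python
import time

def poll_job(fetch, is_finished, sleep=time.sleep, max_polls=1000):
    """Poll a job until is_finished(body) returns True.

    fetch(etag) -> (status_code, headers, body); both callables are supplied by the caller.
    """
    etag = None
    delay = 5  # fallback interval in seconds until the API recommends one (assumed value)
    for _ in range(max_polls):
        code, headers, body = fetch(etag)
        if code == 429:
            # Rate limited: wait the number of seconds given in Retry-After.
            sleep(int(headers.get("Retry-After", delay)))
            continue
        if code == 304:
            # Nothing changed since the last ETag; wait and ask again.
            sleep(delay)
            continue
        if code != 200:
            raise RuntimeError(f"unexpected status code {code}")
        etag = headers.get("ETag", etag)
        if is_finished(body):
            return body
        delay = body.get("nextRecommendedPollInSeconds", delay)
        sleep(delay)
    raise TimeoutError("job did not finish within max_polls requests")
```

Passing `sleep` as a parameter keeps the loop easy to test and lets you swap in a scheduler or async wait in production.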
Error Handling
Common Error Scenarios
Authentication Errors
Validation Errors
Handling File-Level Failures
Individual file failures don't cause the entire job to fail. Check the progress.failedFiles count and errors array in the job status response for details.
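A small helper can surface partial failures after each status check. The sketch below assumes `errors` is a top-level array in the status response and makes no assumption about what each error entry contains. The response shape is not shown on this page, so treat both as hypotheses to verify against a real response.

```python
def failed_file_report(status_body):
    """Summarize file-level failures from a job status response body (a dict)."""
    progress = status_body.get("progress") or {}
    failed = progress.get("failedFiles", 0)
    errors = list(status_body.get("errors") or [])
    return {
        "failedFiles": failed,
        "errors": errors,
        "hasFailures": failed > 0 or bool(errors),
    }
```

A job can therefore complete successfully while `hasFailures` is true, in which case the Evidence Packet covers only the files that were processed.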
Secure File Access Guide
Zero-Trust Security Model
Clarisearch™ implements a zero-trust security model. We never request permanent access to your storage systems or ask you to make files publicly accessible. Your sensitive documents remain in your control at all times.
✅ How It Works
1. You generate temporary, pre-signed URLs
2. URLs grant access to specific files only
3. URLs automatically expire (4+ hours recommended)
4. Clarisearch downloads and processes files
5. Original files remain in your private storage
6. You maintain complete audit trail
🔒 Security Benefits
- No permanent credential sharing
- Time-limited access automatically expires
- File-specific permissions (no bucket access)
- SOC 2, HIPAA, GDPR compliant
- Complete data custody retention
- Zero data retention by Clarisearch
Implementation by Cloud Provider
Google Cloud Storage (Recommended)
Required IAM Permission:
Grant your service account: roles/storage.objectViewer
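With that role in place, a signed URL can be generated per file. The sketch below assumes the `google-cloud-storage` Python client and takes an existing `Blob` object, calling its `generate_signed_url` method. The helper name, the 6-hour default, and the bucket and object names in the usage comment are illustrative choices, not values from this page.

```python
from datetime import timedelta

def gcs_file_entry(blob, hours=6):
    """Return a {fileName, downloadUrl} entry for one GCS blob.

    blob is a google.cloud.storage.Blob; 6 hours leaves margin over the 4-hour minimum.
    """
    url = blob.generate_signed_url(
        version="v4",
        expiration=timedelta(hours=hours),
        method="GET",
    )
    # Use the last path segment of the object name as the file name.
    return {"fileName": blob.name.rsplit("/", 1)[-1], "downloadUrl": url}

# Usage (hypothetical bucket and object names):
#   from google.cloud import storage
#   blob = storage.Client().bucket("my-bucket").blob("clarisearch/depo.pdf")
#   entry = gcs_file_entry(blob)
```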
Amazon S3
Required S3 Permission:
IAM policy with s3:GetObject on specific paths
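The equivalent step on S3 uses a pre-signed `get_object` URL. This sketch assumes the `boto3` S3 client and its `generate_presigned_url` method. The helper name and the 6-hour default are illustrative, and the bucket and key in the usage comment are hypothetical.

```python
def s3_file_entry(s3_client, bucket, key, hours=6):
    """Return a {fileName, downloadUrl} entry for one S3 object.

    s3_client is a boto3 S3 client created by the caller, e.g. boto3.client("s3").
    """
    url = s3_client.generate_presigned_url(
        "get_object",
        Params={"Bucket": bucket, "Key": key},
        ExpiresIn=hours * 3600,  # seconds
    )
    # Use the last path segment of the key as the file name.
    return {"fileName": key.rsplit("/", 1)[-1], "downloadUrl": url}

# Usage (hypothetical bucket and key):
#   import boto3
#   entry = s3_file_entry(boto3.client("s3"), "my-bucket", "clarisearch/depo.pdf")
```

Note that a URL signed with temporary credentials stops working when those credentials expire, even if `ExpiresIn` is longer, so the signing identity should outlive the 4-hour window.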
Microsoft Azure Blob Storage
Security Best Practices
✅ Do This
- Use 4+ hour expiration: Ensures URLs don't expire during processing
- Validate file sizes: Check against 500MB per file limit
- Use dedicated directories: Create "clarisearch/" subdirectories for organization
- Implement least privilege: Grant only necessary read permissions
❌ Avoid This
- Public file URLs: Never make sensitive files publicly accessible
- Short expiration times: URLs expiring in <30 minutes may fail
- Permanent credentials: Never share storage access keys
- Overly broad permissions: Avoid admin/full bucket access
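Two of these practices, the file size limit and the URL lifetime, are easy to check before submitting a job. The sketch below is a hypothetical pre-flight helper. It reads "500MB" conservatively as 500,000,000 bytes, which is an assumption, and it expects you to track each URL's expiry time yourself when you sign it.

```python
from datetime import datetime, timedelta, timezone

MAX_FILE_BYTES = 500 * 1000 * 1000      # assumption: "500MB" taken as 500,000,000 bytes
MIN_URL_LIFETIME = timedelta(hours=4)   # from the 4+ hour recommendation

def preflight_problems(file_name, size_bytes, url_expires_at, now=None):
    """Return a list of human-readable problems for one file; empty means it looks fine."""
    now = now or datetime.now(timezone.utc)
    problems = []
    if size_bytes > MAX_FILE_BYTES:
        problems.append(f"{file_name}: {size_bytes} bytes exceeds the 500MB per-file limit")
    if url_expires_at - now < MIN_URL_LIFETIME:
        problems.append(f"{file_name}: URL expires in under 4 hours")
    return problems
```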
⚠️ Common Troubleshooting
"Status 403" Error: Check URL expiration and service account permissions
"Status 404" Error: Verify file exists and path is correct
Processing Failures: Ensure URLs have 4+ hours before expiration