
Introduction

Dastavez provides intelligent document extraction for Indian documents and browser automation agents for web-based workflows.

Document Extraction

Supported Documents

Category | Document Types
Identity | Aadhaar, PAN, Voter ID, Passport, Driving License
Financial | Bank Statements, ITR, Form 16, Salary Slips
Business | GST Invoice, GST Returns, Company Registration, MSME Certificate
Legal | Property Documents, Sale Deed, Rental Agreement
Education | Mark Sheets, Degree Certificates, Transcripts
Healthcare | Prescriptions, Lab Reports, Insurance Claims

Multi-Language OCR

Dastavez supports extraction from documents in:
  • Hindi, English, Tamil, Telugu, Bengali
  • Marathi, Gujarati, Kannada, Malayalam
  • Punjabi, Odia, Assamese

Extraction Quality

Feature | Capability
Accuracy | 98%+ for standard Indian documents
Handwriting | Supported for select fields
Image Quality | Auto-enhancement for low-quality scans
Validation | Built-in checksum and format validation
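
To make "checksum and format validation" concrete, here is a minimal sketch of two such checks. This page does not describe how Dastavez validates fields internally, so treat this as an illustration rather than the service's implementation. It relies on two publicly known facts: the last digit of a 12-digit Aadhaar number is a Verhoeff check digit, and a PAN is five letters, four digits, and one letter. The function names are hypothetical.

```python
import re

# Verhoeff multiplication table (dihedral group D5).
_D = [
    [0, 1, 2, 3, 4, 5, 6, 7, 8, 9],
    [1, 2, 3, 4, 0, 6, 7, 8, 9, 5],
    [2, 3, 4, 0, 1, 7, 8, 9, 5, 6],
    [3, 4, 0, 1, 2, 8, 9, 5, 6, 7],
    [4, 0, 1, 2, 3, 9, 5, 6, 7, 8],
    [5, 9, 8, 7, 6, 0, 4, 3, 2, 1],
    [6, 5, 9, 8, 7, 1, 0, 4, 3, 2],
    [7, 6, 5, 9, 8, 2, 1, 0, 4, 3],
    [8, 7, 6, 5, 9, 3, 2, 1, 0, 4],
    [9, 8, 7, 6, 5, 4, 3, 2, 1, 0],
]

# Verhoeff position-dependent permutation table.
_P = [
    [0, 1, 2, 3, 4, 5, 6, 7, 8, 9],
    [1, 5, 7, 6, 2, 8, 3, 0, 9, 4],
    [5, 8, 0, 3, 7, 9, 6, 1, 4, 2],
    [8, 9, 1, 6, 0, 4, 3, 5, 2, 7],
    [9, 4, 5, 3, 1, 2, 6, 8, 7, 0],
    [4, 2, 8, 6, 5, 7, 3, 9, 0, 1],
    [2, 7, 9, 3, 8, 0, 6, 4, 1, 5],
    [7, 0, 4, 6, 9, 1, 3, 2, 5, 8],
]

# PAN format: five uppercase letters, four digits, one uppercase letter.
_PAN_PATTERN = re.compile(r"[A-Z]{5}[0-9]{4}[A-Z]")


def verhoeff_valid(number: str) -> bool:
    """Return True if the digit string (check digit last) passes Verhoeff."""
    if not (number.isascii() and number.isdigit()):
        return False
    c = 0
    # Process digits right to left, starting with the check digit.
    for i, ch in enumerate(reversed(number)):
        c = _D[c][_P[i % 8][int(ch)]]
    return c == 0


def looks_like_aadhaar(value: str) -> bool:
    """Hypothetical helper: 12 digits (spaces ignored) with a valid checksum."""
    digits = value.replace(" ", "")
    return len(digits) == 12 and verhoeff_valid(digits)


def looks_like_pan(value: str) -> bool:
    """Hypothetical helper: format check only; PAN has no public checksum."""
    return _PAN_PATTERN.fullmatch(value) is not None
```

A check like this only confirms that a value is well formed. It cannot confirm that the number was actually issued to anyone.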

Browser Agents

Capabilities

  • Navigate websites and web applications
  • Fill forms and submit data
  • Extract data from web pages
  • Handle authentication flows
  • Take screenshots and generate PDFs

Use Cases

  • Government portal automation (GST, MCA, EPFO)
  • Bank statement downloads
  • Insurance policy retrieval
  • Compliance data collection

Quick Example

from rotavision import Rotavision

client = Rotavision()

# Extract from Aadhaar card
result = client.dastavez.extract(
    document_type="aadhaar",
    file_url="s3://my-bucket/aadhaar-scan.pdf",
    options={
        "mask_number": True,  # Mask Aadhaar number in response
        "extract_photo": True
    }
)

print(f"Name: {result.fields['name']}")
print(f"DOB: {result.fields['dob']}")
print(f"Confidence: {result.confidence}")

# Create browser agent for GST portal
agent = client.dastavez.create_agent(
    name="GST Returns Fetcher",
    capabilities=["navigate", "login", "extract", "download"]
)

# Run workflow
execution = client.dastavez.run_workflow(
    agent_id=agent.id,
    workflow={
        "steps": [
            {"action": "navigate", "url": "https://gst.gov.in"},
            {"action": "login", "credentials_key": "gst_portal"},
            {"action": "extract", "selector": ".returns-table"},
            {"action": "download", "selector": ".gstr1-pdf"}
        ]
    }
)
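
In the example above, each workflow step's action matches one of the capabilities the agent was created with. Assuming that pairing is intentional (this page does not say whether the API enforces it), a small client-side check can catch a mismatch before the workflow is submitted. The helper name below is hypothetical and is not part of the SDK.

```python
from typing import Any


def unsupported_actions(workflow: dict[str, Any], capabilities: list[str]) -> list[str]:
    """Return step actions that are not in the agent's capability list.

    Assumes action names and capability names share one vocabulary,
    as they do in the sample workflow.
    """
    allowed = set(capabilities)
    missing: list[str] = []
    for step in workflow.get("steps", []):
        action = step.get("action")
        # Report each unsupported action once, in order of first appearance.
        if action not in allowed and action not in missing:
            missing.append(action)
    return missing


workflow = {
    "steps": [
        {"action": "navigate", "url": "https://gst.gov.in"},
        {"action": "login", "credentials_key": "gst_portal"},
        {"action": "extract", "selector": ".returns-table"},
        {"action": "download", "selector": ".gstr1-pdf"},
    ]
}

problems = unsupported_actions(workflow, ["navigate", "extract"])
if problems:
    print(f"Agent lacks capabilities for: {', '.join(problems)}")
```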

Endpoints

Method | Endpoint | Description
POST | /dastavez/extract | Extract from document
GET | /dastavez/extractions/{id} | Get extraction result
GET | /dastavez/extractions | List extractions
POST | /dastavez/agents | Create browser agent
GET | /dastavez/agents/{id} | Get agent details
POST | /dastavez/agents/{id}/workflows | Run agent workflow
GET | /dastavez/workflows/{id} | Get workflow status
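
The separate "Get workflow status" endpoint suggests that workflows run asynchronously and are polled until they finish. The sketch below shows one way to do that. Only the endpoint path comes from the table above; the "status" response field and its values are assumptions, and get_json is a hypothetical callable that wraps whatever authenticated HTTP client you use.

```python
import time
from typing import Any, Callable

# Assumed terminal values; the real API may use different names.
TERMINAL_STATUSES = {"completed", "failed"}


def wait_for_workflow(
    get_json: Callable[[str], dict[str, Any]],
    workflow_id: str,
    interval: float = 2.0,
    timeout: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
) -> dict[str, Any]:
    """Poll GET /dastavez/workflows/{id} until a terminal status or timeout."""
    path = f"/dastavez/workflows/{workflow_id}"
    deadline = time.monotonic() + timeout
    while True:
        body = get_json(path)
        if body.get("status") in TERMINAL_STATUSES:
            return body
        if time.monotonic() >= deadline:
            raise TimeoutError(f"workflow {workflow_id} did not finish in {timeout}s")
        sleep(interval)
```

Passing the HTTP call in as a function keeps the loop independent of any particular client and makes it easy to exercise with a stub. The same pattern would apply to GET /dastavez/extractions/{id} if extractions are also asynchronous.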