Email Discovery Architecture
Pattern-based email generation from employee names and company domains with SMTP validation
Internal Components
Service Configuration
Replicas3
Poll Interval60s
Batch Size10 employees
Patterns18+ templates
Database Outputs
employees.emailDiscovered email addressemployees.email_verifiedVerification statusemployees.email_finderProcessing flagPipeline Position
Upstream
Social Collectors (names + domains)
Downstream
Email Validator (SMTP check)
Component Breakdown
Pattern Generator
18+ email patterns: first.last (~13-20%), flast (~15%), first (~50-61% small orgs), and 15 more variants.
SMTP Validator
Via Reacher (self-hosted). Tests each candidate with RCPT TO. Stops on first valid result for efficiency.
Greylisting Resilience
3 retries with exponential backoff (20s initial, 2x multiplier, 320s max) to handle greylisting.
Result Writer
Sets email + email_verified on employee record. Marks email_finder=TRUE to skip on future runs.