Skip to Content

Data Models

Core Data Model

This ER diagram shows the relationships between core entities in the NeuraScale system.

Core Entities:

EntityDescriptionPrimary Storage
UserSystem users (researchers, clinicians)PostgreSQL
DeviceNeural recording devicesPostgreSQL
SessionRecording sessionsPostgreSQL
RecordingContinuous data from a devicePostgreSQL + Bigtable
DataChunkCompressed time-series segmentsBigtable
SampleIndividual data pointsBigtable
FeatureExtracted features from recordingsBigQuery
ClassificationML model predictionsBigQuery
EventSession events and markersPostgreSQL
DeviceStatusReal-time device metricsRedis + PostgreSQL

Data Volume Estimates:

  • Users: ~1,000
  • Devices: ~10,000
  • Sessions: ~100,000/month
  • Recordings: ~1M/month
  • Samples: ~100B/month (at 250Hz)
  • Features: ~10M/month
  • Classifications: ~10M/month

Clinical Data Model

This ER diagram shows the HIPAA-compliant clinical data model for patient management and medical records.

Clinical Data Entities:

EntityDescriptionEncryption
PatientDe-identified patient recordsPII encrypted
ClinicianLicensed healthcare providersCredentials encrypted
ConsentPatient consent recordsDigitally signed
ClinicalSessionMedical recording sessionsAudit trail
ClinicalRecordingMedical-grade neural recordingsEncrypted at rest
ClinicalAnnotationClinical observations and notesEncrypted
BiomarkerExtracted clinical metricsReference ranges
ReportClinical reports and findingsEncrypted, versioned
ReportDistributionReport access trackingAudit log

Clinical Workflows:

  • Patient registration with consent
  • Pre-session impedance checks
  • Recording with clinical annotations
  • Automated biomarker extraction
  • Report generation and distribution
  • Follow-up scheduling

Time-Series Data Model

This section details the optimized data model for high-frequency neural signal storage and retrieval.

Time-Series Storage Strategy:

Storage TierRetentionAccess SpeedCostUse Case
Hot24 hours< 1ms$$$Real-time streaming
Warm30 days< 10ms$$Recent analysis
ColdUnlimited< 1s$Long-term archive

Data Characteristics:

  • Sample rate: 250-1000 Hz
  • Channels: 1-256
  • Data type: Float32 (4 bytes)
  • Raw throughput: Up to 1 MB/s per device
  • Compression ratio: 3:1 to 5:1

Optimization Techniques:

  • Delta encoding for sequential samples
  • Chunking for parallel processing
  • Column-oriented storage for channels
  • Bloom filters for quick existence checks
  • Pre-computed aggregations
Last updated on