Acoustic Camera · Counter-UAS

AkiraEar

Passive acoustic 3D camera for drone detection, classification and localization.

Six identical panels in a half-dodecahedron — each an ESP32-S3 + PCMD3180 + 8× IM70D122 MEMS array. The first system to detect, classify and track drones on an embedded MCU — no FPGA, no cloud.

ODASAM-indexGCC-PHATKalman SST
AkiraEar acoustic camera — half-dodecahedron MEMS array
48
MEMS microphones (6×8)
steradian coverage
≤ 3°
angular accuracy
≈$207
full system BOM

Three Independent Layers

Each computing layer solves one question — from raw PDM to a tracked 3D target

Layer 1

ODAS SSL

GCC-PHAT cross-correlations, TDOA. "Where is the sound?" — DOA candidates (azimuth, elevation) over a two-level sphere search.

Layer 2

Drone Discriminator

AM-index + Spectral Variance. "Is it a drone?" — two physics-based discriminators filter candidates (Patent Claims 1+2).

Layer 3

ODAS SST

Kalman-filter tracking. "Where is it heading?" — stable 3D DOA output for confirmed targets at 10 Hz.

Hardware

PCMD3180 delays

Per-channel programmable delay registers — hardware TDOA presteering at the PDM level, before digitization.

First system without an FPGA

Commercial acoustic cameras can locate sound — none of them tell a drone apart from the wind

AkiraEarFluke ii915FOTRIC TD2SonoCam
ChipESP32-S3FPGAFPGAFPGA
MEMS / system48 (6×8)646464
Coverage2π (3D)63°×63°58°×45°66°×52°
Drone discrim.AM-index + VarNoNoNo
System price≈$200 BOM$15–25k$10–20k$8–15k
Cloud-freeYesPartialPartialNo
Patent3 claimsNoNoNo

One Panel, All On-Chip

ESP32-S3 + PCMD3180 + 8 MEMS — one board, ~$33 BOM, fully repeatable ×6

AkiraEar single panel PCB — spiral sparse MEMS layout

ESP32-S3-WROOM-1

1× per panel

Xtensa LX7 dual-core @ 240 MHz with PIE SIMD, 8 MB PSRAM, 16 MB Flash, native WiFi + BLE 5.0. Everything on one chip.

PCMD3180

8-ch PDM→TDM

Texas Instruments converter in master mode — generates BCLK/FSYNC, per-channel hardware delays, MICBIAS up to 20 mA.

8× IM70D122

SNR 70 dB · IP57

Infineon MEMS, -26 dBFS, 30 Hz low cutoff for blade-pass frequencies. Spiral sparse, Acoular-optimized array per panel.

Half-dodecahedron

6 panels · 48 MEMS

116.565° dihedral → ~64° between panel boresights. Main lobes meet at -3 dB: 2π steradian coverage, no moving parts.

Spiral sparse, half-dodecahedron

Each panel uses an Acoular-optimized spiral sparse array — non-redundant baselines, large aperture, no grating lobes up to 2 kHz. Six panels at the dodecahedron's 116.565° dihedral angle place their boresights ~64° apart, so neighboring main lobes meet at the -3 dB level.

PSF < 10° at 500 Hz, < 5° at 1 kHz
100 Hz – 2 kHz band — drone blade-pass harmonics
2π steradian coverage with no mechanical drive
168 unique TDOA pairs across 48 microphones
AkiraEar half-dodecahedron 6-panel configuration

Performance Budget

~20 ms on a 32 ms frame — 62.5% Core 0 load per panel, with headroom on Core 1

TaskTime / frameCore
I2S TDM DMA (8ch)< 0.1 msHW
FFT 512pt × 8ch6.4 msCore 0
Mel filterbank × 82.4 msCore 0
GCC-PHAT 28 pairs (SSL)8.0 msCore 0
DOA sphere search3.0 msCore 0
AM-index × N0.5 msCore 1
Spec-var × N0.3 msCore 1
Kalman SST update1.0 msCore 1
WiFi MQTT publish< 1 msCore 1

Expected Performance

Detection range
50–300 m (Ryze Tello, 6-panel system)
Angular accuracy (1 panel)
≤ 5° azimuth & elevation
Angular accuracy (6 panels)
≤ 3° (multi-panel fusion)
Coverage
2π steradian (full upper hemisphere)
False positive rate
< 1% (AM-index + Spec-Var double filter)
True positive rate
≥ 85% in steady hover
Detection latency
< 1 s drone appearance → first MQTT event
DOA update latency
< 100 ms (SST @ 10 Hz)
System power
≤ 12 W @ 5 V
System BOM
≈ $207 (×50 cheaper than Squarehead G2+)

Build counter-UAS with PenEngineering

AkiraEar brings FPGA-class acoustic detection to a $200 embedded platform. We're inviting partners and integrators to co-develop passive, cloud-free drone detection — from perimeter security to airspace monitoring.