This repository builds a web app that transforms visual environments into intuitive soundscapes, letting users experience the visual world through synthetic audio cues. It converts visual data into stereo audio cues in real time, generating dynamic soundscapes, for example by mapping motion to distinct sound signatures.
We believe software should be built to improve quality of life. Enhancing people's lives with open-source software in an accessible and impactful way seems like the right thing to do. You are welcome to join us in this mission!
The software is designed to run in most mobile web browsers. It processes the camera's video privately on the device and generates spatial audio effects for stereo headphones.
Launch the app in a web browser and it translates the live camera input into a dynamic stereo soundscape. As the camera captures its surroundings, the app reacts: a swing moving away could produce a softer, simpler sound, and as it approaches, the sound could grow louder and more complex. Similarly, a sidewalk might emit a steady, textured tone, a car in the distance a low hum, and a wall to the left a sound localized to the left ear. This lets users perceive their surroundings through an innovative auditory interface, fostering greater independence and environmental awareness.
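The following is a minimal sketch of the core idea only, not the app's actual pipeline: sample a camera frame, estimate the brightness of the left and right halves, and map each half to the loudness of an oscillator panned to that ear. The function names are illustrative.

```js
const ctx = new AudioContext(); // must be resumed after a user gesture

function makeVoice(pan) {
  const osc = ctx.createOscillator();      // simple sine source
  const gain = ctx.createGain();
  const panner = ctx.createStereoPanner(); // -1 = left ear, +1 = right ear
  panner.pan.value = pan;
  gain.gain.value = 0;
  osc.connect(gain).connect(panner).connect(ctx.destination);
  osc.start();
  return gain;
}

const left = makeVoice(-1);
const right = makeVoice(1);

// Given an ImageData frame copied from the camera stream onto a <canvas>,
// average the brightness of each half and use it as loudness for that ear.
function frameToSound(imageData) {
  const { data, width, height } = imageData;
  let sumL = 0, sumR = 0;
  for (let y = 0; y < height; y++) {
    for (let x = 0; x < width; x++) {
      const i = (y * width + x) * 4;
      const lum = (data[i] + data[i + 1] + data[i + 2]) / (3 * 255);
      if (x < width / 2) sumL += lum; else sumR += lum;
    }
  }
  const half = (width * height) / 2;
  left.gain.value = sumL / half;   // brighter left half -> louder left ear
  right.gain.value = sumR / half;
}
```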
Milestones 0 to 4, vibecoded with xAI Grok 3.
Milestone 5, vibecoded with SuperGrok 4, with assistance from Gemini 2.5 Pro, OpenAI ChatGPT 4.1 and o4-mini, and Anthropic Claude 4.
Milestone 6, vibecoding with Gemini 2.5 Pro and ChatGPT.
Working on Milestone 6
We welcome contributors!
web/
├── audio/ # Audio synthesis/processing (notes-to-sound, HRTF, mic)
│ ├── audio-controls.js # PowerOn/AudioContext init
│ ├── audio-manager.js # AudioContext management
│ ├── audio-processor.js # Core audio (oscillators, playAudio, cleanup; integrates HRTF/ML depth)
│ ├── hrtf-processor.js # HRTF logic (PannerNode, positional filtering)
│ └── synths/ # Synth methods (extend with HRTF)
│ ├── sine-wave.js
│ ├── fm-synthesis.js
│ └── available-engines.json
├── video/ # Video capture/mapping (camera-to-notes/positions; includes ML depth)
│ ├── video-capture.js # Stream setup/cleanup
│ ├── frame-processor.js # Frame analysis (emits notes/positions; calls ML if enabled)
│ ├── ml-depth-processor.js # New: Monocular depth estimation
│ └── grids/ # Visual mappings
│ ├── hex-tonnetz.js
│ ├── circle-of-fifths.js
│ └── available-grids.json
├── core/ # Orchestration (events, state)
│ ├── dispatcher.js # Event handling
│ ├── state.js # Settings/configs
│ └── context.js # Shared refs
├── ui/ # Presentation (buttons, DOM; optional ML/HRTF toggles)
│ ├── ui-controller.js # UI setup
│ ├── ui-settings.js # Button bindings
│ ├── cleanup-manager.js # Teardown listeners
│ └── dom.js # DOM init
├── utils/ # Cross-cutting tools (TTS, haptics, logs)
│ ├── async.js # Error wrappers
│ ├── idb-logger.js # Persistent logs
│ ├── logging.js # Structured logs
│ └── utils.js # Helpers (getText, ...)
├── languages/ # Localization (add ML/HRTF strings)
│ ├── es-ES.json
│ ├── en-US.json
│ └── available-languages.json
├── test/ # Tests (grouped by category)
│ ├── audio/ # Audio/HRTF tests
│ │ ├── audio-processor.test.js
│ │ └── hrtf-processor.test.js
│ ├── video/ # Video/grid/ML tests
│ │ ├── frame-processor.test.js
│ │ └── ml-depth-processor.test.js # New: Test depth estimation
│ ├── core/ # Dispatcher/state tests (if added)
│ ├── ui/ # UI tests
│ │ ├── ui-settings.test.js
│ │ └── video-capture.test.js
│ └── utils/ # Utils tests (if added)
├── .eslintrc.json # Linting
├── index.html # HTML entry
├── main.js # Bootstrap (update imports for moves/ML init)
├── README.md # Docs (update structure/ML/HRTF)
└── styles.css # Styles
Refactoring Plan
Starting from v0.6 (Milestone 6) we are refactoring core/dispatcher.js to follow the single responsibility principle. The core folder should keep hyphenated file names, leftovers in core/dispatcher.js should be cleaned up, and handler modules should export objects (e.g., videoHandlers) containing handler functions; see the dispatcher sketch after the tree below.
web/
├── core/ # Orchestration hub, including handlers
│ ├── handlers/
│ │ ├── video-handlers.js
│ │ ├── audio-handlers.js
│ │ ├── ui-handlers.js
│ │ ├── settings-handlers.js
│ │ ├── grid-handlers.js
│ │ └── debug-handlers.js
│ ├── dispatcher.js
│ ├── state.js
│ └── context.js
├── index.html
└── ... (audio/, video/, etc.)
The idea behind pulling out a “cleanup manager” into its own module is simply separation of concerns. Up to Milestone 5, the dispatcher and UI-handler code were doing three things at once:
• Routing events (e.g. “teardownUI”)
• Manipulating DOM elements (adding/removing listeners, cleaning up nodes)
• Keeping track of which listeners have been registered
By moving all of the actual listener‐teardown logic into a cleanup-manager.js we:
• Keep our handlers focused purely on “which event do I fire” instead of “how do we undo that wiring.”
• Give ourselves a single place to track, batch, and test tear-down routines (so we don’t accidentally leave stray listeners around).
• Make it far easier to extend or change our UI teardown process in one spot (for example if we swap from native event handlers to a virtual-DOM framework).
In short, the dispatcher and handlers decide what needs to happen, and cleanup-manager.js encapsulates how you actually detach everything.
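A minimal sketch of that module, assuming nothing about the real code beyond "track listeners, detach them in one place"; the function names are illustrative.

```js
// web/ui/cleanup-manager.js (sketch)
const registered = [];

// Register a listener and remember how to undo it later.
export function addManagedListener(target, type, handler, options) {
  target.addEventListener(type, handler, options);
  registered.push({ target, type, handler, options });
}

// Detach everything that was registered through this module.
export function cleanupAllListeners() {
  while (registered.length) {
    const { target, type, handler, options } = registered.pop();
    target.removeEventListener(type, handler, options);
  }
}
```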
// web/core/handlers/audio-handlers.js
// TODO: wire up note synthesis logic (e.g., playAudio)
// TODO: apply HRTF using PannerNode or hrtf-processor
// web/core/handlers/settings-handlers.js
// TODO: read settings from state/localStorage
// TODO: write newSettings to state/localStorage
// web/core/handlers/grid-handlers.js
// TODO: set gridType in settings and trigger grid rendering
// web/core/handlers/ui-handlers.js
// TODO: wire up button UI updates
// TODO: remove UI event listeners, cleanup DOM
// cleanupAllListeners(context);
// web/core/handlers/debug-handlers.js
// TODO:
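To make the shape of these stubs concrete, here is a hedged sketch of how one of them (audio-handlers.js) could take form. It assumes playAudio is a named export of audio-processor.js and invents its signature and the settings shape for illustration.

```js
// web/core/handlers/audio-handlers.js (sketch)
import { playAudio } from '../../audio/audio-processor.js';

export const audioHandlers = {
  // Triggered by the dispatcher when the frame processor emits notes.
  async playNotes({ notes, positions, context }) {
    // TODO: apply HRTF via hrtf-processor before playback
    return playAudio(notes, positions, context.settings);
  },
};
```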
Separation of concerns
• Good: You’ve pulled out “logEvent” and “inspectState” so that your dispatcher doesn’t need to know the details of how debugging works.
• Could improve: Rather than calling getLogs().then(console.log), consider returning a promise or emitting a structured debug event—this makes it easier to build UIs or remote‐ship logs instead of only dumping to the console.
Consistency with your logging/telemetry layer
• Right now you mix structuredLog('DEBUG', …) with a raw console.log. If you already have a telemetry/IndexedDB pipeline in telemetry.js or state.js, lean on that so your debug output goes through the same filters/formatters and obeys your debugLogging flag.
Naming and API shape
• logEvent({ event }) overlaps conceptually with your existing structuredLog; it may be redundant unless you’re transforming or storing the event somewhere different.
• inspectState({ context }) never uses context—either remove the unused parameter or allow callers to pass a callback/context for more flexible introspection (e.g. UI dialog vs console).
Extensibility
• If you ever want live debugging tools (hot toggles, wire up a REPL in the page, remote debug), you’ll want a richer API than just two methods. Think about returning structured objects or exposing hooks for subscribers rather than only side-effects.
The next step is to align them more closely with your existing telemetry/logging infrastructure, tighten up their API (parameters, return values), and ensure they’re genuinely adding value beyond what structuredLog already gives us.
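A sketch of that suggested direction: return structured data instead of dumping to the console, and route output through the existing logging layer. getLogs and structuredLog are referenced above, but their exact signatures and locations here are assumptions.

```js
// web/core/handlers/debug-handlers.js (sketch)
import { structuredLog } from '../../utils/logging.js';
import { getLogs } from '../../utils/idb-logger.js';

export const debugHandlers = {
  // Record a debug event through the shared logging pipeline.
  logEvent({ event }) {
    structuredLog('DEBUG', event);
  },

  // Return the persisted logs so callers (console, UI dialog, remote
  // log shipper) decide what to do with them.
  async inspectState({ context }) {
    const logs = await getLogs();
    return { settings: context?.settings ?? null, logs };
  },
};
```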
//core/dispatcher.js
graph TD
A[dispatcher.js] -->|routes| B[core/handlers/]
B --> C[video-handlers.js]
B --> D[audio-handlers.js]
B --> E[ui-handlers.js]
B --> F[settings-handlers.js]
B --> G[grid-handlers.js]
B --> H[debug-handlers.js]
C -->|calls| I[video/frame-processor.js]
D -->|calls| J[audio/audio-processor.js]
E -->|updates| K[ui/ui-settings.js]
F -->|uses| L[utils/utils.js]
A -->|state| M[state.js]
A -->|logs| N[utils/logging.js]
B -->|future| O[ml-handlers.js]
AcoustSee is available under two distinct licenses, allowing you to choose the one that best suits your needs.
1. Open Source License (GPL-3.0)
This project is licensed under the GNU General Public License v3.0.
This means you are free to use, share, and modify this software for open-source projects, academic research, and personal use. Any derivative work must also be licensed under the GPL-3.0, and you must provide the complete corresponding source code.
2. Commercial License
The terms of the GPL-3.0 are not suitable if you want to integrate AcoustSee into a proprietary, closed-source commercial product. For that use, a commercial license is available from us.
A commercial license exempts you from the “share-alike” requirements of the GPL and allows for private, commercial use.
To inquire about purchasing a commercial license, contact us.
For full details, see the LICENSE.md file.
Privacy, Analytics, and Your Control
To build the best possible version of AcoustSee, we need to understand how it’s being used in the real world. For this purpose, the application collects a small amount of completely anonymous usage data. This data is vital for helping us prioritize new features, fix bugs, and ensure compatibility.
What This Means for You
When the app loads, it sends an anonymous data packet to our secure Cloudflare analytics endpoint. This is a one-time event per session and is designed to have zero impact on performance or your experience.
Our Data Promise: We are only interested in statistical trends, not individuals.
Your Control
We believe you should have the final say over your data. While this anonymous data is incredibly helpful to the project, we provide an option to disable it in the application’s settings.
The entire process is open and transparent. The code that sends this data can be reviewed in core/telemetry.js. We are committed to ethical analytics and protecting your privacy.
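As an illustration only, an opt-out telemetry ping could look like the sketch below. The settings flag, payload fields, and URL are placeholders, not the contents of the real core/telemetry.js.

```js
// Sketch of an anonymous, opt-out telemetry ping.
export async function sendTelemetryPing(settings) {
  if (!settings.analyticsEnabled) return;          // user opt-out wins
  const payload = {
    version: settings.appVersion,
    language: navigator.language,
    // No identifiers: only coarse, anonymous facts about the session.
  };
  try {
    await fetch('https://example.invalid/analytics', {  // placeholder URL
      method: 'POST',
      headers: { 'Content-Type': 'application/json' },
      body: JSON.stringify(payload),
      keepalive: true,                              // survive page unload
    });
  } catch {
    // Telemetry must never break the app; failures are ignored.
  }
}
```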
Peace. Love. Union. Respect.