Vision-Based Cooking Assistant (MS Thesis Prototype)

MS Thesis Archive (2024)

This repository contains the prototype implementation developed for my 2024 master's thesis as part of a joint-degree program between Japan Advanced Institute of Science and Technology (JAIST) and Kanazawa University (The degree was conferred by JAIST as the home institution).:

"Voice Assistance Using Visual Information in Continuous Cooking Tasks to Extend User Experience" Japan Advanced Institute of Science and Technology (JAIST), March 2024

Internal project codename: VAs-got-vision

The system explores how voice-based cooking assistants can be enhanced by incorporating real-time visual context, such as object detection and environmental cues, to support continuous interaction and reduce user confusion during tasks.

To model the asynchronous nature of user behavior, perception, and system feedback in a cooking scenario, the prototype was implemented using an actor-based system architecture in Rust, where components such as visual processing, voice input, and interaction logic were isolated and coordinated via message passing.

This design aimed to simulate real-time modular responsiveness in complex interaction flows.

This code is not part of any published paper.
It is provided for archival purposes and technical reference only.

Project Summary

Thesis Title: Voice Assistance Using Visual Information in Continuous Cooking Tasks to Extend User Experience
Institution: Japan Advanced Institute of Science and Technology (JAIST)
Supervisor: Prof. Takaya YUIZONO, JAIST (ORCID)
Second Supervisor: Prof. Naoki SUGANUMA, Kanazawa University (J-Global)
Completion: March 2024
Status: Research discontinued; not integrated into later projects.

Technologies Used

Rust (Actor-based architecture) for modular and asynchronous component communication
OpenCV for visual input and experimental recognition pipelines
Voice recognition: Picovoice SDK

Disclaimer

This code was part of an early-stage prototype and is not production-ready.
Some parts may depend on deprecated APIs or internal tools.
No official support is provided.

Citation (IEEE style)

If you use or refer to this project, please cite it as follows:

[1] S. Rhie, “Voice Assistance Using Visual Information in Continuous Cooking Tasks to Extend User Experience”, M.S. thesis, Japan Advanced Institute of Science and Technology (JAIST), joint-degree program with Kanazawa University, 2024. [Online]. Available: https://github.com/noxowl/vision-cooking-assistant-ms-thesis

License

This project is licensed under the BSD 3-Clause License.

You are free to use, modify, and distribute this code for non-commercial or commercial purposes, provided that the original copyright notice and this license are included
in all copies or substantial portions of the Software.

Note: Redistribution using the names of the original author or affiliated institutions for endorsement is not permitted.

Name		Name	Last commit message	Last commit date
Latest commit History 82 Commits
.cargo		.cargo
resources/images		resources/images
src		src
.gitignore		.gitignore
CITATION.cff		CITATION.cff
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md
color_picker.py		color_picker.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Vision-Based Cooking Assistant (MS Thesis Prototype)

Project Summary

Technologies Used

Disclaimer

Citation (IEEE style)

License

About

Uh oh!

Releases

Packages

Languages

License

noxowl/vision-cooking-assistant-ms-thesis

Folders and files

Latest commit

History

Repository files navigation

Vision-Based Cooking Assistant (MS Thesis Prototype)

Project Summary

Technologies Used

Disclaimer

Citation (IEEE style)

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages