Audio Manipulation Through Gesticulation
Garrett Fosdick, Jair Robinson
José Sanchez
Bradley University - Electrical & Computer Engineering
October 6, 2015
Overview
- Background
- Design Approach
- Economic Analysis
- Schedule
- Division of Labor
- Societal and Environmental Impacts
Background
Problem
• Audio manipulation over a distance
• Audio manipulation while multi-tasking
• Interactivity with music
Problem Background
• Similar products
- PlayStation EyeToy
- Xbox Kinect
• Difference
- Audio interactivity
- Purely 2D image tracking
Images: Microsoft 2014, Sony 2008
Solution
• Kinetis Tower
• Visual and audio input
• Programming based
- Hand tracking
- Dynamic Time Warping
- Tie audio effects to motions
Images: Freescale 2014, Leopard Imaging
Solution - Audio Manipulation Through Gesticulation
• Audio manipulation over a distance
- Control while several feet away
• Audio manipulation while multi-tasking
- Control with movement of a single hand
• Interactivity with music
- Interact through motion
- Reduce repetitiveness of songs
Hand Tracking Design
Color Matching Through Zeroing
Color Thresholding
Motion Thresholding
Color and Motion Thresholding
Search Limiting
Results
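The slides name the hand tracking stages but do not show how they were implemented. As a rough illustration only, the sketch below shows one common way to combine a color threshold with a motion threshold and locate the result. The function names, the RGB tolerance test, and the frame-difference test are all assumptions for illustration, not the project's method.

```python
def color_mask(frame, target, tolerance):
    """True where every RGB channel is within `tolerance` of `target` (assumed test)."""
    return [[all(abs(c - t) <= tolerance for c, t in zip(pixel, target))
             for pixel in row] for row in frame]

def motion_mask(prev_gray, curr_gray, threshold):
    """True where brightness changed by more than `threshold` between two frames."""
    return [[abs(c - p) > threshold for p, c in zip(prev_row, curr_row)]
            for prev_row, curr_row in zip(prev_gray, curr_gray)]

def combine(mask_a, mask_b):
    """Keep only pixels that pass both the color test and the motion test."""
    return [[a and b for a, b in zip(row_a, row_b)]
            for row_a, row_b in zip(mask_a, mask_b)]

def centroid(mask):
    """Average (row, column) of the True pixels, or None if there are none."""
    points = [(r, c) for r, row in enumerate(mask)
              for c, on in enumerate(row) if on]
    if not points:
        return None
    return (sum(r for r, _ in points) / len(points),
            sum(c for _, c in points) / len(points))

# Tiny 2x2 example: two skin-colored pixels, only one of which moved.
frame = [[(0, 0, 0), (200, 150, 120)],
         [(205, 148, 118), (0, 0, 0)]]
prev_gray = [[0, 0], [150, 0]]
curr_gray = [[0, 150], [150, 0]]

hand = combine(color_mask(frame, (200, 150, 120), 10),
               motion_mask(prev_gray, curr_gray, 30))
print(centroid(hand))
```

Requiring both tests to pass is one way to reject skin-colored background objects that do not move and moving objects that are not skin-colored.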
Dynamic Time Warping
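Calculate Difference
- Subtract B from A and take the absolute value: |A - B|
Example
Sequences: A = 3 5 9, B = 4 2 3
A (repeated down the rows)
3 5 9
3 5 9
3 5 9
B (repeated across the columns)
4 4 4
2 2 2
3 3 3
|A - B|
1 1 5
1 3 7
0 2 6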
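The difference step can be sketched in a few lines of Python, using the A and B values from the example. The function name `difference_matrix` is a hypothetical name chosen for this sketch.

```python
def difference_matrix(a, b):
    """Return D where D[i][j] = |a[j] - b[i]| (rows follow b, columns follow a)."""
    return [[abs(x - y) for x in a] for y in b]

A = [3, 5, 9]
B = [4, 2, 3]
D = difference_matrix(A, B)
for row in D:
    print(row)
```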
Calculate Cheapest Route To Bottom Right Corner
Figure labels: Difference, Cost, Search Area
Example
|A - B|
1 1 5
1 3 7
0 2 6
Cheapest Cost To Get To That Square
Each square's cost is its own difference plus the cheapest cost among the squares above it, to its left, and diagonally up-left.
Step 1
1
Step 2
1 2
2 4
Step 3
1 2 7
2 4 9
2 4
Step 4
1 2 7
2 4 9
2 4 10
The cheapest route to the bottom right corner costs 10.
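The fill-in procedure from the example can be sketched as below. This is a minimal version of the standard Dynamic Time Warping accumulation, assuming moves are allowed from the square above, the square to the left, and the diagonal; the function name `cheapest_costs` is hypothetical.

```python
def cheapest_costs(diff):
    """Accumulate the cheapest cost to reach each square from the top-left corner."""
    rows, cols = len(diff), len(diff[0])
    cost = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            previous = []
            if i > 0:
                previous.append(cost[i - 1][j])      # from above
            if j > 0:
                previous.append(cost[i][j - 1])      # from the left
            if i > 0 and j > 0:
                previous.append(cost[i - 1][j - 1])  # from the diagonal
            cost[i][j] = diff[i][j] + (min(previous) if previous else 0)
    return cost

D = [[1, 1, 5],
     [1, 3, 7],
     [0, 2, 6]]
C = cheapest_costs(D)
for row in C:
    print(row)
print("total cost:", C[-1][-1])
```

The bottom-right value is the overall match cost between the two sequences; a lower value means the recorded motion is closer to the stored gesture.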
Variations
• Time distortion cost
- Non-diagonal movements cost more
• Path killing
- Routes over a certain cost are removed
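Both variations can be added to the basic accumulation with two extra parameters. The sketch below is one possible reading of the slide: `step_penalty` is an assumed fixed surcharge on non-diagonal moves, and `kill_above` is an assumed cost limit past which a square is treated as unreachable. Neither name nor value comes from the slides.

```python
import math

def dtw_with_variations(diff, step_penalty=1, kill_above=math.inf):
    """Return the cheapest total cost, or math.inf if every route was killed."""
    rows, cols = len(diff), len(diff[0])
    cost = [[math.inf] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            if i == 0 and j == 0:
                best = 0
            else:
                candidates = []
                if i > 0 and j > 0:
                    candidates.append(cost[i - 1][j - 1])             # diagonal: no surcharge
                if i > 0:
                    candidates.append(cost[i - 1][j] + step_penalty)  # vertical move
                if j > 0:
                    candidates.append(cost[i][j - 1] + step_penalty)  # horizontal move
                best = min(candidates)
            total = diff[i][j] + best
            # Path killing: a square whose cheapest route is too costly is dropped.
            cost[i][j] = total if total <= kill_above else math.inf
    return cost[-1][-1]

D = [[1, 1, 5],
     [1, 3, 7],
     [0, 2, 6]]
print(dtw_with_variations(D, step_penalty=1))
print(dtw_with_variations(D, step_penalty=1, kill_above=3))
```

Killing expensive routes early lets a non-matching gesture be rejected without filling in the rest of the grid, which may matter on an embedded target.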
Audio Processing
• Processing time
• Analog – digital conversion
• 5 audio effects
• Finalizing input and output audio
Low Pass Filtering
• Passes frequencies below the cutoff
Image: Beausievers 2013
High Pass Filtering
• Passes frequencies above the cutoff
Image: Beausievers 2013
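The slides do not say which filter design was used. As an illustration, the sketch below uses a simple one-pole (RC-style) low pass filter and builds the high pass filter as the input minus the low pass output. The function names, the filter order, and the example cutoff and sample rate are assumptions.

```python
import math

def low_pass(samples, cutoff_hz, sample_rate_hz):
    """One-pole low pass: each output moves a fraction `alpha` toward the input."""
    rc = 1.0 / (2 * math.pi * cutoff_hz)
    dt = 1.0 / sample_rate_hz
    alpha = dt / (rc + dt)
    out = []
    previous = 0.0
    for x in samples:
        previous = previous + alpha * (x - previous)
        out.append(previous)
    return out

def high_pass(samples, cutoff_hz, sample_rate_hz):
    """Complementary high pass: whatever the low pass filter removed."""
    low = low_pass(samples, cutoff_hz, sample_rate_hz)
    return [x - l for x, l in zip(samples, low)]

# A constant (0 Hz) signal should pass the low pass filter and be blocked by the high pass filter.
steady = [1.0] * 2000
print(round(low_pass(steady, 1000, 48000)[-1], 3))
print(round(high_pass(steady, 1000, 48000)[-1], 3))
```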
Distortion
• Amplifies the audio signal up to a threshold, then clips it
Image: Wikimedia Commons 2011
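Amplify-then-clip distortion can be sketched in one line per sample. The `gain` and `threshold` defaults below are illustrative assumptions, and hard clipping is only one of several clipping shapes the project could have used.

```python
def distort(samples, gain=2.0, threshold=1.0):
    """Amplify each sample, then hard-clip it to the range [-threshold, threshold]."""
    return [max(-threshold, min(threshold, gain * x)) for x in samples]

print(distort([0.25, 0.75, -0.75]))
```

Quiet samples are only amplified, while loud samples are flattened at the threshold, which is what adds the harsh harmonics.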
Chorus
• Sounds like audio is produced by multiple sources
Image: Sound on Sound 2004
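A common way to get the multiple-source impression is to mix the signal with a copy whose delay slowly varies. The sketch below assumes a single delayed voice with a sine-modulated delay; every parameter name and default is an assumption, and `depth_s` is assumed to be no larger than `base_delay_s`.

```python
import math

def chorus(samples, sample_rate_hz, base_delay_s=0.02, depth_s=0.005,
           rate_hz=0.5, mix=0.5):
    """Mix the input with one copy whose delay is slowly modulated."""
    out = []
    for n, x in enumerate(samples):
        delay_s = base_delay_s + depth_s * math.sin(
            2 * math.pi * rate_hz * n / sample_rate_hz)
        k = n - int(round(delay_s * sample_rate_hz))  # index of the delayed sample
        delayed = samples[k] if k >= 0 else 0.0       # silence before the start
        out.append((1 - mix) * x + mix * delayed)
    return out

# With no modulation the effect reduces to a single fixed echo of an impulse.
impulse = [1.0, 0.0, 0.0, 0.0]
print(chorus(impulse, 1, base_delay_s=2, depth_s=0, mix=0.5))
```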
Reverb
• Sound reflecting in a space
Image: Practical Musical Production 2012
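Reflections can be approximated by feeding a delayed, quieter copy of the output back into the signal. The sketch below is a single feedback comb filter, which is only a basic building block of a reverb; the function name, `delay_s`, and `decay` are assumptions, not the project's design.

```python
def reverb(samples, sample_rate_hz, delay_s=0.05, decay=0.5):
    """Feedback comb filter: each output adds a decayed copy of an earlier output."""
    d = max(1, int(delay_s * sample_rate_hz))  # delay in samples
    out = []
    for n, x in enumerate(samples):
        echo = out[n - d] if n >= d else 0.0
        out.append(x + decay * echo)
    return out

# An impulse produces a train of echoes, each half as loud as the previous one.
print(reverb([1.0] + [0.0] * 6, 1, delay_s=2, decay=0.5))
```

Keeping `decay` below 1 makes each reflection quieter than the last, so the echoes die away instead of building up.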
Development and Testing
Development
• Tools
- Bradley Computers
- Kinetis Freescale Tower
- Leopard Imaging USB Camera
- Kinetis IDE
- MATLAB
Testing
• Hand Tracking
- Must have an 80% success rate in the following conditions
  - Outdoor/Indoor lighting
  - Different movement speeds (slow/medium/fast)
  - At least 3 different hands
- Success is tracking a hand for 30 seconds or more
Testing
• Dynamic Time Warping
- Test against already solved matrices
- Capable of gesture matching success 90% of the time
- Gesture matching will occur in the same scenarios as the hand tracking
Testing
• Audio processing
- 5 audio effects
- Quick processing time
- Satisfactory auditory results
- No crackling or static
- Minimize lag to 100 ms or less
Economic Analysis
Development Costs
• Software
- Kinetis IDE: $0.00
- MATLAB: $0.00 (Provided By School)
• Hardware
- Kinetis Freescale Tower: $294.25
- Leopard Imaging USB Camera: $91.94
- Total: $386.19
Division of Labor
- Decide On Hardware/Order Parts
- Research Video Processes, and Test in MATLAB
- Research Audio Processes, and Test in MATLAB
- Implement Visual Components on Hardware
- Implement Audio Components on Hardware
- Combine Visual and Audio Components
- Test and Improve
- Done
Key: Garrett, Jair, Both
Schedule
Societal and Environmental Impacts
• Is it right to alter an artist's music?
• Liability from damage while gesturing
• Liability of harm if used improperly
Disclaimer before use to protect
- Advise users to use caution
- Check their surroundings
Societal and Environmental Impacts
• For avid music listeners
• RoHS compliant
Conclusion
- Background
  - Need for more interactivity with music
  - Solution provides innovative experience with personal music
- Design Approach
  - Dynamic Time Warping and hand recognition for gestures
  - Program audio effects tied directly with gestures
- Feasible project to finish within schedule
- Environmentally safe
- Socially safe with disclaimer and caution
Extra Slides
Division of Labor
Test Procedures
• Camera Input
- Display Footage On A Monitor - PASS/FAIL
• Hand Tracking
- Display Footage At The End Of Each Step - PASS/FAIL
- Tracks All Test Hands For A Full 30 Seconds
• Gesture Recognition
- Light Up An LED When The Gesture Occurs
- Must Be Right 90% Of The Time
Test Procedures
• Audio Input
- Receiving Without Lag And Distortion - PASS/FAIL
• Audio Output
- Audio Is Audible At Normal Hearing Level - PASS/FAIL
- No Lag Above 100 ms Or Unintended Distortion
• Audio Effects
- No Lag From Gesture Trigger Above 100 ms
- Must Trigger From Correct Gesture 100% Of The Time
Preliminary Test Results - Gesture
Matching: Success
Partial: Fail
Random: Fail
Block Diagram
Glass Block Diagram
Gantt Chart
Schedule