Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

esp32-s3-box-3运行ai_agent/volc_rtc 唤醒词没反应 (AUD-6011) #1355

Open
tangyiyong opened this issue Jan 21, 2025 · 12 comments
Open

esp32-s3-box-3运行ai_agent/volc_rtc 唤醒词没反应 (AUD-6011) #1355

tangyiyong opened this issue Jan 21, 2025 · 12 comments

Comments

@tangyiyong
Copy link

看见推送了这个volc_rtc就编译烧录测试了一下
环境如下:

esp-adf: master
esp-idf: 5.3.1
esp32-s3-box-3
idf.py set-target esp32s3
idf.py menuconfig
idf.py build flash monitor
申请了rtc的appid,生成了临时token,输出如下:

ESP-ROM:esp32s3-20210327
Build:Mar 27 2021
rst:0x15 (USB_UART_CHIP_RESET),boot:0xa (SPI_FAST_FLASH_BOOT)
Saved PC:0x400450f2
--- 0x400450f2: ets_sha_process in ROM

SPIWP:0xee
mode:DIO, clock div:1
load:0x3fce2820,len:0x1918
load:0x403c8700,len:0x4
load:0x403c8704,len:0xe5c
load:0x403cb700,len:0x303c
entry 0x403c893c
I (26) boot: ESP-IDF v5.3.1-dirty 2nd stage bootloader
I (27) boot: compile time Jan 21 2025 23:55:46
I (27) boot: Multicore bootloader
I (30) boot: chip revision: v0.2
I (34) qio_mode: Enabling QIO for flash chip GD
I (39) boot.esp32s3: Boot SPI Speed : 80MHz
I (44) boot.esp32s3: SPI Mode : QIO
I (49) boot.esp32s3: SPI Flash Size : 16MB
I (54) boot: Enabling RNG early entropy source...
I (59) boot: Partition Table:
I (63) boot: ## Label Usage Type ST Offset Length
I (70) boot: 0 nvs WiFi data 01 02 00009000 00004000
I (77) boot: 1 phy_init RF data 01 01 0000d000 00001000
I (85) boot: 2 factory factory app 00 00 00010000 00300000
I (92) boot: 3 model Unknown data 01 82 00310000 0040e000
I (100) boot: 4 spiffs_data Unknown data 01 82 0071e000 00010000
I (107) boot: End of partition table
I (112) esp_image: segment 0: paddr=00010020 vaddr=3c180020 size=52054h (335956) map
I (171) esp_image: segment 1: paddr=0006207c vaddr=3fc9f900 size=07e2ch ( 32300) load
I (177) esp_image: segment 2: paddr=00069eb0 vaddr=40378000 size=06168h ( 24936) load
I (182) esp_image: segment 3: paddr=00070020 vaddr=42000020 size=179420h (1545248) map
I (420) esp_image: segment 4: paddr=001e9448 vaddr=4037e168 size=116cch ( 71372) load
I (444) boot: Loaded app from partition at offset 0x10000
I (444) boot: Disabling RNG early entropy source...
I (456) octal_psram: vendor id : 0x0d (AP)
I (456) octal_psram: dev id : 0x03 (generation 4)
I (456) octal_psram: density : 0x05 (128 Mbit)
I (461) octal_psram: good-die : 0x01 (Pass)
I (466) octal_psram: Latency : 0x01 (Fixed)
I (472) octal_psram: VCC : 0x00 (1.8V)
I (477) octal_psram: SRF : 0x01 (Fast Refresh)
I (483) octal_psram: BurstType : 0x01 (Hybrid Wrap)
I (489) octal_psram: BurstLen : 0x01 (32 Byte)
I (494) octal_psram: Readlatency : 0x02 (10 cycles@Fixed)
I (500) octal_psram: DriveStrength: 0x00 (1/1)
I (506) MSPI Timing: PSRAM timing tuning index: 6
I (511) esp_psram: Found 16MB PSRAM device
I (515) esp_psram: Speed: 80MHz
I (519) cpu_start: Multicore app
I (1087) esp_psram: SPI SRAM memory test OK
I (1096) cpu_start: Pro cpu start user code
I (1096) cpu_start: cpu freq: 240000000 Hz
I (1096) app_init: Application information:
I (1099) app_init: Project name: volc_rtc
I (1104) app_init: App version: 1
I (1108) app_init: Compile time: Jan 21 2025 23:55:33
I (1114) app_init: ELF file SHA256: d13c2e261...
I (1120) app_init: ESP-IDF: v5.3.1-dirty
I (1125) efuse_init: Min chip rev: v0.0
I (1130) efuse_init: Max chip rev: v0.99
I (1135) efuse_init: Chip rev: v0.2
I (1140) heap_init: Initializing. RAM available for dynamic allocation:
I (1147) heap_init: At 3FCAEC50 len 0003AAC0 (234 KiB): RAM
I (1154) heap_init: At 3FCE9710 len 00005724 (21 KiB): RAM
I (1160) heap_init: At 600FE100 len 00001EE8 (7 KiB): RTCRAM
I (1166) esp_psram: Adding pool of 16384K of PSRAM memory to heap allocator
I (1174) spi_flash: detected chip: gd
I (1178) spi_flash: flash io: qio
I (1182) sleep: Configure to isolate all GPIO pins in sleep state
I (1189) sleep: Enable automatic switching of GPIO sleep configuration
I (1196) main_task: Started on CPU0
I (1206) esp_psram: Reserving pool of 32K of internal memory for DMA/internal allocations
I (1206) main_task: Calling app_main()
I (1226) main: Initialize board peripherals
I (1236) PERIPH_SPIFFS: Partition size: total: 52961, used: 12299
I (1236) AUDIO_THREAD: The esp_periph task allocate stack on internal memory
W (1236) i2c_bus_v2: I2C master handle is NULL, will create new one
I (1246) gpio: GPIO[8]| InputEn: 1| OutputEn: 1| OpenDrain: 1| Pullup: 1| Pulldown: 0| Intr:0
I (1256) gpio: GPIO[18]| InputEn: 1| OutputEn: 1| OpenDrain: 1| Pullup: 1| Pulldown: 0| Intr:0
I (1266) DRV8311: ES8311 in Slave mode
I (1276) gpio: GPIO[46]| InputEn: 0| OutputEn: 1| OpenDrain: 0| Pullup: 0| Pulldown: 0| Intr:0
I (1286) ES7210: ES7210 in Slave mode
I (1296) ES7210: Enable ES7210_INPUT_MIC1
I (1296) ES7210: Enable ES7210_INPUT_MIC2
I (1296) ES7210: Enable ES7210_INPUT_MIC3
W (1306) ES7210: Enable TDM mode. ES7210_SDP_INTERFACE2_REG12: 2
I (1306) ES7210: config fmt 60
I (1306) AUDIO_HAL: Codec mode is 3, Ctrl:1
I (1326) pp: pp rom version: e7ae62f
I (1326) net80211: net80211 rom version: e7ae62f
I (1336) wifi:wifi driver task: 3fcc3948, prio:23, stack:6656, core=0
I (1336) wifi:wifi firmware version: ccaebfa
I (1336) wifi:wifi certification version: v7.0
I (1336) wifi:config NVS flash: enabled
I (1336) wifi:config nano formating: disabled
I (1346) wifi:Init data frame dynamic rx buffer num: 32
I (1346) wifi:Init static rx mgmt buffer num: 5
I (1356) wifi:Init management short buffer num: 32
I (1356) wifi:Init static tx buffer num: 16
I (1366) wifi:Init tx cache buffer num: 32
I (1366) wifi:Init static tx FG buffer num: 2
I (1366) wifi:Init static rx buffer size: 1600
I (1376) wifi:Init static rx buffer num: 16
I (1376) wifi:Init dynamic rx buffer num: 32
I (1386) wifi_init: rx ba win: 16
I (1386) wifi_init: accept mbox: 6
I (1386) wifi_init: tcpip mbox: 32
I (1396) wifi_init: udp mbox: 6
I (1396) wifi_init: tcp mbox: 6
I (1406) wifi_init: tcp tx win: 5760
I (1406) wifi_init: tcp rx win: 5760
I (1406) wifi_init: tcp mss: 1440
I (1416) wifi_init: WiFi/LWIP prefer SPIRAM
I (1416) wifi_init: WiFi IRAM OP enabled
I (1426) wifi_init: WiFi RX IRAM OP enabled
W (1426) wifi:Password length matches WPA2 standards, authmode threshold changes from OPEN to WPA2
I (1436) wifi:Set ps type: 1, coexist: 0

I (1446) phy_init: phy_version 680,a6008b2,Jun 4 2024,16:41:10
I (1476) wifi:mode : sta (60:55:f9:f8:3b:90)
I (1476) wifi:enable tsf
W (1476) PERIPH_WIFI: WiFi Event cb, Unhandle event_base:WIFI_EVENT, event_id:43
I (1496) wifi:new:<7,1>, old:<1,0>, ap:<255,255>, sta:<7,1>, prof:1, snd_ch_cfg:0x0
I (1496) wifi:state: init -> auth (0xb0)
W (1496) PERIPH_WIFI: WiFi Event cb, Unhandle event_base:WIFI_EVENT, event_id:43
I (1506) wifi:state: auth -> assoc (0x0)
I (1506) wifi:state: assoc -> run (0x10)
I (1626) wifi:connected with ho0ome, aid = 19, channel 7, 40U, bssid = 58:ea:1f:70:1d:95
I (1626) wifi:security: WPA2-PSK, phy: bgn, rssi: -39
I (1626) wifi:pm start, type: 1

I (1636) wifi:dp: 1, bi: 102400, li: 3, scale listen interval from 307200 us to 307200 us
I (1636) wifi:set rx beacon pti, rx_bcn_pti: 0, bcn_timeout: 25000, mt_pti: 0, mt_time: 10000
W (1646) PERIPH_WIFI: WiFi Event cb, Unhandle event_base:WIFI_EVENT, event_id:4
I (1666) wifi:idx:0 (ifx:0, 58:ea:1f:70:1d:95), tid:6, ssn:2, winSize:64
I (1666) wifi:idx:1 (ifx:0, 58:ea:1f:70:1d:95), tid:0, ssn:2, winSize:64
I (1696) wifi:AP's beacon interval = 102400 us, DTIM period = 1
I (2646) esp_netif_handlers: sta ip: 192.168.0.246, mask: 255.255.255.0, gw: 192.168.0.1
I (2646) PERIPH_WIFI: Got ip:192.168.0.246
I (2646) audio processor: Create audio pipeline for audio player
I (2656) audio processor: Create audio player audio stream
I (2656) audio processor: Register all elements to playback pipeline
I (2666) audio processor: Link playback element together raw-->audio_decoder-->i2s_stream-->[codec_chip]
E (2676) gpio: gpio_install_isr_service(502): GPIO isr service already installed
E (2686) DISPATCHER: exe first list: 0x0
I (2686) DISPATCHER: dispatcher_event_task is running...
1970-01-01 00:01:14.860 [E] VolcEngineRTCLite.c:105 ****************** HELLO BOOKA (678facc3640b0d00fb10eb44)(1.56.001.58)(6059fcf26792a8820bc81f13662979d531e5504d) ********************
1970-01-01 00:01:14.875 [E] Cache.c:270 operation returned status code: 0x00000009
1970-01-01 00:01:14.889 [E] ThreadPool.c:92 coreid 1 set 1 stack_size 8192 priority 5
I (2726) audio processor: recorder_pipeline_open
I (2726) audio processor: Create audio pipeline for recording
I (2736) audio processor: Create player audio stream
I (2746) audio processor: Register all player elements to audio pipeline
I (2746) audio processor: Link all player elements to audio pipeline
I (2756) audio processor: player_pipeline_open
I (2766) audio processor: Create audio pipeline for playback
I (2766) audio processor: Create playback audio stream
I (2776) audio_stream_7210: Create opus decoder
I (2776) audio processor: Register all elements to playback pipeline
I (2786) audio processor: ENBALE_AUDIO_STREAM_DUAL_MIC
I (2796) audio processor: Link playback element together raw-->audio_decoder-->rsp-->i2s_stream-->[codec_chip]
I (2806) audio processor: player pipe start running
I (2806) volc_rtc: start join room

1970-01-01 00:01:15.299 [E] RoomImplX.c:167 operation returned status code: 0x52000057
1970-01-01 00:01:15.669 [E] Cache.c:309 operation returned status code: 0x00000009
1970-01-01 00:01:15.674 [E] RoomImplX.c:167 operation returned status code: 0x52000057
1970-01-01 00:01:15.675 [E] LiteHttp.c:641 ID 940840976 E_LOGIC : NO need keepAlive
1970-01-01 00:01:15.684 [E] RoomImplX.c:167 operation returned status code: 0x52000057
1970-01-01 00:01:15.804 [E] RoomImplX.c:167 operation returned status code: 0x52000057
I (4076) volc_rtc: join channel success chatbot elapsed 385 ms now 385 ms

I (4076) volc_rtc: join room success

I (4076) RAW_OPUS_ENC: Raw Opus encoder init
I (4086) MODEL_LOADER: The storage free size is 14464 KB
I (4086) MODEL_LOADER: The partition size is 4152 KB
I (4096) MODEL_LOADER: Successfully load srmodels
I (4096) RECORDER_SR: The first wakenet model: wn9_nihaoxiaozhi_tts

I (4106) AFE_SR: afe interface for speech recognition

I (4116) AFE_SR: AFE version: SR_V220727

I (4116) AFE_SR: Initial auido front-end, total channel: 3, mic num: 2, ref num: 1

I (4126) AFE_SR: aec_init: 1, se_init: 0, vad_init: 0

I (4136) AFE_SR: wakenet_init: 0

I (4286) AFE_SR: wake num: 2, mode: 0, (Sep 4 2024 11:49:31)

I (4286) AUDIO_RECORDER: RECORDER_CMD_TRIGGER_START
I (4286) main_task: Returned from app_main()

唤醒词没有响应,可能是什么原因?
谢谢!

@github-actions github-actions bot changed the title esp32-s3-box-3运行ai_agent/volc_rtc 唤醒词没反应 esp32-s3-box-3运行ai_agent/volc_rtc 唤醒词没反应 (AUD-6011) Jan 21, 2025
@qixinyo
Copy link

qixinyo commented Jan 23, 2025

大哥,你有碰到日志报
E (3022) i2c.master: I2C transaction unexpected nack detected
E (3032) i2c.master: s_i2c_synchronous_transaction(888): I2C transaction failed
E (3042) i2c.master: i2c_master_transmit(1116): I2C transaction failed
这个i2c错误的问题吗

@tangyiyong
Copy link
Author

没有碰到啊,板子完好的话,不应该有I2C的问题吧
刚注意日志里时间好像不对,1970-01-01.。。

@qixinyo
Copy link

qixinyo commented Jan 23, 2025

你是啥板子了,板子上就带有对应音频模块的那种吗

@tangyiyong
Copy link
Author

esp32-s3-box-3

@qixinyo
Copy link

qixinyo commented Jan 23, 2025

老哥,可以加你交流一下吗,我现在是在做sparkbot,一个桌面机器人,然后音频解码是es8311,单片机是esp32s3n16r8,现在实现的是文心一言的和小智的,但是无法实时对话,所以想把这套代码和进去

@PoohWoah
Copy link

PoohWoah commented Feb 8, 2025

老哥,可以加你交流一下吗,我现在是在做sparkbot,一个桌面机器人,然后音频解码是es8311,单片机是esp32s3n16r8,现在实现的是文心一言的和小智的,但是无法实时对话,所以想把这套代码和进去

这套有点贵 你还得出RTC费用,进房间就开始算RTC费用

@HaydnLiao
Copy link

大哥,你有碰到日志报 E (3022) i2c.master: I2C transaction unexpected nack detected E (3032) i2c.master: s_i2c_synchronous_transaction(888): I2C transaction failed E (3042) i2c.master: i2c_master_transmit(1116): I2C transaction failed 这个i2c错误的问题吗

我也是,原来能正常用,后面就报这个错误。

@ALToast
Copy link

ALToast commented Feb 10, 2025

大哥,你有备份日志报告 E (3022) i2c.master: I2C transaction意外nack检测到 E (3032) i2c.master: s_i2c_synchronous_transaction(888): I2C transaction failed E (3042) i2c.master: i2c_master_transmit(1116): I2C transaction failed 这个i2c错误的问题吗

Hi @qixinyo,
i2c hw nack 问题请参考 #I2C fails with hw NACK on ES8388 while I2S stream is running (AUD-5942) #1334

@tianrongqin
Copy link

请问这个问题解决了吗?我也出现该问题了

@JlaYun
Copy link

JlaYun commented Feb 18, 2025

怎么样?我也是这样?有解决方案吗?

@shootao
Copy link

shootao commented Feb 18, 2025

@tangyiyong
现在默认模式是 连续交互模式, 使用 唤醒的话, 可以menuconfig 配置成 LANGUAGE_WAKEUP_MODE 模式

@JlaYun
Copy link

JlaYun commented Feb 19, 2025

@tangyiyong 现在默认模式是连续交互模式,使用唤醒的话,可以menuconfig配置成LANGUAGE_WAKEUP_MODE模式

唤醒和还是不能对话,查看VolcEngineRTCLite对应的api发现已经发送到智能体所在rtc房间了

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

8 participants