OpenCV + Yolo + speech Recognition을 이용한 물체 찾기(1)

티스토리 뷰

프로젝트

OpenCV + Yolo + speech Recognition을 이용한 물체 찾기(1)

sikaro 2024. 4. 4. 22:11

간단한 코드.

사진이라고 말하면 openCV를 시작해서 객체 모델을 탐지한다.

멈춰라고 말하면 opecv를 닫는다.

끝내기라고 말하면 프로그램 자체를 끝낸다.

import threading
import speech_recognition as sr
from ultralytics import YOLO
import cv2

# 음성 인식 설정
r = sr.Recognizer()
mic = sr.Microphone()

# 웹캠 캡처 설정
cap = cv2.VideoCapture(0)

# YOLOv8 모델 로드
model = YOLO("yolov8n.pt")

# 물체 인식 스레드 실행 여부 플래그
detect_thread = None
detect_thread_running = False

def listen_and_detect():
    global detect_thread, detect_thread_running

    while True:
        try:
            with mic as source:
                print("말씀해 주세요...")
                audio = r.listen(source)
                text = r.recognize_google(audio, language='ko-KR')
                print(f"인식된 텍스트: {text}")

                if '사진' in text:
                    if not detect_thread_running:
                        detect_thread_running = True
                        detect_thread = threading.Thread(target=detect_objects)
                        detect_thread.start()
                elif '멈춰' in text:
                    detect_thread_running = False
                    if detect_thread is not None:
                        detect_thread.join()
                        detect_thread = None
                elif '끝내기' in text:  # '끝내기' 명령 추가
                    detect_thread_running = False
                    if detect_thread is not None:
                        detect_thread.join()
                    cap.release()
                    cv2.destroyAllWindows()
                    return  # listen_and_detect 함수 종료
        except sr.UnknownValueError:
            print("음성을 인식하지 못했습니다.")
        except sr.RequestError as e:
            print(f"에러가 발생했습니다; {e}")

def detect_objects():
    global detect_thread_running

    while detect_thread_running:
        # 웹캘이 프레임 읽기
        ret, frame = cap.read()

        # YOLOv8 모델로 물체 인식
        results = model(frame)

        # 인식된 물체 위치 표시
        annotated_frame = results[0].plot()
        cv2.imshow('Object Detection', annotated_frame)

        # 'q' 키를 누르면 종료
        if cv2.waitKey(1) & 0xFF == ord('q'):
            detect_thread_running = False
            break

    cv2.destroyAllWindows()


if __name__ == "__main__":
    listen_thread = threading.Thread(target=listen_and_detect)
    listen_thread.daemon = True
    listen_thread.start()
    listen_thread.join()  # 메인 스레드가 종료되지 않도록 하기 위해 추가

시작은 소소하게 한다.

목표는 각 물체를 식별하는 수준까지 가는 것.

시카로의 공부방

티스토리 뷰

OpenCV + Yolo + speech Recognition을 이용한 물체 찾기(1)

티스토리툴바