Facecomm
  • Facecomm
  • Business Docs
    • About Us
    • Our Team
      • Rajeev Singh
      • Amit Rai
      • Sourav Sen Gupta
      • Shreyas Mangalgi
      • Parminder Singh
      • Renish Bhimani
      • Vindhya Chandrasekharan
      • Nitin Rana
    • Understanding Communication
    • Market Study
    • Business Use Cases
    • Product Positioning
    • Product Market Fit & Value Prop
    • Risk Mitigations
    • Product Roadmap
  • Tech Documentation
    • Our Solution
    • Technology Stack
    • VOIP (Voice Over IP)
    • WebRTC
    • AI in Video Conferencing
  • Features & Roadmap
    • Product Functionality Buckets
    • Hackathon Roadmap
Powered by GitBook
On this page
  • AI for Security
  • AI for Accessibility
  • AI for Efficiency
  • AI for User Experience

Was this helpful?

Export as PDF
  1. Tech Documentation

AI in Video Conferencing

PreviousWebRTCNextProduct Functionality Buckets

Last updated 5 years ago

Was this helpful?

AI for Security

Recent advances in the field of Computer Vision has led to a multitude of applications in the field of facial/fingerprint/voice recognition. Deep neural networks based on open source libraries like Pytorch and TensorFlow can be used to develop highly accurate . This can be used for adding an extra layer of authentication when customers join a meeting room.

AI for Accessibility

Advances in Natural language processing have made implementation of real-time language translation a reality. State of the art open-source Neural Machine Translation (NMT) systems are able to understand human language and translate them into multiple languages. Open source solutions are available for . Advances have also been made in the field of real-time translation of which can help people with hearing challenges.

AI for Efficiency

AI for User Experience

AI can be used to incorporate a whole host of features to bring delight to end-users. These include (not limited to),

  • Automatically capturing minutes & highlights of the meeting in the form of audio, video and text for records and sharing.

  • Chatbot support to provide minutes of the meeting for absentee participants.

Efficient communication requires virtual meetings to happen without distractions. Distractions can be in the form of audio or video like background audio noise or visual background of the participant. Visual backgrounds can be removed real-time using deployed on the edge devices using TensorFlowJS. Noise is a form of audio distraction, and machine learning algorithms can help in real-time removal of stationary and non-stationary audio noise.

deep learning models
facial recognition systems
real-time closed captioning of videos
audio to sign language
If it has audio, now it can have captionsGoogle
Logo
DeepSign : Indian Sign Language (Phase 0)Medium
U-Net: Convolutional Networks for Biomedical Image SegmentationarXiv.org
Logo
Logo
6MB
Deep Face Recognition.pdf
pdf
Deep Face Recognition
2MB
Convolutional Networks for Biomedical Image Segmentation.pdf
pdf
Convolutional Networks for Biomedical Image Segmentation