Se ha denunciado esta presentación.
Se está descargando tu SlideShare. ×

Social Computing Research in India

Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Anuncio
Próximo SlideShare
Console Video Game History
Console Video Game History
Cargando en…3
×

Eche un vistazo a continuación

1 de 53 Anuncio

Más Contenido Relacionado

Presentaciones para usted (20)

Similares a Social Computing Research in India (20)

Anuncio

Más de IIIT Hyderabad (20)

Más reciente (20)

Anuncio

Social Computing Research in India

  1. 1. Social Computing Research in India https://www.linkedin.com/in/ponguru/ Feb 4, 2023 IIIT Hyderabad Ponnurangam Kumaraguru (“PK”) #ProfGiri CS IIIT Hyderabad ACM Distinguished Member TEDx Speaker https://www.instagram.com/pk.profgiri/
  2. 2. What is Social Computing? 2 https://en.wikipedia.org/wiki/Social_computing
  3. 3. 3
  4. 4. Legal AI for Indian Context District courts are usually the first point of contact between the people and the judiciary. Lower courts in India are burdened with a backlog of cases (~40 million as of 2021). Local languages used in the documents filed in district courts in India. 4 Supreme Court High Courts District Courts
  5. 5. Legal AI / NLP - Data We collected ~900k district court case documents from Uttar Pradesh All documents in Hindi, written in Devanagari There are legal corpora for European Court of Justice and Chinese courts, none for Indian district courts 5
  6. 6. Legal AI / NLP - Data There are around 300 different case types, table shows the prominent ones Majority of the case documents correspond to Bail Applications 6 Variation in number of case documents per district Case types in HLDC
  7. 7. Legal AI / NLP - Bail Documents 7 District-wise ratio of number of bail applications to total cases
  8. 8. Legal AI / NLP - Bail Prediction Model 8 In general, the performance is lower in district-wise settings, possibly due to large variation across districts Overall, summarization models perform better than Doc2Vec and simpler Transformer-based models
  9. 9. Legal AI / NLP for Indian Context 9 HLDC: Hindi Legal Documents Corpus
  10. 10. Legal AI / NLP for Indian Context - Takeaways Indian Legal documents are a rich a source of domain-specific Indic- language corpora, readily available online. Multiple tasks still need attention especially for Indian settings Legal Summarization Case recommendations Citation predictions / network Sleeping beauty Bias 10
  11. 11. Where to start if you are interested? Saptarshi Ghosh https://sites.google.com/site/saptarshighosh/ Ashutosh Modi https://ashutosh-modi.github.io/ Prathamesh Kalamkar https://scholar.google.com/citations?user=7RaVib0AAAAJ&hl=en&oi=ao OpenNyAI https://opennyai.org/ Ilias Chalkidis https://iliaschalkidis.github.io/ Off course, Precog J https://precog.iiit.ac.in/ 11
  12. 12. 12
  13. 13. Code-mix computationally challenging hai. Predominantly noticed in social networks and speech data Social Media text processing poses certain challenges @, #, https:// Incorrect Spelling & Romanisation Mixing two + languages – Hinglish 13 Paper at ACL Findings 2022
  14. 14. Problems / Applications Identify code mix sentences in wild Generating natural code mix sentences Downstream application in dialog systems, human computer interface Making sense of the social content can help in making choices, recommendations, civic services, etc. 14
  15. 15. Where to start if you are interested? Monojit Choudhury https://www.microsoft.com/en- us/research/people/monojitc/ Preethi Joyti https://www.cse.iitb.ac.in/~pjyothi/ Vivek Srivastava https://sites.google.com/view/vivek-srivastava/ Pushpak Bhattacharyya https://www.cse.iitb.ac.in/~pb/ Thamar Solorio http://solorio.uh.edu/ Off course, Precog J https://precog.iiit.ac.in/ 15
  16. 16. 16
  17. 17. 17 Follower - Following
  18. 18. 18 Verified handles: 71 (2014), 1,268 (2019)
  19. 19. 19
  20. 20. 20 https://precog.iiit.ac.in/blog/
  21. 21. 21
  22. 22. “I won the election” 22 https://ojs.aaai.org/index.php/ICWSM/article/view/18110/17913
  23. 23. Where to start if you are interested? Joyojeet Pal https://joyojeet.people.si.umich.edu/ Ashwin Rajadesingan https://ashwinrajadesingan.com/ Trivedi Centre for Political Data https://tcpd.ashoka.edu.in/ Off course, Precog J https://precog.iiit.ac.in/ 23
  24. 24. 24
  25. 25. 25
  26. 26. 26
  27. 27. 27
  28. 28. 28 WhatsFarzi
  29. 29. 29 https://precog.iiit.ac.in/pubs/SpotFake-IEEE_BigMM.pdf https://precog.iiit.ac.in/pubs/SpotFake_plus_AAAI.pdf https://precog.iiit.ac.in/pubs/FactDrill_ICWSM2022_final_version.pdf
  30. 30. FactDrill 30 FactDrill: A Data Repository of Fact-Checked Social Media ...https://ojs.aaai.org › index.php › ICWSM › article › view
  31. 31. FactDrill 31 22,435 fact-checked social media content 2013-2020 14 Languages https://www.shutterstock.com/search/video-text https://www.nicepng.com/ourpic/u2q8u2r5u2o0q8q8_clock-time-machine-clip-art-clipart-panda-time/ FactDrill: A Data Repository of Fact-Checked Social Media ...https://ojs.aaai.org › index.php › ICWSM › article › view https://www.clipartmax.com/middle/m2H7d3G6N4i8N4i8_data-database-server-sql-storage-icon-fleet-management-infographics/ Presence of multiple modalities
  32. 32. Where to start if you are interested? David G Rand https://davidrand-cooperation.com/ Gordon Pennycook https://gordonpennycook.com/ Neelanjan Sircar https://twitter.com/neelanjansircar Sumitra Badrinathan https://sumitrabadrinathan.github.io/ Off course, Precog J https://precog.iiit.ac.in/ 32
  33. 33. Selfie 33
  34. 34. 34
  35. 35. 35
  36. 36. 36
  37. 37. 37 https://www.facebook.com/saftiebot/
  38. 38. 38 https://goo.gl/2sIdYT
  39. 39. 2,000+ location, 400+ verified 39
  40. 40. 40 http://precog.iiitd.edu.in/blog/2016/11/killfie-journey/
  41. 41. Samsung #SafeIndia 41
  42. 42. 42
  43. 43. 43 http://bit.ly/saftie-cam
  44. 44. 44 https://precog.iiit.ac.in/pubs/camera-to-deathbed-icwsm2017.pdf ICWSM 2017
  45. 45. 45 https://precog.iiit.ac.in/research/distracted_driving/ ICWSM 2020
  46. 46. 46
  47. 47. 47 https://precog.iiit.ac.in/pubs/Effect_of_Popularity_Shocks_on_User_Behavior-CR.pdf
  48. 48. 48 https://precog.iiit.ac.in/pubs/Warning_It's_a_scam_CODS-COMAD_2023.pdf
  49. 49. Takeaways No dearth of problems to study… Many problems are resource poor, e.g. code mix is low resource, hard to collect corpus while being abundant in social networks …. Social impact & utility of our work Will be happy to discuss anything further… Looking for (including PhD) students / RAs … 49
  50. 50. 50 https://precog.iiit.ac.in/pages/publications.html
  51. 51. 51 https://precog.iiit.ac.in/
  52. 52. 52 https://forms.gle/HW2b3kpsXweuD3w98
  53. 53. 53 Thanks! Questions? pk.guru@iiit.ac.in http://precog.iiit.ac.in/ @ponguru pk.profgiri linkedin/in/ponguru

×