Data Science and Artificial Intelligence

The research group has over 25 years of experience in data science and artificial intelligence. They have made fundamental contributions in several areas, including content-based image retrieval, statistical modeling and machine learning based image tagging, image clustering, and statistial clustering of discrete distributions. They have also developed some earlier methods for big data driven weather prediction and disaster simulation.

The following is an archive of our publications in this area. They are in reverse chronical order.

    Image Tagging and Classification
  1. Yukun Tian, Yiming Lei, Junping Zhang and James Z. Wang, ``PaDNet: Pan-Density Crowd Counting,'' IEEE Transactions on Image Processing, vol. 29, no. 3, pp. 2714-2727, 2020. (download) (g-scholar)
  2. Qi Zhou, Junping Zhang, Lingfu Che, Hongming Shan and James Z. Wang, ``Crowd Counting with Limited Labeling through Submodular Frame Selection,'' IEEE Transactions on Intelligent Transportation Systems, vol. 20, no. 5, pp. 1728-1738, 2019. (download) (g-scholar)
  3. Jianbo Ye, Panruo Wu, James Z. Wang and Jia Li, ``Fast Discrete Distribution Clustering Using Wasserstein Barycenter with Sparse Support,'' IEEE Transactions on Signal Processing, vol. 65, no. 9, pp. 2317-2332, 2017. [A version was posted in Sep 2015 at http://arxiv.org/abs/1510.00012] (download) (g-scholar)
  4. Jianbo Ye, James Z. Wang and Jia Li, ``A Simulated Annealing Based Inexact Oracle for Wasserstein Loss Minimization,'' Proceedings of the International Conference on Machine Learning, vol. PMLR 70, 3940-3948, Sydney, Australia, August 2017. (download) (g-scholar)
  5. Yu Zhang, James Z. Wang and Jia Li, ``Parallel Massive Clustering of Discrete Distributions,'' ACM Transactions on Multimedia Computing, Communications and Applications, vol. 11, no. 4, article 49, pp. 49:1-24 and appendix:1-6, April 2015. [A version was posted in Feb 2013 at http://arxiv.org/abs/1302.0435] (download) (g-scholar)
  6. Neela Sawant, James Z. Wang and Jia Li, ``Enhancing Training Collections for Image Annotation: An Instance-Weighted Mixture Modeling Approach,'' IEEE Transactions on Image Processing, vol. 22, no. 9, pp. 3562-3577, 2013. (download) (g-scholar)
  7. Neela Sawant, Jia Li and James Z. Wang, ``Automatic Image Semantic Interpretation using Social Action and Tagging Data,'' Multimedia Tools and Applications, Special Issue on Survey Papers in Multimedia by World Experts, vol. 51, no. 1, pp. 213-246, Springer, 2011. [invited] (download) (g-scholar)
  8. Neela Sawant, Ritendra Datta, Jia Li and James Z. Wang, ``Quest for Relevant Tags using Local Interaction Networks and Visual Content,'' Proceedings of the ACM International Conference on Multimedia Information Retrieval, Special Session on Statistical Modeling and Learning for Multimedia, pp. 231-240, Philadelphia, Pennsylvania, ACM, March 2010. (download) (g-scholar)
  9. Jia Li and James Z. Wang, ``Real-time Computerized Annotation of Pictures,'' IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 30, no. 6, pp. 985-1002, 2008. [An abstract was published in Proc. ACM Multimedia, 2006] (download) (g-scholar)
  10. Ritendra Datta, Dhiraj Joshi, Jia Li and James Z. Wang, ``Tagging over Time: Real-world Image Annotation by Lightweight Meta-learning,'' Proceedings of the ACM Multimedia Conference, pp. 393-402, ACM, Augsburg, Germany, September 2007. (download) (g-scholar)
  11. Jia Li and James Z. Wang, ``Real-time Computerized Annotation of Pictures,'' Proceedings of the ACM Multimedia Conference, pp. 911-920, ACM, Santa Barbara, CA, October 2006. (download) (g-scholar)
  12. Jia Li and James Z. Wang, ``The Automatic Linguistic Indexing of Pictures System,'' Proceedings of the International Conference on Computer Vision and Pattern Recognition, Demonstration, vol. II, pp. 1208-1209, San Diego, CA, IEEE, June 2005. (download) (g-scholar)
  13. Yixin Chen, Jia Li and James Z. Wang, Machine Learning and Statistical Modeling Approaches to Image Retrieval, monograph, Kluwer Academic Publishers, 200 pages, Dordrecht, June 2004. (download) (g-scholar)
  14. Jia Li and James Z. Wang, ``Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach,'' IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 25, no. 9, pp. 1075-1088, 2003. [An abstract was published in Proc. ACM Multimedia, 2002] (download) (g-scholar)
  15. James Z. Wang, Jia Li and Sui Ching Lin, ``Evaluation Strategies for Automatic Linguistic Indexing of Pictures,'' Proceedings of the IEEE International Conference on Image Processing (ICIP), vol. 3, pp. 617-620, Barcelona, Spain, IEEE, September 2003. [invited but peer-reviewed] (download) (g-scholar)
  16. James Z. Wang and Jia Li, ``Learning-Based Linguistic Indexing of Pictures with 2-D MHMMs,'' Proceedings of the ACM Multimedia Conference, pp. 436-445, Juan Les Pins, France, ACM, December 2002. (download) (g-scholar)
  17. James Z. Wang, Jia Li, Robert M. Gray and Gio Wiederhold, ``Unsupervised Multiresolution Segmentation for Images with Low Depth of Field,'' IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 23, no. 1, pp. 85-90, 2001. [An abstract was published in Proc. IEEE ICIAP, 1999] (download) (g-scholar)
  18. Jia Li, James Z. Wang and Gio Wiederhold, ``Classification of Textured and Non-Textured Images Using Region Segmentation,'' Proceedings of the IEEE International Conference on Image Processing (ICIP), Vancouver, BC, Canada, pp. 754-757, IEEE, September 2000. (download) (g-scholar)
  19. Jia Li, James Z. Wang, Robert M.Gray and Gio Wiederhold, ``Multiresolution Object-of-Interest Detection for Images with Low Depth of Field,'' Proceedings of the IEEE International Conference on Image Analysis and Processing (ICIAP), pp. 32-37, Venice, Italy, IEEE, 1999. (download) (g-scholar)
  20. James Z. Wang and Martin A. Fischler, ``Visual Similarity, Judgmental Certainty and Stereo Correspondence,'' Proceedings of the DARPA Image Understanding Workshop, George Lukes, (ed.), vol. 2, pp. 1237-1248, Monterey, CA, Morgan Kaufmann Publishers, November 1998. (download) (g-scholar)
  21. Image Retrieval
  22. Ritendra Datta, Dhiraj Joshi, Jia Li and James Z. Wang, ``Image Retrieval: Ideas, Influences, and Trends of the New Age,'' ACM Computing Surveys, vol. 40, no. 2, article 5, pp. 5:1-60, April 2008. [An abstract was published in Proc. ACM Multimedia Information Retrieval, 2005] (download) (g-scholar)
  23. Ritendra Datta, Weina Ge, Jia Li and James Z. Wang, ``Toward Bridging the Annotation-Retrieval Gap in Image Search,'' IEEE MultiMedia, vol. 14, no. 3, pp. 24-35, 2007. [An abstract was published in Proc. ACM Multimedia, 2006] (download) (g-scholar)
  24. Ritendra Datta, Weina Ge, Jia Li and James Z. Wang, ``Toward Bridging the Annotation-Retrieval Gap in Image Search by a Generative Modeling Approach,'' Proceedings of the ACM Multimedia Conference, pp. 977-986, ACM, Santa Barbara, CA, October 2006. (download) (g-scholar)
  25. Dhiraj Joshi, Ritendra Datta, Ziming Zhuang, WP Weiss, Marc Friedenberg, James Z. Wang and Jia Li, ``PARAgrab: A Comprehensive Architecture for Web Image Management and Multimodal Querying,'' Proceedings of the International Conference on Very Large Data Bases, Demonstration, pp. 1163-1166, Seoul, Korea, September 2006. (download) (g-scholar)
  26. Ritendra Datta, Jia Li and James Z. Wang, ``Content-Based Image Retrieval - Approaches and Trends of the New Age,'' Proceedings of the 7th ACM International Workshop on Multimedia Information Retrieval, in conjunction with ACM International Conference on Multimedia, pp. 253-262, Singapore, ACM, November 2005. (download) (g-scholar)
  27. Xiaonan Lu, Jia Li and James Z. Wang, ``Learning Representative Objects from Images Using Quadratic Optimization,'' Proceedings of the Second International Conference on Machine Intelligence, co-located with the UN World Summit on the Information Society, invited for a special session, pp. 730-737, Tozeur, Tunisia, ACIDCA, November 2005. [invited] (download) (g-scholar)
  28. Ashish Parulekar, Ritendra Datta, Jia Li and James Z. Wang, ``Large-scale Satellite Image Browsing using Automatic Semantic Categorization and Content-based Retrieval,'' Proceedings of the IEEE International Workshop on Semantic Knowledge in Computer Vision, in conjunction with IEEE International Conference on Computer Vision, 8 pages, abstract on pp. 1873 of Proc. ICCV Workshops, Beijing, China, IEEE, October 2005. (download) (g-scholar)
  29. Yixin Chen and James Z. Wang, ``Looking Beyond Region Boundaries: A Robust Image Similarity Measure Using Fuzzified Region Features,'' Proceedings of the IEEE International Conference on Fuzzy Systems, pp. 1165-1170, St. Louis, MO, 2003. [invited but peer-reviewed] (download) (g-scholar)
  30. Yixin Chen and James Z. Wang, ``A Region-Based Fuzzy Feature Matching Approach to Content-Based Image Retrieval,'' IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 24, no. 9, pp. 1252-1267, 2002. [An abstract was published in Proc. ACM Multimedia, 2001] (download) (g-scholar)
  31. James Z. Wang, Jia Li and Gio Wiederhold, ``SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture Libraries,'' IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 23, no. 9, pp. 947-963, 2001. [An abstract was published in D-LIB, 1999] (download) (g-scholar)
  32. Yanping Du and James Z. Wang ``A Scalable Integrated Region-Based Image Retrieval System,'' Proceedings of the IEEE International Conference on Image Processing (ICIP), Thessaloniki, Greece, pp. 22-25, IEEE, October 2001. (download) (g-scholar)
  33. Yixin Chen, James Z. Wang and Jia Li, ``FIRM: Fuzzily Integrated Region Matching for Content-Based Image Retrieval,'' Proceedings of the ACM Multimedia Conference, pp. 543-545, Ottawa, Canada, ACM, September 2001. (download) (g-scholar)
  34. James Z. Wang and Yanping Du, ``RF*IPF: A Weighting Scheme for Multimedia Information Retrieval,'' Proceedings of the IEEE International Conference on Image Analysis and Processing (ICIAP), pp. 380-385, Palermo, Italy, IEEE, 2001. (download) (g-scholar)
  35. James Z. Wang and Yanping Du, ``Scalable Integrated Region-Based Image Retrieval Using IRM and Statistical Clustering,'' Proceedings of the ACM and IEEE Joint Conference on Digital Libraries, pp. 268-277, Roanoke, VA, ACM, June 2001. (download) (g-scholar)
  36. James Z. Wang, Integrated Region-Based Image Retrieval, monograph, Kluwer Academic Publishers, 190 pages, Dordrecht, 2001. (download) (g-scholar)
  37. Yixin Chen and James Z. Wang, ``Looking Beyond Region Boundaries: Region-Based Image Retrieval Using Fuzzy Feature Matching,'' Proceedings of the Multimedia Content-Based Indexing and Retrieval Workshop, INRIA Rocquencourt, France, pp. 37-40, September 2001. (download) (g-scholar)
  38. James Z. Wang, Jia Li and Gio Wiederhold, ``SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture Libraries,'' Lecture Notes in Computer Science, Advances in visual information systems, Lyon, France, Robert Laurini (ed.), vol. 1929, pp. 360-371, Springer-Verlag, November 2000. [Long version of this paper is published in IEEE Trans. PAMI.] (download) (g-scholar)
  39. James Z. Wang, ``SIMPLIcity: A Region-Based Image Retrieval System for Picture Libraries and Biomedical Image Databases,'' Proceedings of the ACM Multimedia Conference, pp. 483-484, Los Angeles, CA, ACM, October 2000. (download) (g-scholar)
  40. Jia Li, James Z. Wang and Gio Wiederhold, ``IRM: Integrated Region Matching for Image Retrieval,'' Proceedings of the ACM Multimedia Conference, pp. 147-156, Los Angeles, CA, ACM, October 2000. (download) (g-scholar)
  41. James Z. Wang, Semantics-Sensitive Integrated Matching for Picture Libraries and Biomedical Image Databases, Stanford University Ph.D. Thesis, UMI Publisher, 221 pages, August 2000. [Thesis. Copyright held by the author.] (download) (g-scholar)
  42. James Z. Wang, Jia Li, Desmond Chan and Gio Wiederhold, ``Semantics-Sensitive Retrieval for Digital Picture Libraries,'' D-LIB Magazine, vol. 5, no. 11, DOI: 10.1045/november 99-wang, CNRI, November 1999. http://www.dlib.org [Invited but peer-reviewed. Copyright held by authors.] (download) (g-scholar)
  43. James Z. Wang, Gio Wiederhold, Oscar Firschein and Sha Xin Wei, ``Content-Based Image Indexing and Searching Using Daubechies' Wavelets,'' International Journal on Digital Libraries, vol. 1, no. 4, pp. 311-328, Springer-Verlag, 1998. [An abstract was presented in ADL 1997 and was selected as one of the best papers] (download) (g-scholar)
  44. Edward Chang, James Z. Wang, Chen Li and Gio Wiederhold, ``RIME: A Replicated Image Detector for the World-Wide Web,'' Proceedings of SPIE, vol. 3527, pp. 58-67, Boston, MA, November 1998. (download) (g-scholar)
  45. James Z. Wang, Gio Wiederhold, Oscar Firschein and Sha Xin Wei, ``Wavelet-Based Image Indexing Techniques With Partial Sketch Retrieval Capability,'' Proceedings of the IEEE Forum on Research and Technology Advances in Digital Libraries (ADL'97), pp. 13-24, Washington, D.C., IEEE, May 1997. (download) (g-scholar)
  46. Image Categorization and Clustering
  47. Yixin Chen, Jinbo Bi and James Z. Wang, ``MILES: Multiple-Instance Learning via Embedded Instance Selection,'' IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 28, no. 12, pp. 1931-1947, 2006. [An abstract was published in Proc. CVPR, 2005] (download) (g-scholar)
  48. Yixin Chen, James Z. Wang and Robert Krovetz, ``CLUE: Cluster-based Retrieval of Images by Unsupervised Learning,'' IEEE Transactions on Image Processing, vol. 14, no. 8, pp. 1187-1201, 2005. [An abstract was published in Proc. IEEE International Symposium on Signal Processing and its Applications, 2003] (download) (g-scholar)
  49. Jinbo Bi, Yixin Chen and James Z. Wang, ``A Sparse Support Vector Machine Approach to Region-Based Image Categorization,'' Proceedings of the International Conference on Computer Vision and Pattern Recognition, vol. I, pp. 1121-1128, San Diego, CA, IEEE, June 2005. (download) (g-scholar)
  50. Yixin Chen and James Z. Wang, ``Image Categorization by Learning and Reasoning with Regions,'' Journal of Machine Learning Research, vol. 5, 913-939, August 2004. (download) (g-scholar)
  51. Yixin Chen and James Z. Wang, ``Support Vector Learning for Fuzzy Rule-Based Classification Systems,'' IEEE Transactions on Fuzzy Systems, vol. 11, no. 6, pp. 716-728, 2003. [An abstract was published in Proceedings of the IEEE Fuzzy Systems, 2003] (download) (g-scholar)
  52. Yixin Chen, James Z. Wang and Robert Krovetz, ``An Unsupervised Learning Approach to Content-Based Image Retrieval,'' Proceedings of the IEEE International Symposium on Signal Processing and its Applications, vol. 1, pp. 197-200, Paris, France, July 2003. [invited but peer-reviewed] (download) (g-scholar)
  53. Yixin Chen and James Z. Wang, ``Kernel Machines and Additive Fuzzy Systems: Classification and Function Approximation,'' Proceedings of the IEEE International Conference on Fuzzy Systems, pp. 789-795, St. Louis, MO, 2003. (download) (g-scholar)
  54. Yixin Chen, James Z. Wang and Robert Krovetz, ``Content-Based Image Retrieval by Clustering,'' Proceedings of the 5th ACM International Workshop on Multimedia Information Retrieval, in conjunction with ACM Multimedia, pp. 193-200, Berkeley, CA, ACM, November 2003. (download) (g-scholar)
  55. Edward Chang, Chen Li, James Z. Wang, Peter Mork and Gio Wiederhold, ``Searching Near-Replicas of Images Via Clustering,'' Proceedings of SPIE, vol. 3846, pp. 281-292, Boston, MA, September 1999. (download) (g-scholar)
  56. Weather and Remote Sensing
  57. Xinye Zheng, Jianbo Ye, Yukun Chen, Stephen Wistar, Jia Li, Jose A. Piedra-Fernandez, Michael A. Steinberg and James Z. Wang, ``Detecting Comma-shaped Clouds for Severe Weather Forecasting using Shape and Motion,'' IEEE Transactions on Geoscience and Remote Sensing, vol. 57, no. 6, pp. 3788-3801, 2019. [A version was posted in February 2018 at http://arxiv.org/abs/1802.08937] (download) (g-scholar)
  58. Mohammad M. Kamani, Sadegh Farhang, Mehrdad Mahdavi and James Z. Wang, ``Targeted Meta-Learning for Critical Incident Detection in Weather Data,'' Proceedings of the Workshop at the International Conference on Machine Learning, Climate Change: How Can AI Help?, 4 pages, Long Beach, California, June 2019. (download) (g-scholar)
  59. Mohammad Mahdi Kamani, Farshid Farhat, Stephen Wistar and James Z. Wang, ``Skeleton Matching with Applications in Severe Weather Detection,'' Applied Soft Computing, vol. 70, pp. 1154-1166, Elsevier, 2018. (download) (g-scholar)
  60. Yu Zhang, Stephen Wistar, Jia Li, Michael A. Steinberg and James Z. Wang, ``Severe Thunderstorm Detection by Visual Learning Using Satellite Images,'' IEEE Transactions on Geoscience and Remote Sensing, vol. 55, no. 2, pp. 1039-1052, 2017. [A version was posted in March 2016 at http://arxiv.org/abs/1603.00146] (download) (g-scholar)
  61. Moises Espinola, Jose A. Piedra-Fernandez, Rosa Ayala, Luis Iribarne, Saturnino Leguizamon and James Z. Wang, ``Simulating Rainfall, Water Evaporation and Groundwater Flow in Three-Dimensional Satellite Images with Cellular Automata,'' Simulation Modelling Practice and Theory, vol. 67, pp. 89-99, Elsevier, 2016. (download) (g-scholar)
  62. Mohammad Mahdi Kamani, Farshid Farhat, Stephen Wistar and James Z. Wang, ``Shape Matching using Skeleton Context for Automated Bow Echo Detection,'' Proceedings of the IEEE Big Data Conference, pp. 901-908, Washington, D.C., December 2016. (download) (g-scholar)
  63. Moises Espinola, Jose A. Piedra-Fernandez, Rosa Ayala, Luis Iribarne and James Z. Wang, ``Contextual and Hierarchical Classification of Satellite Images Based on Cellular Automata,'' IEEE Transactions on Geoscience and Remote Sensing, vol. 53, no. 2, pp. 795-809, 2015. (download) (g-scholar)
  64. Jose A. Piedra-Fernandez, Gloria Ortega, James Z. Wang and M. Canton-Garbin, ``Fuzzy Content-Based Image Retrieval for Oceanic Remote Sensing,'' IEEE Transactions on Geoscience and Remote Sensing, vol. 52, no. 9, pp. 5422-5431, 2014. (download) (g-scholar)
  65. Yu Zhang, Stephen Wistar, Jose A. Piedra-Fernandez, Jia Li, Michael A. Steinberg and James Z. Wang, ``Locating Visual Storm Signatures from Satellite Images,'' Proceedings of the IEEE Big Data Conference, pp. 711-720, Washington, D.C., October 2014. (download) (g-scholar)
  66. Qingyong Li, Weitao Lu, Jun Yang and James Z. Wang, ``Thin Cloud Detection of All-Sky Images Using Markov Random Fields,'' IEEE Geoscience and Remote Sensing Letters, vol. 9, no. 3, pp. 417-421, 2012. [10.1109/LGRS.2011.2170953] (download) (g-scholar)
  67. Jose A. Piedra-Fernandez, M. Canton-Garbin and James Z. Wang, ``Feature Selection in AVHRR Ocean Satellite Images by Means of Filter Methods,'' IEEE Transactions on Geoscience and Remote Sensing, vol. 48, no. 12, pp. 4193-4203, December 2010. (download) (g-scholar)

In the midst of difficulty lies opportunity. - Albert Einstein

COPYRIGHT © 2000- James Z. Wang Research Group, Penn State, University Park .