Download Page
The Arabic Parallel Gender Corpus

Summary

The Arabic Parallel Gender Corpus is a corpus designed to support research on gender bias in natural language processing applications working on Arabic. The corpus includes multiple parts with different features. In this release, we share the data presented in the 2019 paper on "Automatic Gender Identification and Reinflection in Arabic" by Habash et al. in the First workshop on Gender Bias in Natural Language Processing.

This resource was developed at the Computational Approaches to Modeling Language (CAMeL) Lab in New York University Abu Dhabi.

Project Team

Publications

Download

By downloading the The Arabic Parallel Gender Corpus files from HERE you agree to the terms of the license below.

//////////////////////////////////////////////////////////////////////////////
// License for The Arabic Parallel Gender Corpus
//////////////////////////////////////////////////////////////////////////////

Copyright 2019 New York University Abu Dhabi. All Rights Reserved. A license to use and copy this software, data and its documentation solely for your internal research and evaluation purposes, without fee and without a signed licensing agreement, is hereby granted upon your download of the software, through which you agree to the following: 1) the above copyright notice, this paragraph and the following three paragraphs will prominently appear in all internal copies and modifications; 2) no rights to sublicense or further distribute this software are granted; 3) no rights to modify this software are granted; and 4) no rights to assign this license are granted. Please Contact the Office of Industrial Liaison, New York University, One Park Avenue, 6th Floor, New York, NY 10016 (212) 263-8178, for commercial licensing opportunities, or for further distribution, modification or license rights.

Created by Nizar Habash and Christine Chung at the Computational Approaches to Modeling Language (CAMeL) Lab in New York University Abu Dhabi.

IN NO EVENT SHALL NYU, OR ITS EMPLOYEES, OFFICERS, AGENTS OR TRUSTEES ("COLLECTIVELY "NYU PARTIES") BE LIABLE TO ANY PARTY FOR DIRECT, INDIRECT, SPECIAL, INCIDENTAL, OR CONSEQUENTIAL DAMAGES OF ANY KIND , INCLUDING LOST PROFITS, ARISING OUT OF ANY CLAIM RESULTING FROM YOUR USE OF THIS SOFTWARE, DATA AND ITS DOCUMENTATION, EVEN IF ANY OF NYU PARTIES HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH CLAIM OR DAMAGE.

NYU SPECIFICALLY DISCLAIMS ANY WARRANTIES OF ANY KIND REGARDING THE SOFTWARE and DATA, INCLUDING, BUT NOT LIMITED TO, NON-INFRINGEMENT, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE, OR THE ACCURACY OR USEFULNESS, OR COMPLETENESS OF THE SOFTWARE. THE SOFTWARE AND ACCOMPANYING DOCUMENTATION, IF ANY, PROVIDED HEREUNDER IS PROVIDED COMPLETELY "AS IS". REGENTS HAS NO OBLIGATION TO PROVIDE FURTHER DOCUMENTATION, MAINTENANCE, SUPPORT, UPDATES, ENHANCEMENTS, OR MODIFICATIONS.

Please cite Habash et al. (2019) if you use The Arabic Parallel Gender Corpus in your research:
Habash, Nizar, Houda Bouamor, Christine Chung. 2019. Automatic Gender Identification and Reinflection in Arabic. In Proceedings of the First Workshop on Gender Bias in Natural Language Processing, Florence, Italy.

//////////////////////////////////////////////////////////////////////////////


CAMeL Lab Resources
CAMeL Lab