rishabmishrasensation/realtor-conversational-portuguese_br
收藏Hugging Face2026-04-27 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/rishabmishrasensation/realtor-conversational-portuguese_br
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: messages
list:
- name: role
dtype: string
- name: content
dtype: string
splits:
- name: train
num_examples: 1
- name: test
num_examples: 2
download_size: 0
dataset_size: 0
configs:
- config_name: default
data_files:
- split: train
path: "data/train-*.json"
- split: test
path: "data/test-*.json"
- split: valid
path: "data/valid-*.json"
license: apache-2.0
task_categories:
- text2text-generation
- text-generation
language:
- pt
pretty_name: ' Realtor Conversational (portuguese br)'
size_categories:
- 1K<n<10K
---
# **Realtor Conversational (Portuguese BR) - Realtor-Client Conversation Dataset**
## **Detailed Dataset Description**
**Introduction:**
This dataset, named "Realtor Conversational (Portuguese BR)", offers rich and detailed simulations of conversational interactions between real estate agents (realtors) and clients in Brazil. Generated using the advanced language model `gpt-4o-mini`, the data is synthetic but designed to mirror the dynamics, vocabulary, and common scenarios found in the process of buying, selling, and renting properties within the Brazilian market.
**Generation Methodology:**
The data was synthetically created with the goal of providing a robust and contextually relevant corpus for Natural Language Processing (NLP) tasks and the development of conversational models focused on the real estate sector. The generation aimed to simulate realistic dialogues, covering a wide range of pertinent topics.
**Dataset Structure:**
* **Format:** The dataset consists of multiple complete conversations.
* **Participants:** Each conversation simulates a dialogue between two defined roles: a Real Estate Agent (Realtor) and a Client.
* **Structure per Conversation:**
* Each individual conversation contains exactly **100 messages**.
* The message distribution is balanced: **50 messages attributed to the Realtor** and **50 messages attributed to the Client**.
* The average length of each complete conversation is approximately **1000 words**, ensuring depth and detail in the dialogues.
* **Context:** All interactions are strictly set within the **context of the Brazilian real estate market**. This includes references to:
* Specific legislation (e.g., Tenancy Law [Lei do Inquilinato], Civil Code applied to real estate).
* Economic conditions (SELIC interest rates, inflation indices - IPCA/IGP-M).
* Financial and banking institutions operating in Brazil.
* Civil construction practices and common architectural features in the country.
* Dynamics and regional specificities of the Brazilian real estate market.
**Main Topics Covered:**
The conversations were carefully crafted to cover a broad and relevant spectrum of topics within the Brazilian real estate domain. The 20 main thematic axes include (but are not limited to):
1. **Real Estate Financing:** Detailed process, types/modalities (SBPE, FGTS, Minha Casa Minha Vida), rates (nominal and effective interest - CET), amortization systems (SAC, Price), rules for using FGTS (a Brazilian severance indemnity fund), down payment calculation, credit analysis, required documentation, comparisons between banks, and mortgage portability.
2. **Property Rentals (Leasing):** Contract drafting and analysis, application of the Tenancy Law (Lei 8.245/91), types of rental guarantees (guarantor, surety bond insurance, security deposit, capitalization bond), rent adjustment indices and rules (IGPM/IPCA), inspection processes (move-in/move-out), rights and duties of landlords and tenants, property search strategies, specifics of short-term/vacation rentals, and the use of digital platforms.
3. **Buying and Selling Process:** Transaction stages (from offer to deed), complete documentation (buyer, seller, property), types of contracts (promissory/commitment to buy and sell, public deed), breakdown of costs (ITBI [Property Transfer Tax], notary fees, registration), negotiation techniques, and the role of the realtor versus direct sales.
4. **Real Estate Investment:** Strategies (passive income from rent, appreciation and flipping), types (physical properties, Real Estate Investment Funds - FIIs, real estate crowdfunding), profitability analysis (Cap Rate, ROI), risk assessment, portfolio diversification, and comparison with other financial asset classes.
5. **Legal and Juridical Aspects:** Interpretation of relevant legislation (Civil Code, Tenancy Law, Urban Zoning Laws, Condominium Law), property regularization processes (obtaining Certificate of Occupancy [Habite-se], adverse possession [usucapião], REURB - Urban Land Regularization), analysis of contract clauses, resolution of legal disputes, identification and handling of latent/hidden defects, and legal responsibilities of the parties.
6. **Property Valuation and Appreciation:** Technical valuation methods (Direct Market Data Comparison, Income Approach, etc.), factors influencing value (location, condition, market trends), preparation of technical appraisal reports, impact of renovations and improvements on value, and market appreciation trend analysis.
7. **Preventive Maintenance and Home Repairs:** Establishing maintenance routines, diagnosing common problems (leaks, seepage, electrical issues, plumbing problems), evaluating DIY repairs versus hiring specialized professionals, managing unexpected costs, and maintenance checklists.
8. **Neighborhood Choice and Lifestyle Fit:** Objective and subjective selection criteria (public safety, transport access, school quality, leisure and commerce options, green spaces), analysis of neighborhood profile, walkability concepts, noise levels, and field research.
9. **Emotional and Psychological Aspects of Housing:** The cultural significance of the "dream of homeownership," managing stress and anxiety during the process, common fears (debt, making a bad deal), impact on family relationships, cognitive biases in decision-making, post-move adaptation process, and building an affective relationship with the home.
10. **Condominium Management:** Roles and responsibilities of the building manager (resident or professional - *síndico*) and the administration company, financial management (collection of condo fees, delinquency control, budget preparation), conducting condo/homeowner meetings (assemblies), application of rules (condominium charter, internal regulations), and maintenance of common areas.
11. **Property Management and Rental Administration:** Services offered by real estate agencies and administration companies, fee structures, control of financial receipts and transfers, management of contracts and tenant relationships, coordination of maintenance for rented properties, and use of real estate management software.
12. **Taxation in the Real Estate Market:** Details of taxes levied on transactions and ownership (ITBI - Property Transfer Tax, Capital Gains Tax - IR, ITCMD - Inheritance and Donation Tax, IPTU - Urban Property Tax), calculation methods, exemption possibilities, and rules for declaration in Income Tax returns (Individual and Corporate).
13. **Essential Real Estate Documentation:** Explanation of the importance and process of obtaining crucial documents such as the updated Property Registration Certificate (Matrícula do Imóvel), Public Deed of Purchase and Sale, Clearance Certificates (Negative Certificates - for liens, debts), and Contracts (Purchase and Sale, Lease).
14. **Comparison between Acquisition/Housing Modalities:** In-depth analysis of the advantages and disadvantages between Real Estate Financing, Real Estate Consortium (a specific Brazilian group purchasing model), Renting, and Outright Purchase, considering factors like total costs, terms, flexibility, risks, and suitability to the consumer's financial and life profile.
15. **Use of FGTS in the Real Estate Sector:** Detailed rules and possibilities for using the balance of the Severance Indemnity Fund (FGTS) for purchasing residential property, paying part of the down payment, amortizing the outstanding balance of financing, or paying installments.
16. **Amortization Systems (SAC vs. Price):** Detailed explanation of how the Constant Amortization System (SAC) and the Price Table system work, with comparisons of installment value evolution over time and the impact on the total cost of financing.
17. **Credit Analysis and Financing Approval:** Criteria used by financial institutions to approve real estate credit, importance of the credit score (Serasa, Boa Vista), methods of income verification (formal and informal), and calculation of payment capacity (debt-to-income ratio).
18. **Common Problems in Properties:** Identification, diagnosis, and suggested solutions for frequent issues such as infiltration/seepage, leaks (plumbing), electrical faults, structural issues (cracks, fissures), urban pest control, and other construction or maintenance defects.
19. **Real Estate Market (Trends and Projections):** Analysis of market cycles (boom, bust, stability), projections for interest rates and real estate credit, impact of macroeconomic factors (GDP, employment, inflation), and discussion of emerging trends (new property typologies, appreciation of specific locations, focus on sustainability).
20. **Rights and Duties of the Parties:** Clarification of the legal obligations and rights of each party involved in real estate transactions and relationships: buyers, sellers, landlords, tenants, and condominium owners.
**Potential Applications:**
This dataset is a valuable resource for:
* Training and fine-tuning Large Language Models (LLMs) and chatbots specialized in the real estate sector.
* Research in Natural Language Processing (NLP), such as sentiment analysis, intent recognition, information extraction, and topic modeling.
* Development of dialogue systems and virtual assistants for realtors and real estate agencies.
* Scenario simulation for training real estate agents.
* Analysis of communication and negotiation patterns in the Brazilian real estate market.
**Language:**
Brazilian Portuguese (pt-BR).
**Considerations:**
As a synthetic dataset, although designed to be realistic, it might not capture 100% of the nuances, slang, or extreme regional variations of real human communication. However, it provides a solid and structured foundation for research and development.
## **Citation**
If you use this dataset in your research or work, please cite it as follows:
```bibtex
@misc{tlrrealtor_conversational_ptbr_2025,
author = {Rodrigues, Thiago Luiz},
title = {{Realtor Conversational (Portuguese BR) - Realtor-Client Conversation Dataset}},
month = {April},
year = {2025},
howpublished = {https://huggingface.co/datasets/bobboyms/realtor-conversational-portuguese_br},
note = {Dataset of synthetic conversations simulating interactions between realtors and clients in the Brazilian real estate market.}
}
```
提供机构:
rishabmishrasensation



