mirror of
https://github.com/raspberrypi/linux.git
synced 2026-01-02 07:43:34 +00:00
hugetlb: don't delete vma_lock in hugetlb MADV_DONTNEED processing
commit04ada095dcupstream. madvise(MADV_DONTNEED) ends up calling zap_page_range() to clear page tables associated with the address range. For hugetlb vmas, zap_page_range will call __unmap_hugepage_range_final. However, __unmap_hugepage_range_final assumes the passed vma is about to be removed and deletes the vma_lock to prevent pmd sharing as the vma is on the way out. In the case of madvise(MADV_DONTNEED) the vma remains, but the missing vma_lock prevents pmd sharing and could potentially lead to issues with truncation/fault races. This issue was originally reported here [1] as a BUG triggered in page_try_dup_anon_rmap. Prior to the introduction of the hugetlb vma_lock, __unmap_hugepage_range_final cleared the VM_MAYSHARE flag to prevent pmd sharing. Subsequent faults on this vma were confused as VM_MAYSHARE indicates a sharable vma, but was not set so page_mapping was not set in new pages added to the page table. This resulted in pages that appeared anonymous in a VM_SHARED vma and triggered the BUG. Address issue by adding a new zap flag ZAP_FLAG_UNMAP to indicate an unmap call from unmap_vmas(). This is used to indicate the 'final' unmapping of a hugetlb vma. When called via MADV_DONTNEED, this flag is not set and the vm_lock is not deleted. NOTE - Prior to the introduction of the huegtlb vma_lock in v6.1, this issue is addressed by not clearing the VM_MAYSHARE flag when __unmap_hugepage_range_final is called in the MADV_DONTNEED case. [1] https://lore.kernel.org/lkml/CAO4mrfdLMXsao9RF4fUE8-Wfde8xmjsKrTNMNC9wjUb6JudD0g@mail.gmail.com/ Link: https://lkml.kernel.org/r/20221114235507.294320-3-mike.kravetz@oracle.com Fixes:90e7e7f5ef("mm: enable MADV_DONTNEED for hugetlb mappings") Signed-off-by: Mike Kravetz <mike.kravetz@oracle.com> Reported-by: Wei Chen <harperchen1110@gmail.com> Cc: Axel Rasmussen <axelrasmussen@google.com> Cc: David Hildenbrand <david@redhat.com> Cc: Matthew Wilcox <willy@infradead.org> Cc: Mina Almasry <almasrymina@google.com> Cc: Nadav Amit <nadav.amit@gmail.com> Cc: Naoya Horiguchi <naoya.horiguchi@linux.dev> Cc: Peter Xu <peterx@redhat.com> Cc: Rik van Riel <riel@surriel.com> Cc: Vlastimil Babka <vbabka@suse.cz> Cc: <stable@vger.kernel.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: Mike Kravetz <mike.kravetz@oracle.com> Signed-off-by: Sasha Levin <sashal@kernel.org>
This commit is contained in:
committed by
Greg Kroah-Hartman
parent
5450535901
commit
bb8f66f6af
25
mm/hugetlb.c
25
mm/hugetlb.c
@@ -5145,17 +5145,20 @@ void __unmap_hugepage_range_final(struct mmu_gather *tlb,
|
||||
{
|
||||
__unmap_hugepage_range(tlb, vma, start, end, ref_page, zap_flags);
|
||||
|
||||
/*
|
||||
* Clear this flag so that x86's huge_pmd_share page_table_shareable
|
||||
* test will fail on a vma being torn down, and not grab a page table
|
||||
* on its way out. We're lucky that the flag has such an appropriate
|
||||
* name, and can in fact be safely cleared here. We could clear it
|
||||
* before the __unmap_hugepage_range above, but all that's necessary
|
||||
* is to clear it before releasing the i_mmap_rwsem. This works
|
||||
* because in the context this is called, the VMA is about to be
|
||||
* destroyed and the i_mmap_rwsem is held.
|
||||
*/
|
||||
vma->vm_flags &= ~VM_MAYSHARE;
|
||||
if (zap_flags & ZAP_FLAG_UNMAP) { /* final unmap */
|
||||
/*
|
||||
* Clear this flag so that x86's huge_pmd_share
|
||||
* page_table_shareable test will fail on a vma being torn
|
||||
* down, and not grab a page table on its way out. We're lucky
|
||||
* that the flag has such an appropriate name, and can in fact
|
||||
* be safely cleared here. We could clear it before the
|
||||
* __unmap_hugepage_range above, but all that's necessary
|
||||
* is to clear it before releasing the i_mmap_rwsem. This works
|
||||
* because in the context this is called, the VMA is about to
|
||||
* be destroyed and the i_mmap_rwsem is held.
|
||||
*/
|
||||
vma->vm_flags &= ~VM_MAYSHARE;
|
||||
}
|
||||
}
|
||||
|
||||
void unmap_hugepage_range(struct vm_area_struct *vma, unsigned long start,
|
||||
|
||||
Reference in New Issue
Block a user